Reference

Everything the Whisper MCP server exposes: 18 tools (10 read-only + 8 write), the evidence model returned with every result, query format/pagination, the error model, and the read-only query-safety rules.

Updated July 2026MCP

Reference for everything the Whisper MCP server exposes: 18 tools, the resources and prompts behind them, the evidence model returned with every result, and example questions agents can answer. For client-by-client install instructions, see the Setup guide.

The graph behind the connector holds ~7.43 billion nodes, ~39.26 billion edges, and ~7.53 million threat-intel edges (43 feed sources, 25 categories) across 39 node labels and 45 materialized edge types. These numbers drift as the graph ingests — the whisper://stats resource is the live source of truth.

What's exposed

The connector advertises 18 tools: 10 read-only and 8 write. The read tools are annotated readOnly:true, destructive:false; the eight write tools are honestly annotated readOnly:false, destructive:true. The free-form query path is provably read-only — write and admin Cypher is rejected before it reaches the database — and the write surface lives entirely in dedicated, narrow tools (whisper_agents, the watch_* subscription tools, and the submit_* contribution tools), never in query. The schema and the docs are discovered on demand rather than dumped into the model's context up front.

Read tools (10)

Tool	Purpose
`query`	Primary Cypher tool — runs an arbitrary read-only Cypher query against the graph and returns `columns`, `rows`, `statistics`, and an `evidence` block. Supports `format` (`table` / `graph` / `compact`) and count-first `page` / `pageSize` pagination. See The `query` tool
`explain_indicator`	Threat assessment for an IP / IPv6 / hostname / CIDR / ASN — composite score, level (NONE…CRITICAL), factors, sources, plus a `coverage` block (granularity, `sharedHost`, advisories). Each verdict is tagged `source: live-explain \| node-cache \| unavailable` — see Evidence & provenance
`explain_schema`	The schema, on demand. No argument → the label catalogue (label + count + scale). With a `label` → that entity's properties, inbound/outbound edges (with directions), a runnable sample traversal, and notes. See Schema introspection
`read_docs`	Pulls the Whisper docs (`whisper.security/docs/**`) on demand: list, search, or fetch one page as Markdown. Keeps the Cypher reference and cookbook out of the always-on context
`list_workflows`	Searches the shared workflow & recipe gallery and returns each item's summary plus its full parameter space — see the Workflow & recipe gallery
`run_workflow`	Runs one or more gallery workflows by slug in a single call (with optional `input` / `params`), returning chained per-step results, derived signals, and an `evidence` trail. One call replaces several `query` round-trips — see the gallery
`identify`	Who runs a set of hostnames — vendor & role attribution (not a threat verdict). `hosts: string[]` (≤256); one row per host with `vendor_id`, `canonical_name`, `confidence`, `category`, `roles`, `band`, `host_class`, `evidence`. A novel host degrades to `band:"UNKNOWN"`, never failing the batch
`assess`	Coverage-qualified threat verdict for hostnames. `hosts: string[]` (≤256); one row per host with `label`, `band`, `sub_labels`, `signals`, `coverage`, `evidence`. Gate on `coverage` (`known-clean` / `no-data` / `structural-only`) — a `no-data` row is populated, never empty, and is not benign
`walk`	Structural neighbourhood of a (possibly novel) host — nearest known vendors and siblings. `host` (required), `depth` (1–6, default 2), `budget_ms` (per-arm, default 3000). On budget expiry returns partial data with `arms.deadline_hit:true`, never an error
`verify_identity`	Keyless verification of a whisper.online agent identity — give it an IPv6 `/128` or an agent FQDN and get the server-side verification chain (PTR, forward-confirmed AAAA, DNSSEC-anchored DANE/TLSA pin, signed identity doc) plus the public RDAP object. Works without any API key

Write tools (8)

Every write tool is honestly annotated readOnly:false, destructive:true. They are narrow, per-action tools (never one fat runtime-discriminated tool), and every write op is audited. The whisper.online control plane is API-key-scoped; the Indenture watch_* / submit_* tools need a real tenant key (anonymous sessions are refused).

Tool	Purpose
`whisper_agents`	The whisper.online agent control plane — provision and govern agents that hold routable IPv6 `/128` identities under your own API key. Ops: `register`, `identity`, `connect` (egress: HTTPS CONNECT proxy / SOCKS5 / WireGuard), `list`, `agent`, `policy`, `logs`, `lookups`, `revoke`, `domain` (bring-your-own-domain). The caller's key is forwarded, so every op runs under that tenant and its scopes
`watch_query`	Subscribe to a stored read-only Cypher query, re-evaluated as the graph mutates. `query` (stored opaque, not run through the validator), `deliver?` (PULL by default; URL push is PRO+). Needs a real tenant key
`watch_verdict`	Subscribe to a host's verdict change. `host` (required), `deliver?`. Same tier gate
`watch_indicator`	Subscribe to an indicator hash. `hash` (required), `deliver?`. Same tier gate
`watch_list`	List your subscriptions (owner-isolated). Returns `subscription_id`, `kind`, `delivery_mode`, `status`, `created_epoch`
`watch_cancel`	Cancel one of your subscriptions. `subscription_id` (required). Owner-isolated
`submit_indicator`	Contribute a threat indicator (K-anonymity-gated promotion, engine-side). `identifier_kind` + `value` (+ optional `observation_id`). Needs the `submit:write` scope on a non-anon key; a plaintext host/URL for a hash field is rejected, never coerced
`submit_feedback`	Contribute a data-quality / false-positive report. `comment` (required), `severity?`, `query?` (stored opaque, never executed), `results?`. Same scope gate

identify / assess / walk are also available as gallery workflows (e.g. indicator for a full identify → assess → walk threat read including WHOIS/BGP history) you run via run_workflow — call list_workflows to find them. explain_schema and read_docs are cached server-side, so an agent can call them as often as it needs without paying a quota cost.

The whisper.online control plane, two tiers

No API key → verify_identity still works (it reads the public keyless gateway), so anyone can answer "is this address a Whisper agent, and who operates it?".
With your API key → whisper_agents unlocks provision + govern + egress. The key on your MCP session is forwarded to the control plane; every op runs under your own tenant and scopes, and the backend answers cap/argument problems with clear typed errors (BAD_ARGS, AGENT_LIMIT, EGRESS_DISABLED, …).

The `query` tool

query is the primary tool. Beyond the Cypher string it accepts two presentation controls:

format — how rows come back:
- table (default for tabular results) — columns + rows, the classic shape.
- graph — nodes + edges, for results that return whole paths or subgraphs.
- compact — a denser encoding for large result sets that minimises token usage.
Pagination — page (1-based) and pageSize. Pagination is count-first: the first page returns the total row count alongside the slice, so an agent knows how many pages exist before walking them.

Every query response also carries an evidence block and a top-level references object and, where they apply, the self-correction fields (autoLimited, rewritten, fix, truncated) — see Self-correcting queries and Evidence & provenance. The response carries no remaining-quota or plan-budget field: the MCP server applies no plan, rate, or usage limits of its own — read the whisper://quota resource for the graph engine's own caps.

Resources

The schema and the doc/cookbook resources are now discovered on demand through explain_schema and read_docs rather than always listed — that's most of what cut the always-listed surface in half. The lightweight, always-useful descriptors stay available as resources:

Resource	Contents
`whisper://schema/full`	The full schema reference: node labels with counts, edge types with directions, and the edge-direction landmines — also surfaced inline by `explain_schema` so an agent rarely needs the raw resource
`whisper://stats`	Live database statistics — nested node and edge counts (`physical` / `virtual` / `total`) plus the `threatIntel` block
`whisper://quota`	Caller's plan tier, query-depth cap, timeouts, and row limit — the graph engine's own limits, surfaced read-only so an agent can self-throttle. The MCP server itself applies no rate or usage limits, so a `query` response carries no remaining-budget field; read this resource at the start of a session for the caps that actually apply
`whisper://server`	Server / deployment descriptor: `serverVersion`, `deploymentName`, the live `readyLayers[]` (which capability layers are active on this deployment), and a `schemaHash` for drift detection

The Cypher function reference, the cookbook, and the agent guide are reachable through read_docs (e.g. read_docs for the Cypher functions page, the cookbook, or any docs page) instead of riding along in every connection.

Prompts

The connector ships 12 slash-style prompts for common investigation patterns, one per gallery workflow. A prompt's depth depends on which capability layers are live on the deployment you're connected to — the readyLayers[] field of the whisper://server resource is the authoritative list.

Each prompt is generated from a gallery workflow, so the prompt surface matches the gallery by construction — every prompt wraps a run_workflow call, embeds that workflow's per-step plan, and ends with an evidence directive. Arguments come from the workflow's inputs; the tunable params are listed in the prompt body (call list_workflows for their options and ranges).

Prompt	Argument(s)	What it does
`indicator`	`indicator`	Full-depth investigation of a domain, IP, ASN, or prefix — verdict, historical WHOIS/BGP context, and everything connected to it
`attack-surface`	`domain`	A domain's external attack surface: DNS, mail, IP, ASN, subdomains, CDN-origin candidates, scored for risk
`typosquat`	`domain`	Registered lookalikes of a brand, scored for which are dangerous
`build-takedown-evidence-package`	`domain`	Assemble a ready-to-submit dossier for taking down a scam or phishing domain
`indicator-enrichment`	`value`	One domain or IP into a full context card — owner, hosting, mail, location, reputation
`infrastructure-mapping`	`value`	Trace one indicator to its true owner and full estate, even behind CDNs and privacy screens
`supply-chain`	`target`	Map what a domain depends on — every external provider by function, with dependency chains and single-vendor (SPOF) signals
`nameserver-hijack-dns-consistency`	`value`	Check a domain's nameservers for the misconfigurations that enable DNS hijacking
`route-health`	`target`	BGP route health: prefixes, peers, MOAS conflicts, RPKI ROA coverage
`bgp-hijack-exposure`	`value`	Grade a network's routing security and trace conflicts to the domains they'd expose
`anycast-dns-root-sovereignty`	`country`	A country's DNS-root resilience if it were cut off from the world
`subdomain-takeover`	`value`	Find subdomains pointing at abandoned services an attacker could claim

Prompts and gallery recipes overlap by design: a prompt is a single-shot, client-triggered template; a gallery recipe (run_workflow) is a server-side, multi-step investigation you run by slug. When in doubt, the gallery is the more capable path — see the Workflow & recipe gallery.

Schema introspection

When an agent isn't sure whether the canonical property is h.name or h.fqdn, or whether to anchor on HOSTNAME or DOMAIN, it should call explain_schema before writing the query. With no argument it returns the label catalogue; pass a label and it returns that entity's properties, edges, and a runnable sample traversal. It's cached server-side and returns in milliseconds — cheap to call, and it prevents the most common query bug (writing a defensive MATCH (h) WHERE h:HOSTNAME OR h:DOMAIN OR h:FQDN ... against a billion-node label, which the validator rejects).

A typical introspection-then-query transcript:

User:     What labels does the Whisper graph have?
Agent:    [calls explain_schema]                 (no argument)
          → {labels: [{label: "HOSTNAME", count: 2_631_997_144, scale: "2.6B"},
                      {label: "IPV4", count: 618_914_961, scale: "619M"},
                      ... 37 more], labelCount: 39, edgeCount: 45}
Agent:    The graph has 39 labels. The biggest are HOSTNAME (2.6B), IPV4 (619M), and
          ANNOUNCED_PREFIX (1.4M). What do you want to look at?

User:     Show me the threat properties on a hostname.
Agent:    [calls explain_schema {label: "HOSTNAME"}]
          → {label: "HOSTNAME", exists: true, count: 2_631_997_144, scale: "2.6B",
             properties: [{name: "name", type: "String"},
                          {name: "threatScore", type: "Double"},
                          {name: "threatLevel", type: "String"},
                          {name: "isThreat", type: "Boolean"},
                          {name: "isC2", type: "Boolean"}, ...],
             outboundEdges: [{type: "RESOLVES_TO", to: ["IPV4", "IPV6"], pattern: "(h:HOSTNAME)-[:RESOLVES_TO]->(:IPV4)"},
                             {type: "CHILD_OF", to: ["HOSTNAME", "TLD"]}, ...],
             inboundEdges:  [{type: "NAMESERVER_FOR", from: ["HOSTNAME"]},
                             {type: "MAIL_FOR", from: ["HOSTNAME"]}, ...],
             sampleTraversal: {intent: "...", cypher: 'MATCH (h:HOSTNAME {name: "example.com"}) RETURN h LIMIT 1'}}
Agent:    [calls query]
          MATCH (h:HOSTNAME {name: "evil.example"})
          RETURN h.threatScore, h.threatLevel, h.isThreat, h.isC2,
                 h.isPhishing, h.isMalware

Agents that introspect first write valid queries on the first try; agents that don't tend to fall back to defensive label-or'ed scans that the validator rejects.

Error model

The query tool returns a typed error envelope that clients and LLMs can branch on instead of parsing free-text messages.

Field	Meaning
`success`	`true` on success, `false` on error
`error`	Human-readable error message
`suggestion`	A concrete fix the agent can apply and retry
`errorCode`	Machine-readable code (see below)
`retryable`	Whether re-running the same query could succeed

errorCode is one of:

Code	Cause	Retryable
`SCHEMA_ERROR`	Bad label, property, relationship type, or column name	No — fix the query
`SYNTAX_ERROR`	The Cypher itself is malformed (bad token, unknown function)	No — fix the query
`LIMIT_ERROR`	`LIMIT` missing, malformed, or over the 5000 cap (usually carries a `CLAMP_LIMIT` fix)	No — fix the query
`VALIDATION_REJECTED`	The query failed a query-safety rule (see below)	No — fix the query
`QUERY_TOO_EXPENSIVE`	Stopped by the database's element / row-budget guard (stopped for size, not duration)	No — narrow it
`DB_TIMEOUT`	The query ran past its time budget	Yes — narrow it
`DB_UNAVAILABLE`	The graph database is unreachable	Yes

QUERY_TOO_EXPENSIVE is the size guard, distinct from DB_TIMEOUT's duration guard — a query can be cheap-per-row but still touch too many elements. retryable is read straight off the code, so a client can branch on it without parsing the message.

Migration note: the old catch-all CYPHER_SYNTAX_ERROR was split into SCHEMA_ERROR (bad label / property / column) and SYNTAX_ERROR (malformed Cypher), and LIMIT_ERROR was carved out of VALIDATION_REJECTED for the over-cap-LIMIT case. Clients that branch on CYPHER_SYNTAX_ERROR should migrate to the new codes.

Procedures callable inside `query` {#procedures}

Beyond the dedicated tools, the query tool accepts a set of procedures directly inside Cypher, so you can compose them into a larger query or run them ad hoc:

Procedure	What it does
`CALL explain("indicator")`	Threat assessment for an IP / hostname / ASN / CIDR (also the `explain_indicator` tool)
`CALL whisper.history("indicator")`	Historical WHOIS / BGP snapshots
`CALL whisper.variants("name" [, "LABEL"] [, checkExisting])`	Typosquat / lookalike variant generation
`CALL whisper.identify($hosts)`	Host identity — vendor, canonical name, `host_class`
`CALL whisper.assess($hosts)`	Coverage-qualified clean/suspicious verdict per host
`CALL whisper.walk($host, $depth, $budgetMs)`	Structural neighborhood when `identify` has no direct match
`CALL whisper.origins($domain)`	Candidate true-origin IPs behind a CDN/proxy
`CALL whisper.lookupTorRelay(...)` / `whisper.lookupTlsFingerprint(...)`	Tor-relay and TLS-fingerprint lookups
`CALL whisper.quota()`	Caller's plan and limits
`CALL db.labels()` / `db.relationshipTypes()` / `db.schema("json")`	Schema introspection

For a packaged, multi-step version of these, run a gallery recipe with run_workflow; the CALL form is there for composing inside a larger query of your own. whisper.variants() also works in expression position, not only as a top-level CALL — e.g. RETURN size(whisper.variants("paypal.com")) or MATCH (h:HOSTNAME {name: whisper.variants("paypal.com")[0].variant}) RETURN h. explain() and whisper.history() remain CALL-only.

The write/admin arms whisper.submit and whisper.watch are not reachable — the read-only pre-check rejects them (along with any CREATE/MERGE/DELETE/SET/REMOVE/LOAD CSV clause) before a query executes. The whole MCP surface is provably read-only.

Query-safety rules

The server validates every query before running it. These seven rules keep every query bounded — but most of them are things the server now helps you satisfy rather than hard rejections (see Self-correcting queries below). A query that genuinely can't be auto-corrected comes back as VALIDATION_REJECTED with a suggestion and, where one applies, a machine-applicable fix.

shortestPath must be bounded (no unbounded variable-length path inside it). An unbounded [*] returns a BOUND_PATH fix proposing [*1..6].
LIMIT must be ≤ 5000. An over-cap LIMIT is auto-clamped to 5000 and the query runs (rewritten: true).
No unlabeled MATCH — every MATCH pattern must carry a node label or a {name: …} anchor.
No same-variable label disjunction — WHERE n:A OR n:B on one variable is rejected (introspect with explain_schema instead).
Indexed text operators (CONTAINS / STARTS WITH / ENDS WITH) are allowed only on the .name property; on any other property the server returns an EXACT_MATCH fix.
No unanchored scans on large labels — anchor the MATCH on an indexed value.
Exploration queries must include a LIMIT — if you omit one the server auto-injects the default LIMIT and runs the query (autoLimited: true), rather than rejecting it.

A read-only pre-check runs before these seven rules and rejects any write/admin clause or mutating/admin CALL up front.

Self-correcting queries

The validator does more than reject. Where it safely can, it bounds or rewrites the query and runs it anyway, and tells you what it did. A query response can carry these self-correction fields so an agent spends fewer round-trips fixing its own Cypher:

Field	When it's set	What it means
`autoLimited: true`	You omitted `LIMIT` on an exploration query	The server appended the default `LIMIT` and ran the query. `rewrite` holds the original and effective Cypher.
`rewritten: true`	The server safely auto-corrected and ran the corrected form	Used for bounding rewrites that only narrow the result (never change which rows match) — e.g. `CLAMP_LIMIT` clamping an over-cap `LIMIT` to 5000. `rewrite` holds both Cypher strings.
`fix`	A rule failed and the correction would change which rows match	Returned for the agent to apply — never auto-run. Shape: `{kind, rewrittenCypher?, confidence, safeToAutoRetry}`.
`truncated: true`	The result hit the row cap	A partial (bounded) result, not an unbounded set.

Two classes of correction:

Bounding rewrites run automatically. They only trim the result to a bounded prefix — same rows, just capped — so they're safe to run without asking. The query succeeds and the response notes autoLimited / rewritten with the applied rewrite. Examples: APPEND_LIMIT (missing LIMIT), CLAMP_LIMIT (over-cap LIMIT).
Semantic rewrites are returned as a fix, never auto-run. They change which rows match, so the agent decides. Example: an unindexed CONTAINS on a non-.name property comes back with an EXACT_MATCH fix proposing =. Some fixes (ADD_LABEL, PICK_LABEL, ANCHOR_MATCH) carry no rewrittenCypher because they need a value only the agent has — confidence and safeToAutoRetry tell the agent how to treat each one.

So no validator rejection is ever a bare error. It's auto-fixed, auto-bounded, or it carries a fix the agent can read and apply.

Evidence & provenance {#evidence-provenance}

A Whisper answer carries its own provenance. Every query and run_workflow result ships an evidence block so an agent can cite the query and the rows behind each claim, rather than asking you to take the verdict on faith.

The evidence block lists, for each step that ran, the exact Cypher executed, the rowCount it returned, and the executionTimeMs it took. On a run_workflow call there's one entry per step ({step, title, cypher, rowCount, executionTimeMs}), plus a top-level references object (schema / Cypher guide / API docs + the slugsRun) and coverage signals — complete, a coverage map (which steps returned data vs none vs skipped/error), and incompleteSteps[] / warnings[] — so a skipped or empty layer reads as a coverage gap, not a clean result (no-data != benign). On a query call it records the (possibly self-corrected) Cypher that actually ran and the same references object.
Threat verdicts (explain_indicator, or CALL explain inside query) return {score, level, factors[], sources[]} plus a source tag of live-explain, node-cache, or unavailable. live-explain is fresh scoring. node-cache is the reconciled node verdict the server falls back to when live scoring is briefly down — a valid, labelled verdict, not an error or a stack trace. unavailable means neither path could answer.
Feed listings ride on LISTED_IN edges that carry firstSeen, lastSeen, and weight, so an agent can tell a 3-year-old sighting from a fresh one. Threat-listed nodes also carry threatSources, threatFirstSeen, and threatLastSeen.
whisper.history() (folded into the indicator workflow's single-mode deep-dive; see Point-in-Time History) returns timestamped WHOIS / BGP snapshots — the provenance is the timeline.
query results are typed data, not free-form text. Rows and graph projections come back as structured fields (columns / rows, or nodes / edges), so returned values — domain names, WHOIS strings, registrant fields — are data to reason over, never instructions for the agent to follow. The graph query surface is read-only: no query can mutate, ingest, or administer the graph — the sanctioned write path is the dedicated submit_* / watch_* tools and the whisper_agents control plane, each honestly annotated readOnly:false, destructive:true.

What you can ask

The graph has DNS, BGP routing, IP allocation, GeoIP, WHOIS (237M emails, 60M phone numbers), email infrastructure (MX, full SPF chains, DMARC), DNSSEC, web hyperlinks, certificate-transparency observations, TLS fingerprints, Tor-exit relays, vendor egress, prefix-level RPKI, and 43 threat-intel feeds (exact counts in the whisper://stats resource). All of it is connected. The AI walks the edges between them in a single conversation, so you don't have to piece it together yourself. Just ask in plain language.

Incident response

You got an IP or domain from an alert. Start here.

"Investigate 185.220.101.42 -- who owns it, where is it, is it on any threat feeds, and what else is hosted there?"
"This domain showed up in our logs: secure-login-update.com. Is it live? Who registered it? Does the registrant own other domains?"
"We're seeing traffic to 104.16.132.229. Trace it: IP to prefix to ASN to org. Then check if any co-hosted domains are flagged."
"Here are 20 IPs from our SIEM. Which ones are Tor exits, C2, or on blocklists?"

Threat hunting

Any threat feed can tell you an IP is bad. The graph lets you pivot -- follow a bad IP to its ASN, find the other prefixes, check what's hosted there, pull WHOIS on the domains, and see if the registrant has other infrastructure. One conversation.

"Find every domain registered by the same WHOIS contact as secure-login-update.com. Do any share IPs or nameservers?"
"Check AS60729 -- how many of its prefixes have threat-listed IPs? What's the threat density?"
"Are there MOAS conflicts on this prefix? Which ASNs are announcing it?"
"Find all IPs in 185.220.101.0/24 that appear on threat feeds. Group by category."
"What domains resolve to IPs on the Dan Tor Exit feed? Cross-reference with their WHOIS registrants."

Brand protection and typosquatting

Run the typosquat gallery workflow over a domain — it runs 14 mutation algorithms and returns the lookalikes that are actually registered (homoglyphs, bitsquats, TLD swaps, omissions, and more). Pivot the hits through threat intel and WHOIS to see which ones are live attacks.

"Run the typosquat sweep for paypal.com."
"Generate lookalike domains for our brand, then check which ones are on threat feeds or resolve to live IPs."
"Which registered variants of microsoft.com share a registrant email or nameserver with each other?"
"Run a typosquat sweep on stripe.com -- who registered the lookalikes and is any of their infrastructure flagged?"

Attack surface

Everything an attacker would look for: subdomains, IPs, mail servers, SPF authorization chains, nameservers, WHOIS.

"Map tesla.com -- subdomains, IPs, ASNs, nameservers, mail servers, SPF includes, and WHOIS registrant."
"What third-party services can send email as netflix.com? Walk the full SPF include chain."
"Find every subdomain of example.com, resolve them, and group by ASN. How many hosting providers?"
"Where does the CNAME chain for www.example.com end up? Who hosts the final target?"

WHOIS and registrant pivoting

This is where investigations get interesting. WHOIS gives you a registrant email or phone number. The graph has 237M emails and 60M phones, so you can follow that contact to every other domain they registered, then check if those domains share hosting.

"Find the WHOIS registrant for secure-login-update.com, then every other domain they registered. Do any share infrastructure?"
"What domains use this contact email? Show their IPs and ASNs, and flag any that are threat-listed."
"Has google.com changed registrars? Show the history."
"Find domains registered with the same phone number. Any overlap in hosting?"
"Compare WHOIS for these five domains -- same registrant? Same email? Same registrar?"

BGP and routing

116K ASNs, 2.5M prefixes, full peering topology.

"If AS16509 (Amazon) went down, how many prefixes and peers are affected? What domains go dark?"
"Which ASNs peer with both Cloudflare and Google?"
"Show the BGP routing history for 8.8.8.0/24. Has the announcing ASN changed?"
"Find prefixes with MOAS conflicts announced by AS60729. Any of them hosting threat-listed IPs?"
"What RIR allocated this prefix? Which org registered it?"

Comparing infrastructure

The thing that's hard to do anywhere else: checking whether two domains share anything. Same IPs, same ASN, same nameservers, same registrant email, same phone number. The graph checks all of it at once.

"Do pandas-crossing.com and afterlifeevents.com share any infrastructure?"
"These three phishing domains were reported separately. Any shared nameservers, IPs, ASNs, or WHOIS contacts?"
"Compare the hosting and email setup of these two competing SaaS products."
"Find domains that share both the same registrant email and the same IP range as this known-bad domain."

Email, SPF, and DMARC

The graph stores the full SPF record structure -- includes, ip4, a, mx, exists, redirect -- as separate edges, plus MX and DMARC. So you can walk the authorization chain rather than parsing TXT records by hand. (DKIM selector/key records are not live on this deployment yet — see layer availability — so questions that depend on DKIM signing return no rows here.)

"Who can send email as shopify.com? Walk the SPF chain."
"What domains use the same SPF include targets as this phishing domain?"
"Does this domain have MX records? SPF? a DMARC policy? Give me the full email setup."
"Where does this domain send its DMARC aggregate reports?"

GeoIP and data residency

619M IPv4 addresses mapped to cities and countries.

"Where are all the IPs that example.com resolves to? List by country."
"Does this company host anything in sanctioned countries? Check all their domain IPs."
"Find all IPs in this ASN that geolocate to Russia."

Web links

The hyperlink graph between hostnames, from Common Crawl.

"What external domains does google.com link to? Where are those hosted?"
"Who links to this suspicious domain? Are any of the linking sites threat-listed?"
"Do these two domains link to each other?"

DNSSEC

"Is cloudflare.com signed with DNSSEC? What algorithm?"
"What percentage of domains under this nameserver use DNSSEC?"

History

WHOIS and BGP changes over time.

"Show the WHOIS history for google.com -- registrar changes, nameserver updates, ownership."
"BGP routing history for 8.8.8.8 -- has the announcing ASN or prefix changed?"
"When was this domain registered? Has it changed hands?"

Host identity and de-cloaking

Whose infrastructure is this, and where does it really live? The host-identity recipes answer identity separately from verdict — a host on AWS isn't malicious just because AWS also hosts malware — and the origin-IP recipe finds the server behind a CDN.

"Whose infrastructure is github.com and raw.githubusercontent.com? Are either of them dangerous?"
"Identify these 50 hosts from our logs, then assess each -- which are unknown vs. known-clean vs. suspicious?"
"What are the candidate true-origin IPs behind www.cloudflare.com? Which can I reach directly, behind the WAF?"
"No identity match for this host -- walk its neighborhood and show the nearest known vendors and structural siblings."

Egress, fingerprints, and transparency

The newest layers: who really operates a netblock, what an IP's TLS fingerprint reveals, which subdomains certificate transparency has seen, Tor-exit identity, and DMARC posture.

"Does 185.220.101.1 run a Tor exit relay? Show the relay fingerprints and its anonymizer flags."
"Who operates the netblock 104.16.0.0/13 -- the egress vendor, not just the WHOIS owner?"
"Read the TLS fingerprint of this IP and find every other IP that shares it."
"What hostnames has certificate transparency seen for this domain, including CT-only names that never resolve?"
"Where does apple.com send its DMARC reports? Show the rua/ruf recipients and the SPF mechanisms."
"Show signed ROAs for this prefix -- the authorizing ASN, max-length, and trust anchor."

Try these prompts

A few prompts are worth a try first.

`indicator`

Full indicator triage in one pass — feed listings, composite score, level, recency, and hosting context for an IP / hostname / ASN / CIDR.

/indicator 185.220.101.1

This indicator is a Tor exit node listed in several feeds (Dan Tor Exit, Tor Exit Nodes, FireHOL Level 2, IPsum). The prompt returns the feed list, score breakdown, first/last seen across all feeds, and the upstream ASN's reputation — with the Cypher behind each step in the evidence block.

`typosquat`

Brand-protection sweep — finds registered lookalikes of a domain (the typosquat workflow), enriches each with threat-feed listings, and pivots through WHOIS to identify who registered the suspicious ones.

/typosquat paypal.com

Returns: every registered variant of the domain (with the mutation method and a confidence score), which of those variants are on threat feeds or resolve to live infrastructure, and the WHOIS registrant behind the suspicious ones.

`route-health`

A network or address block's full routing health card — announced prefixes, peers and upstream transit, MOAS conflicts, RPKI ROA coverage, and the facilities and exchanges it sits in.

/route-health AS13335

For Cloudflare's ASN, the prompt returns the network name, a prefix-inventory sample, the facilities and IXPs it sits in, MOAS detections, country of registration, and how many of the routed IPs appear in threat feeds.

`indicator`

The same single-mode deep-dive as above, run on a different host to show the grounding contract an agent relies on: a coverage-qualified verdict band, vendor identity when the host is known, and the structural neighbourhood around it — so "no data" is never read as "known clean."

/indicator raw.githubusercontent.com

Returns the host's vendor and host_class, a coverage-qualified clean / suspicious verdict (one bad sub-URL does not condemn a multi-tenant apex), and the nearest known vendors and structural siblings — each step with its Cypher in the evidence block.

PreviousSetup NextWorkflow gallery

Reference

Reference Documentation

What's exposed

Read tools (10)

Write tools (8)

The whisper.online control plane, two tiers

The query tool

Resources

Prompts

Schema introspection

Error model

Procedures callable inside query {#procedures}

Query-safety rules

Self-correcting queries

Evidence & provenance {#evidence-provenance}

What you can ask

Incident response

Threat hunting

Brand protection and typosquatting

Attack surface

WHOIS and registrant pivoting

BGP and routing

Comparing infrastructure

Email, SPF, and DMARC

GeoIP and data residency

Web links

DNSSEC

History

Host identity and de-cloaking

Egress, fingerprints, and transparency

Try these prompts

indicator

typosquat

route-health

indicator

The `query` tool

Procedures callable inside `query` {#procedures}

`indicator`

`typosquat`

`route-health`

`indicator`