Question 1

Who qualifies for the free researcher API?

Accepted Answer

The researcher tier at /signup/researcher is free for academic and government institutions (.edu and .gov affiliations). It is modeled on the access pattern of Wharton Research Data Services (WRDS), the CMS Research Data Assistance Center (ResDAC), and the AHRQ Healthcare Cost and Utilization Project (HCUP). In each of those, the price of access is a standard citation in resulting work rather than a license fee, and Fonteum follows the same model. The researcher API provides programmatic access to research snapshots and the provider graph, including semantic search over 6.8M+ active NPI embeddings for natural-language queries by specialty, geography, or clinical context. Researchers who need only the published study files can use the static CSV and JSON downloads at /research, which require no account or API key. An exploratory analysis can therefore begin immediately, and an API credential is needed only for programmatic or graph-scale work.

Question 2

What makes Fonteum's datasets reproducible?

Accepted Answer

A published study can be reproduced only to the extent that its page names the source, observation date, methodology, limitations, and downloadable artifact. Coverage varies by study and source. Retained row history is not universal: the July 12 audit found banked rows only in named sanctions, procurement, and provenance tables, with none in the EU sanctions table.
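As a rough illustration, the check described above can be sketched as a small function that reports which of the five elements a study record leaves unnamed. The dictionary keys (source, observation_date, methodology, limitations, artifact) and the placeholder values are hypothetical names chosen for this sketch, not fields Fonteum is known to publish.

```python
# Hypothetical field names; adapt them to whatever the study page actually exposes.
REQUIRED_FIELDS = ("source", "observation_date", "methodology", "limitations", "artifact")


def missing_reproducibility_fields(study: dict) -> list:
    """Return the required fields that are absent or blank in a study record."""
    missing = []
    for field in REQUIRED_FIELDS:
        value = study.get(field)
        # Treat None, empty strings, and whitespace-only strings as "not named".
        if value is None or (isinstance(value, str) and not value.strip()):
            missing.append(field)
    return missing


# Placeholder record for illustration only; these are not real study values.
example = {
    "source": "example-source",
    "observation_date": "2026-01-01",
    "methodology": "",
    "artifact": "example-study.csv",
}
print(missing_reproducibility_fields(example))  # ['methodology', 'limitations']
```

An empty result means every element is named, which is a necessary condition for reproduction, not proof that the artifact itself is retained.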

Question 3

What research studies are already published?

Accepted Answer

The /research corpus covers studies built on public-record source families. Representative examples include the Nursing Home Deficiency & Harm Rate study (418,148 citations across 14,635 facilities), MIPS Score Distribution (477,137 PY2023 clinician scores), Open Payments recipient concentration, and Medicare Part D prescriber concentration. Inspect each study page for its available methodology, limitations, downloads, citation, and observation date.

Question 4

How should I cite a Fonteum dataset?

Accepted Answer

Inspect the individual study page for its available citation footer, Dataset JSON-LD, methodology version, observation date, and primary source. Cite the study and source actually named there. Fonteum does not mint external academic identifiers it has not registered, and retained snapshot or row-history coverage must not be inferred where the page does not supply it.
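One way to follow this advice is to build the citation only from fields the page's Dataset JSON-LD actually supplies. The sketch below assumes the block is a schema.org Dataset object and that it uses the standard properties name, creator, version, dateModified, and url. Which of these a given Fonteum page provides is an assumption, and using dateModified as a stand-in for the observation date is also an assumption. The sample values are placeholders.

```python
import json


def format_citation(jsonld_text: str) -> str:
    """Build a plain-text citation from the fields a Dataset JSON-LD block supplies."""
    data = json.loads(jsonld_text)
    if data.get("@type") != "Dataset":
        raise ValueError("expected a schema.org Dataset object")

    parts = []
    creator = data.get("creator")
    # schema.org allows creator to be a plain string or an object with a name.
    if isinstance(creator, dict):
        creator = creator.get("name")
    if isinstance(creator, str) and creator:
        parts.append(creator)
    for key in ("name", "version", "dateModified", "url"):
        if data.get(key):
            parts.append(str(data[key]))
    if not parts:
        raise ValueError("the JSON-LD block names nothing citable")
    # Only fields present on the page are cited; no identifier (such as a DOI) is invented.
    return ". ".join(parts) + "."


# Placeholder JSON-LD for illustration only.
sample = json.dumps({
    "@context": "https://schema.org",
    "@type": "Dataset",
    "name": "Example Study",
    "creator": {"@type": "Organization", "name": "Example Publisher"},
    "dateModified": "2026-01-01",
})
print(format_citation(sample))  # Example Publisher. Example Study. 2026-01-01.
```

Fields the page omits are left out of the citation instead of being guessed, which matches the rule against inferring coverage the page does not supply.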

Question 5

What scale and breadth of data can a researcher expect?

Accepted Answer

The provider graph is anchored by CMS NPPES at 6.8M+ active records. Fonteum's newest observed production system date was June 10, 2026. Representative study volumes include 418,148 nursing-home deficiency citations and 1.3M+ PBJ daily records per quarter. The OIG LEIE serving table held 68,055 rows from the May 8 release when checked July 12. A listing in the active production registry does not by itself establish that a source is loaded, complete, or fresh. Join keys and source metadata vary by dataset.

Question 6

Can I use Fonteum data in a published paper or grant deliverable?

Accepted Answer

The Fonteum-authored layer is released under CC-BY-4.0, while reuse terms for underlying records remain source-specific. Cite the named study and primary source shown on its page. Methodology, limitations, download, and citation coverage varies by dataset, so inspect the individual study before relying on a fixed-snapshot reproduction claim.
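When a paper does rely on a specific download, one common practice is to record a checksum of the exact file alongside the observation date shown on the study page. The following is a minimal sketch of that practice using Python's standard library. It is not a Fonteum feature, and the function names and record layout are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 65536) -> str:
    """Hash a file in chunks so large downloads do not need to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def snapshot_record(path: Path, observation_date: str) -> dict:
    """Describe one downloaded artifact; the record layout is an illustrative choice."""
    return {
        "file": path.name,
        "observation_date": observation_date,  # copied from the study page by the researcher
        "sha256": sha256_of_file(path),
    }
```

Calling snapshot_record(Path("study.csv"), "2026-01-01") on a local copy (both arguments are placeholders) yields a small dictionary that can be stored with the analysis code, so a later reader can confirm they hold the same bytes.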

Reproducible federal datasets, free for research.

Datasets · researcher API · citation chain

Free, citable, reproducible — to the snapshot

From federal portal to citable dataset
