|
ACTIVE. Primary full-text source. Sections arrive pre-labeled (TITLE, ABSTRACT, METHODS, CONCL). Phase 0: already wired into the ingest pipeline.
|
pmc_bioc |
json |
email_key |
NCBI_API_KEY
✗
NCBI_EMAIL
✗
|
3.0/s |
7 |
0 |
Active
|
|
ACTIVE. PMC/PubMed article search. Returns PMC IDs for a query term. Without key: 3 req/sec. With NCBI_API_KEY: 10 req/sec.
|
ncbi_esearch |
json |
email_key |
NCBI_API_KEY
✗
NCBI_EMAIL
✗
|
3.0/s |
0 |
0 |
Active
|
|
ACTIVE. JATS XML fallback for articles not in BioC OA corpus. Same rate limits as esearch.
|
ncbi_efetch |
xml |
email_key |
NCBI_API_KEY
✗
NCBI_EMAIL
✗
|
3.0/s |
0 |
0 |
Active
|
|
PENDING. Free scholarly graph: 200M+ works, authors, institutions, concepts. No key required. Pass mailto= as a query parameter (polite pool), taking the address from the OPENALEX_EMAIL env var. Primary data layer; covers ~90% of scholarly literature. Endpoints: /works, /sources, /authors, /institutions. Docs: https://docs.openalex.org
|
openalex |
json |
none |
OPENALEX_EMAIL
✗
|
10.0/s |
0 |
0 |
Active
|
|
PENDING. DOI registration + metadata normalization. No key required. Add User-Agent mailto: header via CROSSREF_EMAIL. Essential for DOI resolution and deduplication. Endpoint: /works?query=... Docs: https://api.crossref.org
|
crossref |
json |
none |
CROSSREF_EMAIL
✗
|
50.0/s |
0 |
0 |
Active
|
|
PENDING. Directory of Open Access Journals — vetted OA quality filter. No key required for read endpoints. Endpoints: /search/journals /search/articles. Use for journal whitelist and license credibility check. Docs: https://doaj.org/api/v4/docs
|
doaj |
json |
none |
—
|
10.0/s |
0 |
0 |
Active
|
|
PENDING. PLOS publisher API: immediate access, no signup. Endpoint: /search?q=... Returns JSON. Fields: title, abstract, author, journal, DOI. About 5 minutes to a first usable request. Docs: https://api.plos.org
|
plos |
json |
none |
—
|
10.0/s |
0 |
0 |
Active
|
|
PENDING. Major publisher OA API; requires an account and an API key. The key goes in the api_key= query param. Register at: https://dev.springernature.com (about 15-30 min setup). Strongest structured OA dataset from a major publisher.
|
springer_nature |
json |
api_key_param |
SPRINGER_API_KEY
✗
|
5.0/s |
0 |
0 |
Active
|
|
PENDING. Biomedical full-text + MeSH annotations. No key required. Endpoint: /search?query=... Includes full-text XML where available. Covers life sciences. Docs: https://europepmc.org/RestfulWebService
|
europe_pmc |
json |
none |
EPMC_EMAIL
✗
|
10.0/s |
0 |
0 |
Active
|
|
PENDING. Frontiers publisher API — auth requirements unknown; verify stability. Check Swagger UI at base_url. Fallback: use OpenAlex filter if unstable. Confidence: medium.
|
frontiers |
json |
none |
—
|
5.0/s |
0 |
0 |
Active
|
|
PENDING. Universal OAI-PMH harvesting protocol. Used by OJS journals, institutional repositories, and many independent publishers. The endpoint for each journal is stored in journal_registry.oai_endpoint. Typical patterns: /oai, /oai/request, or /index.php/journal/oai. Use verb=ListRecords&metadataPrefix=oai_dc for Dublin Core metadata; add &from=DATE for incremental sync. No key required. This is the industry-standard ingestion protocol for independent journals.
|
oai_pmh |
xml |
none |
OAI_EMAIL
✗
|
5.0/s |
0 |
0 |
Active
|
|
PENDING. Cambridge journals API — register at https://api.cambridge.org/register. Publisher-level journal metadata and content access. Use case: Cambridge-published peer-reviewed journals. Confidence: high (existence confirmed).
|
cambridge_up |
json |
api_key_header |
CAMBRIDGE_API_KEY
✗
|
5.0/s |
0 |
0 |
Active
|
|
PENDING. DOI → OA full-text resolver. No key required; add email param. Given a DOI, returns best available OA full-text link (PMC, publisher, preprint). Endpoint: /v2/{doi}?email=YOUR_EMAIL. Critical for DOI-to-full-text pipeline after Crossref discovery. Docs: https://unpaywall.org/products/api
|
unpaywall |
json |
none |
UNPAYWALL_EMAIL
✗
|
10.0/s |
0 |
0 |
Active
|
|
CONDITIONAL. Major publisher. An API exists but is more tightly controlled than the OA options. Requires an institutional token for full text. Register at: https://dev.elsevier.com. Good for metadata and abstracts; full text is gated by institution. Confidence: medium (access depends on subscription).
|
elsevier |
json |
api_key_header |
ELSEVIER_API_KEY
✗
ELSEVIER_INST_TOKEN
✗
|
2.0/s |
0 |
0 |
Active
|
|
CONDITIONAL. API token available via the admin portal. Aimed primarily at institutional/admin integrations rather than serving as an open scholarly API. Verify access at https://help.tandfonline.com (search for "API Token"). Confidence: low as a fully open scholarly API.
|
taylor_francis |
json |
api_key_param |
TF_API_TOKEN
✗
|
2.0/s |
0 |
0 |
Active
|
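The request patterns described in the rows above can be sketched in Python as follows. This is a minimal illustration, not the pipeline's actual code: the function names are hypothetical, the Unpaywall base URL `https://api.unpaywall.org` and the NCBI parameter names `email` and `api_key` are assumptions not stated in the table, and no network calls are made.

```python
from urllib.parse import quote, urlencode


def ncbi_auth(env):
    """Return (query params, requests per second) for the NCBI sources.

    Assumes the E-utilities parameter names `email` and `api_key`.
    Rates follow the table: 3 req/sec without a key, 10 req/sec with one.
    """
    params = {}
    if env.get("NCBI_EMAIL"):
        params["email"] = env["NCBI_EMAIL"]
    key = env.get("NCBI_API_KEY")
    if key:
        params["api_key"] = key
    return params, (10.0 if key else 3.0)


def oai_list_records_url(endpoint, from_date=None, metadata_prefix="oai_dc"):
    """Build an OAI-PMH ListRecords URL; from_date enables incremental sync."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        params["from"] = from_date
    return f"{endpoint}?{urlencode(params)}"


def unpaywall_url(doi, email, base="https://api.unpaywall.org"):
    """Build the /v2/{doi}?email=... lookup URL (base URL is an assumption)."""
    return f"{base}/v2/{quote(doi, safe='/')}?{urlencode({'email': email})}"


if __name__ == "__main__":
    import os

    print(ncbi_auth(os.environ))
    # example.org stands in for a value of journal_registry.oai_endpoint
    print(oai_list_records_url("https://example.org/oai", from_date="2024-01-01"))
    print(unpaywall_url("10.1371/journal.pone.0000001", "you@example.org"))
```

Keeping these as pure URL and parameter builders means they can be checked without touching the network; the per-source rate limit from the table would be enforced separately by whatever client sends the requests.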