Scattered biomedical datasets: structured, queryable, and AI-ready.
We make public omics data usable.
Each structures a different part of the public biomedical data landscape. Together, they form Devano’s Data Layer: harmonized, queryable, and ready for scientists, workflows, and AI agents.
Gene Expression Omnibus, structured for discovery. Study and sample metadata harmonized across disease, tissue, perturbation, arm role, and experimental design — so GEO becomes queryable instead of free text.
QTL studies and association tables, structured from papers and supplements. Tissues, cohorts, modalities, and analysis types are harmonized so variant-to-function signals can be searched and compared across studies.
GWAS Catalog, structured for downstream use. Traits, diseases, ancestry, sample sizes, and association fields are parsed and ontology-mapped so queries work across inconsistent study labels.
Biomedical literature, continuously scored and structured. Papers are mapped to genes, diseases, topics, and custom criteria so teams can track what matters without manual triage.
Queries that normally require scraping, cleaning, and manual review return structured results in one call. Every output stays linked to the source study, cohort, paper, and curation status behind it.
Powerful on their own. Stronger together.
Wire it to an agent, call it from code, or query it directly. Same structured data, same provenance, same context — however your team works.
Connect any agent to Devano and it can call structured tools instead of scraping raw sources. Search studies, pull contrasts, traverse associations, and return source-linked outputs with provenance attached.
One agent. Different questions. Different shapes of structured context.
Replace scattered GEO scraping, hand−curated case/control matching, and ad hoc metadata cleanup with structured calls to Devano products. Results come back with normalized fields, source links, and provenance attached.
Work with the complete data landscape, not whatever your pipeline happened to reach.
Biology has never had a live public-data layer. Devano builds one.
Structured, connected data — ready for scientists and agents to use directly.
Static, versioned releases
Periodic snapshots. New studies and current literature unavailable until the next release.
Fixed UI, fixed questions
You browse their interface. Custom queries require custom engineering.
Pre-scored, pre-interpreted
Evidence aggregated into a number. Provenance and raw associations abstracted away.
Not agent-queryable
Designed for humans clicking a web interface. AI agents can't call it or cross-reference across indexes.
Continuously improved
Data is continuously updated, corrected, and refined — not frozen in periodic releases.
Cross-index in one query
GWAS, QTL, GEO, and literature pre-aligned to a single ontology. One call replaces scattered scraping and ad hoc metadata cleanup.
Full provenance, raw associations
Every finding traces to source study, cohort, platform, and variant. Answers you can defend.
Plug in any way you work
Query via MCP agent, direct API, or code. Structured data, no fixed interface, no fixed questions.
Built by scientists who ran these analyses themselves
Decades at 23andMe, UCLA, Stanford, and Google — and spent years doing exactly the SOFT parsing, metadata curation, and batch correction this index eliminates.
A structural comparison of how different approaches handle study discovery, schema accuracy, cross-index linkage, and agent access.
| Approach | Study discovery | Schema accuracy | Cross-index | Current | Agent-queryable |
|---|---|---|---|---|---|
![]() |
✓ curated | ✓ validated | ✓ unified | ✓ live | ✓ MCP |
| DIY agent (PubMed + raw) | partial | error-prone | not possible | manual | build it |
| Genetics data platforms | limited scope | ✓ | siloed | versioned | partial |
| Manual research | incomplete | varies | manual | lag | no |
Start a project or talk with our team.
Piloting with biotech and academic teams • Enterprise-grade security • Expert implementation support