BioSkepsis vs ERA (Google DeepMind): AI-Powered Biomedical Literature Synthesis vs Automated Scientific Code Generation

May 22, 2026

Reviewed 22 May 2026

BioSkepsis vs ERA (Google DeepMind): AI-Powered Biomedical Literature Synthesis vs Automated Scientific Code Generation

ERA (Empirical Research Assistance) is Google DeepMind's AI system for writing expert-level scientific software, published in Nature on 19 May 2026, with its Computational Discovery prototype available through a trusted tester programme in Google Labs. BioSkepsis is a biology-native research assistant that synthesises 40M+ life-science papers with Gene Ontology, MeSH, and gene-level retrieval, built by EFEVRE TECH LTD alongside the AMGEL robotic lab platform and VITALE documentation system. These tools solve fundamentally different problems: ERA writes and optimises code for computational experiments; BioSkepsis reads and reasons over published literature. Neutral side-by-side comparison, with sources.

TL;DR ERA is the right tool if you need an AI system that writes, executes, and iteratively optimises scientific software against a defined quality metric, using LLM-driven code generation and tree search to explore thousands of algorithmic variants in parallel. BioSkepsis is the right tool if you need biology-native literature retrieval, full-text reasoning over methods and controls, citation network analysis, citation verification, and hypothesis generation grounded in specific passages across 40M+ curated papers, with the option to connect to AMGEL for automated experiment execution and VITALE for protocol documentation. ERA automates computational experimentation; BioSkepsis automates evidence reasoning and reproducibility assurance. Different jobs, different architectures, minimal overlap.

How ERA generates and optimises scientific software for computational experiments

ERA combines a Large Language Model (Gemini) with a tree search algorithm - Flat UCB Tree Search (FUTS), inspired by AlphaZero - to iteratively generate, execute, and score candidate programs. The input is a "scorable task": a problem description, a scoring metric, and training/validation data. ERA then searches scientific literature for research ideas, generates code implementing those ideas, and uses tree search to navigate the space of possible solutions, balancing exploration of new approaches against exploitation of promising ones.

The system does not produce a single answer. It generates a tree of software candidates, with each node representing a different algorithmic approach, and converges on expert-level solutions by iterating through thousands of code variants. The scoring metric drives the search; ERA keeps refining until it maximises the defined quality measure.

The benchmarks are concrete. In bioinformatics, ERA discovered 40 novel methods for single-cell RNA sequencing batch integration that outperformed the top human-developed methods on the scIB public leaderboard, where nearly 300 existing tools compete. In epidemiology, ERA generated 14 models that outperformed the CDC ensemble and all other individual models for forecasting COVID-19 hospitalisations at a state level across the United States. The system also demonstrated expert-level performance on satellite imagery analysis, neuroscience prediction, general time-series forecasting, and mathematical optimisation benchmarks.

ERA is open-source. The codebase, including the FUTS algorithm and benchmark notebooks, is available on GitHub (google-research/era). Eight published manuscripts apply ERA to specific scientific problems spanning epidemiology, climate, hydrology, economics, neuroscience, and combinatorics.

ERA query example: single-cell RNA-seq batch integration in bioinformatics

Input: a scorable task with scRNA-seq datasets containing batch effects across samples, a scoring metric combining batch effect removal and biological signal conservation, and training/validation splits. ERA searches the literature for integration strategies, generates code implementing novel batch-correction algorithms, and uses tree search to iterate through variants. The result: 40 novel methods that outperformed all existing human-developed tools on the scIB public leaderboard, including methods from nearly 300 competing tools.

How BioSkepsis retrieves and reasons over biomedical literature for researchers

BioSkepsis retrieval is weighted by Gene Ontology terms, MeSH descriptors, gene symbols, and pathway relationships. A query about AMPK activation and its downstream effects on hepatic lipogenesis returns papers linked by the biological concepts involved, not just papers whose abstracts contain matching keywords. This biology-native retrieval layer is what separates BioSkepsis from both general-purpose LLMs and computational experimentation systems that treat biomedical papers as sources of algorithmic ideas rather than as primary evidence.

Answers are grounded in full text including methods, controls, and supplementary data. Every claim links back to the exact passage in the retrieved paper. When evidence is insufficient, BioSkepsis declines to answer rather than generating a plausible-sounding response. The research landscape graph classifies papers by structural role (Foundational, Hub, Bridge, and Novel) and draws on Semantic Scholar's 214M+ corpus for landscape expansion.

BioSkepsis also performs citation verification: a seven-step pipeline (Steps A through G) that audits whether cited papers in an existing document actually support the claims they are attached to. This is a forensic capability; it checks the integrity of literature use, not just its availability.

BioSkepsis query example: mechanistic synthesis in hepatic lipogenesis

A researcher asks: "What mechanisms link AMPK activation to suppression of SREBP-1c-mediated lipogenesis in hepatocytes, and which upstream kinases are involved?" BioSkepsis searches 40M+ papers, retrieves full-text studies covering LKB1, CaMKK2, and ACC phosphorylation, maps the citation network across these papers, classifies each by structural role, and produces a synthesis with every claim traceable to a specific passage and PMID.

The EFEVRE ecosystem: physical execution, documentation, and interpretation in one loop

BioSkepsis is one of three products built by EFEVRE TECH LTD. The other two address problems that ERA does not attempt to solve.

AMGEL (AutoMated GEneral Laboratory) is a patent-pending robotic platform that automates all mainstream wet-bench procedures on one device: pipetting, centrifugation, heating/cooling, magnetic bead isolation, cold storage, and PCR. It runs 860+ pre-set protocols with 24/7 unattended operation. AMGEL physically executes experiments; ERA automates only the computational side.

VITALE (Versatile Integrated Technology Advancing Life-Science Exploration) is the software layer that records every step, parameter, timestamp, and result during AMGEL operation. Its Protocol Designer, Personal Labbook, Scheduling Calendar, and Protocol Library with community scoring ensure that no experimental detail goes undocumented. VITALE addresses the documentation pillar of the reproducibility crisis: even when experiments are performed correctly, poor reporting in publications makes replication structurally impossible.

Together, the three products form a closed loop: AMGEL removes human error from execution, VITALE removes human error from documentation, and BioSkepsis removes human bias from interpretation. ERA addresses a fourth dimension - computational experimentation - but does not touch the physical, documentation, or literature-interpretation layers.

The closed-loop difference

ERA generates an optimised algorithm for batch-correcting scRNA-seq data. That algorithm processes data from a physical experiment. If the original wet-lab protocol was executed inconsistently or poorly documented, the computational analysis is built on unreliable inputs. In the EFEVRE workflow, AMGEL runs the cell isolation and RNA extraction with recorded parameters, VITALE logs every step automatically, and BioSkepsis verifies that the resulting publication cites the literature accurately. ERA optimises the computation; the EFEVRE ecosystem ensures the data feeding that computation is reproducible.

Feature comparison: BioSkepsis (EFEVRE ecosystem) vs ERA (Google DeepMind)

Side-by-side feature comparison
Feature	BioSkepsis (EFEVRE)	ERA (Google DeepMind)
Primary job	Literature synthesis, citation verification, reproducibility assurance	Automated scientific software generation and optimisation
Primary audience	Biomedical and life-science researchers (all career stages)	Computational scientists, bioinformaticians, data scientists
Core output	Citation-grounded evidence syntheses, citation network maps, hypothesis generation	Executable, optimised scientific code (Python, R, etc.)
Paper corpus	40M+ curated biomedical papers (1931 to present, weekly updates)	Literature search for research ideas (corpus not disclosed)
Retrieval model	Biology-native knowledge graph (Gene Ontology + MeSH + gene symbols)	LLM-driven literature search for algorithmic approaches
Full-text reasoning	Yes, including methods, controls, supplementary data	No (reads literature to inform code, not to produce evidence syntheses)
Citation network analysis	Yes (Foundational, Hub, Bridge, Novel paper roles)	No
Citation verification / audit	Yes (seven-step pipeline, Steps A-G)	No
Hypothesis generation	Yes (literature-derived)	No (generates code, not hypotheses; see Co-Scientist for that)
Code generation	No	Yes (LLM + Flat UCB Tree Search, thousands of variants)
Computational experiment automation	No	Yes (generates, executes, scores, and iteratively refines code)
Tree search optimisation	No	Yes (FUTS algorithm inspired by AlphaZero)
Physical lab automation	Yes (AMGEL: 860+ protocols, 24/7 autonomous)	No (computational only)
Experiment documentation	Yes (VITALE: automatic step-by-step recording)	No
Personalised research feed	Yes (all plans; per-plan feed-count cap)	No
Zotero sync	Yes (all tiers)	No
Export formats	PDF, DOCX, Markdown, JSON, APA, Chicago, Harvard, Vancouver, BibTeX, RIS, CSV	Executable code, Jupyter notebooks, solution trees
Access model	Open signup, free tier, no credentialing	Open-source code on GitHub; Computational Discovery via trusted tester programme in Google Labs
Nature publication	No	Yes (May 2026; DOI: 10.1038/s41586-026-10658-6)
Underlying model	Proprietary pipeline (Claude-based synthesis)	Gemini + Flat UCB Tree Search (FUTS)
Domain coverage	Biology, medicine, pharma, biotech, agriculture, food, veterinary, environmental sciences	Multi-disciplinary (genomics, epidemiology, neuroscience, climate, hydrology, economics, mathematics)

Where ERA leads: automated computational experimentation across scientific disciplines

ERA's primary advantage is in automating the most labour-intensive phase of computational science: writing, testing, and refining the software that implements a scientific idea. The tree search architecture is the key differentiator. Rather than generating a single piece of code and hoping it works, ERA explores a branching tree of algorithmic variants, each scored against the defined quality metric, and converges on solutions that outperform expert-written alternatives.

The single-cell RNA-seq results are the strongest demonstration of this capability. Batch integration in scRNA-seq is a well-studied problem with nearly 300 existing human-developed tools and multiple public benchmarks. ERA discovered 40 novel methods that outperformed every one of those tools on the scIB leaderboard. This is not incremental improvement; it represents a qualitative expansion of the solution space that human developers had not explored.

The epidemiological forecasting results carry immediate public health significance. ERA generated 14 models that outperformed the CDC ensemble and all other individual models for forecasting COVID-19 hospitalisations at a state level. Google has since joined the CDC's live forecasting challenges for flu, COVID-19, and RSV, submitting weekly ERA-generated predictions for every U.S. state.

ERA's position within the Gemini for Science ecosystem amplifies its utility. Computational Discovery, the prototype built with ERA and AlphaEvolve, is available through Google Labs and allows researchers to define a scientific problem, provide a scoring metric, and let the system explore thousands of code variations in parallel. For computational scientists whose bottleneck is the iteration cycle of writing, testing, and refining analysis code, this is a direct productivity accelerator.

How the EFEVRE ecosystem complements ERA's computational output

ERA generates an optimised batch-correction algorithm for scRNA-seq data. That algorithm will process data from physical experiments: cell isolation, RNA extraction, library preparation, sequencing. If those upstream steps were executed inconsistently or documented poorly, the computational analysis inherits their noise. AMGEL standardises the physical protocol. VITALE documents every parameter. BioSkepsis verifies that the resulting publication accurately cites the methods and prior work. ERA optimises the computation; the EFEVRE ecosystem ensures the data is trustworthy and the reporting is accurate.

Where BioSkepsis leads: citation integrity, reproducibility, and evidence-grounded reasoning in the life sciences

BioSkepsis occupies territory ERA does not enter. The two systems operate on different substrates entirely: ERA operates on code and data; BioSkepsis operates on scientific text and evidence chains.

Citation verification, the seven-step audit pipeline that checks whether cited papers actually support the claims they are attached to, has no equivalent in the ERA system. ERA reads literature to extract algorithmic ideas for code generation; it does not audit whether papers cited in existing publications correctly support their associated claims. For researchers, reviewers, and editors concerned with the accuracy of the scientific record, this is a meaningful gap that ERA was never designed to fill.

The research landscape graph, with its classification of papers into Foundational, Hub, Bridge, and Novel roles via citation network analysis, provides structural understanding of a field that ERA's code-generation pipeline does not surface. A researcher mapping the state of a sub-discipline, identifying which seminal papers anchor the field and which bridge papers connect separate domains, needs this kind of analysis.

Biology-native retrieval is the most architecturally consequential difference. BioSkepsis indexes papers by Gene Ontology terms, MeSH descriptors, gene symbols, and pathway relationships. A query about AMPK activation via LKB1 returns different papers than a query about AMPK activation via CaMKK2, because the retrieval layer understands the biological distinction. ERA's literature search is oriented toward finding methods and algorithmic approaches, not biological mechanisms.

The EFEVRE ecosystem's physical layer addresses problems no computational system can. AMGEL runs 860+ laboratory protocols autonomously, 24 hours a day. VITALE records every parameter. ERA automates computation; the EFEVRE ecosystem automates execution, documentation, and interpretation.

The reproducibility gap ERA leaves open

Approximately 70% of published life-science data cannot be reproduced. The causes span three layers: inconsistent manual execution, poor documentation of experimental parameters, and biased interpretation of results. ERA addresses a fourth layer - computational experimentation - by ensuring that analysis code is optimised and reproducible. But it does not touch the upstream layers that generate the data being analysed. The EFEVRE ecosystem addresses all three upstream layers: AMGEL standardises execution, VITALE standardises documentation, and BioSkepsis standardises interpretation.

Where BioSkepsis and ERA overlap, and how they complement each other in biomedical research

The overlap is narrow. Both systems use scientific literature as input, but they use it for entirely different purposes. ERA reads papers to extract algorithmic ideas that inform code generation. BioSkepsis reads papers to produce citation-grounded evidence syntheses, map field structure, and verify citation integrity. A researcher querying ERA with a bioinformatics problem gets optimised code. A researcher querying BioSkepsis with a biological question gets an auditable evidence synthesis with PMIDs.

The complementary workflow is clearer here than in most AI-for-science comparisons, because the tools are so architecturally distinct. A computational biologist working on single-cell genomics could use BioSkepsis to survey the literature on batch-effect correction methods, understand which approaches are supported by robust experimental evidence, identify knowledge gaps, and verify the citations in their manuscript draft. They could then use ERA (or the Computational Discovery prototype) to generate and optimise novel batch-correction code that outperforms existing tools. The literature reasoning happens in BioSkepsis; the code optimisation happens in ERA; neither duplicates the other's core job.

The EFEVRE ecosystem extends this further. If the single-cell data comes from a physical experiment, AMGEL standardises the upstream wet-lab steps (cell isolation, RNA extraction) and VITALE documents every parameter. The result is a fully traceable pipeline: reproducible physical execution (AMGEL), automated documentation (VITALE), optimised computation (ERA), and verified literature grounding (BioSkepsis).

Who should use which tool in biomedical and life-science research

ERAComputational scientists seeking automated code generation and optimisation

You need to write, test, and iteratively refine scientific software for a computational experiment with a defined scoring metric. Your bottleneck is the iteration cycle of coding, running, evaluating, and rewriting analysis pipelines. You want an AI system that explores thousands of algorithmic variants in parallel and converges on expert-level solutions. ERA and Computational Discovery are built for this job.

BioSkepsisActive biomedical researchers and systematic reviewers

You need to reason over full-text literature in molecular biology, pharmacology, or the broader life sciences. You want citation network analysis, biology-native retrieval, hypothesis generation grounded in specific passages, and citation verification for manuscripts you are writing or reviewing. You need exportable references in BibTeX, RIS, or direct Zotero sync. BioSkepsis is built for this job.

BioSkepsis + AMGEL + VITALELabs that need reproducible execution, documentation, and interpretation

You run a wet lab and need to standardise experiment execution, automatically document every parameter, and verify that your publications cite the literature accurately. The EFEVRE ecosystem closes the loop from bench to publication. No computational-only platform, including ERA, addresses all three layers.

BothComputational biology groups combining wet-lab and in-silico workflows

Your team generates experimental data (scRNA-seq, proteomics, imaging) and analyses it computationally. ERA optimises the analysis code. AMGEL and VITALE ensure the upstream data generation is reproducible and documented. BioSkepsis verifies the literature grounding of your results and maps the citation network for your publications. Each system handles a distinct phase of the research lifecycle; none duplicates another's core job.

Frequently asked questions

Is BioSkepsis a competitor to Google DeepMind's ERA?

No. They solve fundamentally different problems. ERA is an AI system that writes and optimises scientific software to automate computational experiments. BioSkepsis is a researcher-facing tool for citation-grounded literature reasoning, citation verification, and reproducibility assurance across 40M+ curated papers. ERA generates code; BioSkepsis synthesises evidence. A researcher could use both: ERA to build and optimise computational models, BioSkepsis to ground those models in the published literature and verify citation integrity.

Can ERA perform literature synthesis or citation verification the way BioSkepsis does?

ERA can search scientific literature as input context for code generation, but it does not produce citation-grounded literature syntheses, citation network analysis, or citation verification. Its output is optimised code, not auditable evidence chains. BioSkepsis's seven-step citation verification pipeline and biology-native retrieval across 40M+ papers are designed specifically for literature reasoning.

Does ERA use biology-native retrieval like Gene Ontology and MeSH?

No. ERA uses LLM-driven literature search to gather research ideas that inform code generation. Its retrieval is oriented toward finding algorithmic approaches and methods, not biological concepts. BioSkepsis uses a biology-native knowledge graph weighted by Gene Ontology terms, MeSH descriptors, gene symbols, and pathway relationships for retrieval by biological mechanism.

Is ERA open-source?

Yes. The ERA codebase, including the Flat UCB Tree Search algorithm and benchmark notebooks, is available on GitHub at github.com/google-research/era. The Computational Discovery prototype built with ERA is available through a trusted tester programme in Google Labs. BioSkepsis offers open signup with a free tier and no credentialing requirement.

Which tool should I use for a systematic literature review in molecular biology?

BioSkepsis. ERA does not perform literature reviews; it writes scientific software. BioSkepsis's biology-native retrieval (Gene Ontology, MeSH, gene-level indexing), citation network analysis (Foundational, Hub, Bridge, Novel paper roles), full-text reasoning over methods and controls, and export to Zotero, BibTeX, and RIS are built for structured literature synthesis.

Can BioSkepsis write or optimise scientific code the way ERA does?

No. BioSkepsis generates testable hypotheses and suggests experimental methodologies based on synthesised literature, but it does not write executable code, optimise algorithms, or run computational experiments against scoring metrics. ERA's LLM-plus-tree-search architecture is purpose-built for iteratively generating and evaluating scientific software.

Can ERA replace a wet lab the way AMGEL does?

No. ERA operates entirely in silico, generating and optimising computational code. It does not design or execute physical experiments. AMGEL is robotic hardware that physically executes 860+ laboratory protocols with 24/7 autonomous operation. ERA automates computational experiments; AMGEL automates physical ones.

Try BioSkepsis free for biomedical literature synthesis and citation verification

Biology-native knowledge graph across 40M+ curated biomedical papers. Free tier with full-text reasoning, hypothesis generation, citation network analysis, citation verification, lab-result interpretation, and Zotero sync. No credentials required.

Start free

Sources & further reading

Aygün, E., Belyaeva, A., Comanici, G. et al. An AI system to help scientists write expert-level empirical software. Nature (2026). DOI: 10.1038/s41586-026-10658-6
Empirical Research Assistance (ERA): From Nature publication to catalyzing Computational Discovery - Google Research blog (May 2026) - research.google
Four ways Google Research scientists have been using Empirical Research Assistance - Google Research blog (April 2026) - research.google
Accelerating scientific discovery with AI-powered Empirical Research Assistance - Google Research blog (September 2025) - research.google
ERA open-source repository - github.com/google-research/era
Gemini for Science: AI experiments and tools for a new era of discovery - Google blog (May 2026) - blog.google
BioSkepsis features page - bioskepsis.ai/features
EFEVRE TECH LTD - AMGEL patent: USPTO 62,993,393; EPO EP21020160.4