Hospital "Fingerprint" Leads AI to Misdiagnose Cancer Tissue

The origin of a sample must not influence the result: Evaluation criteria for trustworthy clinical AI

29-Jun-2026

A new study by BIFOLD researchers at TU Berlin, in collaboration with the Berlin-based AI company Aignostics, Ludwig Maximilian University (LMU) in Munich, and the Netherlands Cancer Institute (NKI), shows that today’s AI models for pathology can often be influenced simply by the hospital from which the tissue sample being examined originates. The team developed “PathoROB,” the world’s first benchmark designed to measure this problem and help mitigate it. PathoROB is already in widespread use and is thus shaping the next generation of AI models for pathology. The study has now been published in *Nature Communications*.

Artificial intelligence is intended to help doctors diagnose and characterize cancer more quickly and accurately. So-called foundation models—large AI systems pre-trained on millions of microscopic tissue images—are increasingly being used for cancer detection, disease classification, and biomarker prediction in clinical workflows. The new study by the interdisciplinary research team now reveals a critical weakness in these models: Every pathology lab leaves a subtle signature on its tissue sections—differences in the preparation, staining, and digitization of biopsies. These differences are medically irrelevant, but they are visible to AI systems, and the models internalize them. The researchers demonstrated that current foundation models can identify the hospital of origin of a tissue section with an accuracy of 88 to 98 percent based on their learned feature representations. In some cases, a model’s internal “map” of the data was organized primarily by hospital and only secondarily by whether the tissue was healthy or cancerous.
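The idea of reading the hospital of origin out of learned feature representations can be illustrated with a small probe. The following is a minimal sketch on synthetic data, not the study's protocol: the two-dimensional toy embeddings, the nearest-centroid classifier, and all function names (`make_embeddings`, `fit_centroids`, `predict`) are assumptions made for illustration.

```python
import math
import random


def make_embeddings(n_per_group=50, seed=0):
    """Toy 2-D embeddings: axis 0 carries biology, axis 1 a hospital offset."""
    rng = random.Random(seed)
    samples = []
    for hospital in (0, 1):
        for is_tumor in (0, 1):
            for _ in range(n_per_group):
                vec = (
                    is_tumor + rng.gauss(0, 0.3),      # biological signal
                    3 * hospital + rng.gauss(0, 0.3),  # hypothetical site "fingerprint"
                )
                samples.append((vec, hospital))
    rng.shuffle(samples)
    return samples


def fit_centroids(samples):
    """Compute the mean embedding for each hospital label."""
    grouped = {}
    for vec, label in samples:
        grouped.setdefault(label, []).append(vec)
    return {
        label: tuple(sum(axis) / len(vecs) for axis in zip(*vecs))
        for label, vecs in grouped.items()
    }


def predict(centroids, vec):
    """Assign the hospital whose centroid is closest to the embedding."""
    return min(centroids, key=lambda label: math.dist(vec, centroids[label]))


samples = make_embeddings()
split = len(samples) // 2
train, test = samples[:split], samples[split:]
centroids = fit_centroids(train)
accuracy = sum(predict(centroids, vec) == label for vec, label in test) / len(test)
print(f"hospital-of-origin probe accuracy: {accuracy:.2f}")
```

If a probe this simple recovers the hospital reliably, the embedding encodes site information that a downstream classifier could also pick up as a shortcut.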

Hidden Hospital “Fingerprints” in the Models

The consequences can be serious. In one particularly striking example, an AI model learned to use the hospital signature as a shortcut for its decisions. As a result, it incorrectly classified a clearly malignant tissue sample as healthy—solely because the sample came from a hospital that had historically sent almost exclusively healthy samples and which the model had therefore associated with healthy tissue.

To quantify this problem, the researchers developed PathoROB, the first publicly available benchmark specifically designed to assess the robustness of pathology foundation models to technical variation. It combines four datasets comprising approximately 100,000 tissue sections across 28 biological classes from 34 medical centers. In addition, it introduces a new “robustness index” that quantifies the extent to which a model’s internal representation is determined by biology rather than by hospital artifacts.
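One way to make the notion of a robustness index concrete is a nearest-neighbor count in embedding space. The sketch below is an assumption for illustration, not the definition used in the study: for each sample it looks at the `k` nearest neighbors and compares how many share the biological class but come from another hospital with how many share the hospital but belong to another class. The function names, the value of `k`, and the synthetic data are all hypothetical.

```python
import math
import random


def robustness_index(embeddings, bio_labels, hospital_labels, k=5):
    """Share of informative neighbors that match on biology rather than hospital."""
    same_bio_other_hosp = 0
    other_bio_same_hosp = 0
    for i, query in enumerate(embeddings):
        # Rank all other samples by Euclidean distance to the query.
        order = sorted(
            (j for j in range(len(embeddings)) if j != i),
            key=lambda j: math.dist(query, embeddings[j]),
        )
        for j in order[:k]:
            same_bio = bio_labels[j] == bio_labels[i]
            same_hosp = hospital_labels[j] == hospital_labels[i]
            if same_bio and not same_hosp:
                same_bio_other_hosp += 1
            elif same_hosp and not same_bio:
                other_bio_same_hosp += 1
    total = same_bio_other_hosp + other_bio_same_hosp
    return same_bio_other_hosp / total if total else float("nan")


def toy_embeddings(bio_scale, hospital_scale, n_per_group=20, seed=0):
    """Synthetic 2-D embeddings with adjustable biology and hospital signal."""
    rng = random.Random(seed)
    vectors, bio, hospitals = [], [], []
    for hospital in (0, 1):
        for bio_class in (0, 1):
            for _ in range(n_per_group):
                vectors.append((
                    bio_scale * bio_class + rng.gauss(0, 0.3),
                    hospital_scale * hospital + rng.gauss(0, 0.3),
                ))
                bio.append(bio_class)
                hospitals.append(hospital)
    return vectors, bio, hospitals


# Embedding organized mainly by biology versus mainly by hospital.
robust_score = robustness_index(*toy_embeddings(bio_scale=5.0, hospital_scale=0.5))
biased_score = robustness_index(*toy_embeddings(bio_scale=0.5, hospital_scale=5.0))
print(f"biology-dominated: {robust_score:.2f}, hospital-dominated: {biased_score:.2f}")
```

Under this toy definition, a value near 1 means neighborhoods are organized by biology and a value near 0 means they are organized by hospital.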

When applied to 20 widely used foundation models, PathoROB identified shortcomings in every single model. Larger models trained on more diverse data, as well as models that combine image data with text reports (vision-language models), achieved the best results. The researchers also tested various post-processing methods for “robustification” and found that these can significantly reduce, though not entirely eliminate, the risk of such errors, without requiring costly retraining of the underlying model.
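To show what post-processing without retraining can look like in principle, the sketch below subtracts each hospital's mean embedding from its samples. This per-hospital centering is a deliberately simple stand-in chosen for illustration; it is not claimed to be one of the methods evaluated in the study, and the function name `center_per_hospital` and the example values are hypothetical.

```python
from collections import defaultdict


def center_per_hospital(embeddings, hospital_labels):
    """Subtract each hospital's mean embedding from that hospital's samples."""
    sums = {}
    counts = defaultdict(int)
    for vec, hosp in zip(embeddings, hospital_labels):
        if hosp not in sums:
            sums[hosp] = [0.0] * len(vec)
        for d, value in enumerate(vec):
            sums[hosp][d] += value
        counts[hosp] += 1
    means = {h: [s / counts[h] for s in sums[h]] for h in sums}
    return [
        [value - means[hosp][d] for d, value in enumerate(vec)]
        for vec, hosp in zip(embeddings, hospital_labels)
    ]


# Two classes per hospital; hospital "A" adds a constant offset of 10 on axis 1.
embeddings = [[1.0, 10.0], [3.0, 12.0], [1.0, 0.0], [3.0, 2.0]]
hospitals = ["A", "A", "B", "B"]
centered = center_per_hospital(embeddings, hospitals)
print(centered)
# → [[-1.0, -1.0], [1.0, 1.0], [-1.0, -1.0], [1.0, 1.0]]
```

After centering, matching samples from the two hospitals coincide, so the constant site offset no longer separates them. The example assumes both hospitals contribute the same mix of classes; if one site sends mostly healthy tissue, its mean also carries biological signal and naive centering would remove part of it, which is consistent with the article's point that such corrections reduce the risk but do not eliminate it.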

“Foundation models for pathology are evolving rapidly, and that’s extremely exciting. However, our results show that strong performance on a standard benchmark is not enough to trust a model in clinical use,” says Julius Hense, co-first author of the study and a researcher at BIFOLD and TU Berlin. “PathoROB provides developers and clinical users with a tool to verify whether a model has actually learned biological relationships or has merely recognized which hospital a specimen comes from.”

Shaping the Next Generation of Pathology AI

PathoROB is already changing the way AI for pathology is developed and evaluated. Aignostics’ next-generation foundation model, “Atlas 2,” developed in collaboration with the Mayo Clinic in the U.S., was specifically designed to address the trade-offs between performance and robustness identified by PathoROB. Furthermore, PathoROB is increasingly establishing itself as the gold standard for evaluating the robustness of foundation models. New models and platforms such as “Histoboard” now present their PathoROB results as one of the evaluation metrics to directly compare pathology AI models with one another.

By making the benchmark, the datasets, and the source code publicly available, the researchers hope to establish robustness evaluation as an integral part of the validation of biomedical foundation models, before these are used to support clinical decisions and thus potentially influence patient treatments.

Original publication

Jonah Kömen, Edwin D. de Jong, Julius Hense, et al.; "Towards robust foundation models for digital pathology"; Nature Communications, Volume 17, 2026-6-11

https://www.analytica-world.com/en/news/1189055/hospital-fingerprint-helps-ai-identify-misdiagnoses-in-cancer-tissue.html
