Small Datasets in Personalized Medicine—Integrating Cross-Center Knowledge in Foundation Models

Challenge: Extracting the Needle from the Haystack Small Datasets

  • Small patient collectives occur in personalized medicine and in rare diseases. Powerful deep learning methods require large amounts of data.
  • Additionally, images in histology vary between clinics and this heterogeneity needs to be covered during training. 
  • Find out more in our podcast episode: AI-Based Precision Pathology: Learning with Small Amounts of Data

 

Solutions and Projects

The image Efficient training of foundation models for multiple tasks simultaneously as presented in the paper "Overcoming Data Scarcity in Biomedical Imaging with a Foundational Multi-Task Model"a joint visualization of differently stained images from histopathology (HE & IHC).
Efficient training of foundation models using multitask-learning. (https://doi.org/m9b5, CC BY)

Training of Use-Case-Specific Foundation Models

  • Models for routine use need to be robust even if only small datasets are available for training.
  • Foundation models trained on large cohorts provide a basis for covering the required data diversity.
  • We build foundation models specifically for applications in pathology and radiology. [1]
  • Tissue Concepts is a specialized model for pathology aiming at cross-center generalizability and interactive training of AI models. [2]


References

[1] Overcoming Data Scarcity in Biomedical Imaging with a Foundational Multi-Task Model. Schäfer et al. 2023. article

[2] Tissue concepts: Supervised foundation models in computational pathology. Nicke et al. 2025. article

 

Foundation models can be adapted interactively using few annotations or fine-tuned to specific research questions.
© Fraunhofer MEVIS
Foundation models can be adapted interactively using few annotations.

Quick Training of Custom Models with Small Datasets

  • We build robust models based small datasets data or few annotations
  • Models can be trained interactively based on pre-computed image features 

(Demo Video: A foundation-model-based few-shot segmentation),  (DGP 2024)

 

 

 

Predicting the outcome from medical images to develop novel digital biomarkers or patient-specific risk scores.
Predicting outcome from medical images to develop novel digital biomarkers or patient-specific risk scores. (https://doi.org/g8zccp, CC BY)

Predicting Outcome from Patient-Level Labels

  • We build models based on clinical data to detect new biomarkers and to predict outcome
  • Most accurate detector of colorectal cancer in the Semicol-challenge and overall second place!

[1] prosurvival.org

[2] semicol.org

[3] Tissue concepts: Supervised foundation models in computational pathology. Nicke et al. 2025. article

 

Federated learning allows scientists to collaborate in the training of joint AI models, e.g., for survival prediction in prostate cancer.
Federated learning allows scientists to collaboratively train joint AI models, e.g., for survival prediction in cancer.

Prostate Survival Prediction and Biomarker Discovery

  • We predict recurrence-free survival time in prostate cancer and analyze predictive patterns in tissue morphology.
  • We develop a cooperative federated learning platform that allows scientists to collaborate between clinical centers.
  • We collaborate on the standardization of data exchange formats for survival studies [1].
  • 2nd place in the LEOPARD challenge for prostate cancer risk prediction [2].
     

References

[1] PROSurvival: A Technical Case Report on Creating and Publishing a Dataset for Federated Learning on Survival Prediction of Prostate Cancer Patients. Xu et al. 2021. article

[2] Leopard

[3] prosurvival.org

 

Contact

Contact Press / Media

Dr.-Ing. Henning Höfener

Principal Scientist Computational Pathology

Fraunhofer Institute for Digital Medicine MEVIS
Max-von-Laue-Str. 2
28359 Bremen

Phone +49 421 17879 2147

Contact Press / Media

Dr. rer. nat. Johannes Lotz

Principal Scientist Computational Pathology, Image Registration

Fraunhofer Institute for Digital Medicine MEVIS
Maria-Goeppert-Str. 3
23562 Lübeck

Phone +49 451 3101-6118

Fax +49 451 3101-6104

Contact Press / Media

Jan Raphael Schäfer

Research Engineer Clinical Decision Support

Fraunhofer Institute for Digital Medicine MEVIS
Max-von-Laue-Str. 2
28359 Bremen

  • Send email

Contact Press / Media

Till Nicke

Fraunhofer Institute for Digital Medicine MEVIS
Max-von-Laue-Str. 2
28359 Bremen