CHAMMI-75: pre-training multi-channel models with heterogeneous microscopy images

ICLR 2026 Conference Submission (Anonymous Authors)
Keywords: microscopy, representation learning, multi-channel imaging, self-supervised learning, biology
Abstract:

Quantifying cell morphology from images with machine learning has proven to be a powerful tool for studying the response of cells to treatments. However, the models used to quantify cellular morphology are typically trained on a single microscopy imaging type under controlled experimental conditions. This results in specialized models that cannot be reused across biological studies, either because the technical specifications do not match (e.g., different numbers of channels) or because the target experimental conditions are out of distribution. Here, we present CHAMMI-75, a dataset of 2.8M heterogeneous, multi-channel microscopy images from 75 diverse biological studies. We curated this resource from publicly available sources to investigate cellular morphology models that are channel-adaptive and can process any microscopy image type. Our experiments show that training with CHAMMI-75 improves performance on multi-channel bioimaging tasks, opening the way to the next generation of cellular morphology models for biological studies.

Disclaimer
This report is AI-generated using large language models and WisPaper (a scholarly search engine). It analyzes a paper's tasks and contributions against retrieved prior work. While the system identifies potential overlaps and novel directions, its coverage is not exhaustive and its judgments are approximate. The results are intended to assist human reviewers and should not be relied upon as a definitive verdict on novelty.
Note that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces CHAMMI-75, a large-scale dataset of 2.8M multi-channel microscopy images from 75 biological studies, and demonstrates its utility for pre-training channel-adaptive models. It resides in the Channel-Adaptive Vision Transformers leaf, which contains four papers total. This is a relatively sparse research direction within the broader Self-Supervised Pre-Training Architectures branch, suggesting the problem of handling variable channel configurations in microscopy remains under-explored compared to supervised segmentation methods or phenotypic profiling pipelines.

The taxonomy reveals neighboring work in Masked Autoencoding for Multi-Channel Images and Contrastive and Paired-Cell Learning, both exploring self-supervised objectives but without explicit channel-adaptive mechanisms. Supervised Segmentation and Detection methods like Cellpose Multi-Modality assume fixed channel counts, while Phenotypic Profiling and Drug Discovery Applications focus on extracting biological features rather than architectural flexibility. CHAMMI-75 bridges these areas by providing a foundation for models that generalize across imaging protocols, diverging from task-specific supervised approaches and complementing self-supervised methods that lack channel adaptability.

Of the 30 candidates examined (10 per contribution), the dataset contribution yielded one refutable candidate, indicating that some prior multi-channel microscopy collections exist but may differ in scale or diversity. The benchmarking contribution and the systematic experimental evaluation each yielded zero refutations among their 10 candidates, suggesting these aspects are less directly addressed in prior literature. Because the search is limited to the top-30 semantic matches rather than exhaustive coverage, additional relevant datasets or evaluation frameworks may exist beyond this analysis window.

Based on the top-30 semantic search results, CHAMMI-75 appears to occupy a moderately novel position by combining large-scale heterogeneous data curation with channel-adaptive pre-training experiments. The dataset contribution has some overlap with existing resources, while the benchmarking and evaluation aspects show less direct prior work within the examined candidates. The sparse Channel-Adaptive Vision Transformers leaf suggests this research direction is still emerging, though the analysis does not cover all possible related work in computer vision or biomedical imaging.

Taxonomy

Core-task Taxonomy Papers: 30
Claimed Contributions: 3
Contribution Candidate Papers Compared: 30
Refutable Papers: 1

Research Landscape Overview

Core task: pre-training multi-channel models for cellular morphology analysis.

The field encompasses diverse approaches to learning from multi-channel microscopy images, organized into several major branches. Self-Supervised Pre-Training Architectures explores how to leverage unlabeled data through channel-adaptive vision transformers and masked reconstruction strategies. Supervised Segmentation and Detection methods rely on annotated datasets to delineate cellular structures, while Generative Modeling and Perturbation Prediction aims to synthesize realistic morphologies or predict cellular responses to interventions. Phenotypic Profiling and Drug Discovery Applications emphasizes extracting biologically meaningful features for compound screening, and Specialized Imaging Modalities and Reconstruction addresses technical challenges in advanced microscopy. Clinical and Pathological Applications and Specialized Biological Contexts round out the taxonomy by targeting domain-specific problems such as cancer diagnostics or developmental biology.

Within Self-Supervised Pre-Training Architectures, a small cluster of recent works investigates how vision transformers can handle variable numbers of imaging channels without retraining. CHAMMI-75[0] sits squarely in this Channel-Adaptive Vision Transformers niche, alongside Scaling Channel Adaptive[2] and Isolated Channel ViT[3], all exploring flexible encoder designs that generalize across different staining protocols. These methods contrast with earlier supervised pipelines like Cellpose Multi-Modality[14] or phenotypic profiling tools such as PhenoProfiler[7], which typically assume fixed channel configurations. A key open question is whether channel-adaptive pre-training can match or exceed task-specific supervised models when fine-tuned on downstream segmentation or profiling tasks, and how best to balance architectural flexibility with computational efficiency.

CHAMMI-75[0] emphasizes scalable pre-training on diverse datasets, positioning itself as a foundation-model approach compared to the more narrowly scoped architectures in Isolated Channel ViT[3].
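As a rough illustration of the shared idea behind these channel-adaptive encoders (a minimal sketch, not the method of any specific paper: `patchify`, `embed_channels`, and `W_embed` are hypothetical names introduced here), a single patch-projection matrix applied per channel lets one encoder accept any number of input channels:

```python
import numpy as np

def patchify(channel, patch=16):
    """Split a single (H, W) channel into flattened non-overlapping patches."""
    H, W = channel.shape
    rows, cols = H // patch, W // patch
    p = channel[:rows * patch, :cols * patch].reshape(rows, patch, cols, patch)
    return p.transpose(0, 2, 1, 3).reshape(rows * cols, patch * patch)

def embed_channels(image, W_embed, patch=16):
    """Channel-adaptive stem sketch: every channel is patchified and projected
    with the SAME weight matrix, so the model accepts any number of channels
    (1, 5, 14, ...) without retraining the embedding layer."""
    tokens = [patchify(c, patch) @ W_embed for c in image]  # image: (C, H, W)
    return np.stack(tokens)                                 # (C, patches, d)

rng = np.random.default_rng(0)
W_embed = rng.normal(size=(16 * 16, 32))       # shared projection, d_model=32
for n_channels in (1, 5, 14):                  # mimic heterogeneous studies
    img = rng.normal(size=(n_channels, 64, 64))
    assert embed_channels(img, W_embed).shape == (n_channels, 16, 32)
```

The key design choice this sketch highlights is weight sharing across channels: because no dimension of `W_embed` depends on the channel count, images from studies with different staining protocols can flow through the same stem.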

Claimed Contributions

CHAMMI-75 dataset of heterogeneous multi-channel microscopy images

The authors curated CHAMMI-75, a dataset containing 2.8 million multi-channel microscopy images from 75 diverse biological studies. This resource integrates heterogeneous sources with varying numbers of channels (1-7+), organisms, cell lines, and microscopy modalities to enable training of channel-adaptive cellular morphology models.

Retrieved papers: 10 (verdict: can refute)
New benchmarks for multi-channel model evaluation

The authors created two new evaluation benchmarks (CellPHIE with 14-channel images and RBC-MC for cross-domain generalization) alongside adopting existing ones. These benchmarks test model performance on novel channel configurations and imaging modalities not seen during pre-training.

Retrieved papers: 10
Systematic experimental evaluation of CHAMMI-75 for pre-training

The authors performed comprehensive scaling experiments comparing bag-of-channels versus multi-channel attention approaches, different SSL algorithms, and model sizes. Their results demonstrate that pre-training with CHAMMI-75 improves performance across diverse biological tasks and enables strong generalization to novel channel combinations.

Retrieved papers: 10
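The bag-of-channels versus multi-channel-attention comparison can be pictured with a toy sketch (assumptions: `encode` is a mean-pooling stand-in for a shared transformer encoder, and all shapes are illustrative, not the paper's actual architecture):

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, n_patches = 32, 16

def encode(tokens):
    """Stand-in for a shared encoder: here just mean pooling over tokens."""
    return tokens.mean(axis=0)

# Toy per-channel token sequences for one 5-channel image.
channel_tokens = [rng.normal(size=(n_patches, d_model)) for _ in range(5)]

# Bag-of-channels: encode each channel independently, then pool the results.
# Channels never interact inside the encoder.
bag_feature = np.mean([encode(t) for t in channel_tokens], axis=0)

# Multi-channel attention (sketch): concatenate all channel tokens into one
# long sequence, so a real attention encoder could mix information across
# channels at every layer.
joint_feature = encode(np.concatenate(channel_tokens, axis=0))

assert bag_feature.shape == joint_feature.shape == (d_model,)
```

The trade-off the sketch makes visible: bag-of-channels scales linearly in the channel count and trivially handles unseen channel combinations, while the joint-sequence route lets attention model cross-channel interactions at the cost of sequence length growing with the number of channels.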

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution 1: CHAMMI-75 dataset of heterogeneous multi-channel microscopy images
Contribution 2: New benchmarks for multi-channel model evaluation (CellPHIE, RBC-MC)
Contribution 3: Systematic experimental evaluation of CHAMMI-75 for pre-training