22 May 2024 | George Shaikovski, Adam Casson, Kristen Severson, Eric Zimmermann, Yi Kan Wang, Jeremy D. Kunz, Juan A. Retamero, Gerard Oakley, David Klimstra, Christopher Kanan, Matthew Hanna, Michal Zelechowski, Julian Viret, Neil Tenenholtz, James Hall, Nicolò Fusi, Razik Yousfi, Peter Hamilton, William A. Moye, Eugene Vorontsov, Siqi Liu, Thomas J. Fuchs
PRISM is a multi-modal generative foundation model designed for slide-level histopathology, leveraging clinical report text for pre-training. The model addresses the mismatch between clinical analysis, which operates at the level of whole slide images (WSIs), and existing foundation models that process individual image tiles separately. By aggregating tile embeddings into a single slide embedding, PRISM can generate clinical reports and perform zero-shot cancer detection and sub-typing with performance approaching or surpassing supervised aggregator models. Additionally, fine-tuning PRISM's slide encoder yields label-efficient training for biomarker prediction, even with limited training data. The model is pre-trained using 587,196 WSIs and 195,344 associated clinical text reports, demonstrating its effectiveness in various downstream tasks such as cancer detection, tissue sub-typing, and biomarker prediction. PRISM's capabilities include generating text-based diagnosis reports, zero-shot prediction, and slide-level linear classification, making it a versatile tool for computational pathology.
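The zero-shot prediction described above follows the usual contrastive vision-language recipe: a slide embedding is compared against text-prompt embeddings for each candidate class, and the most similar prompt wins. As a minimal sketch, the snippet below implements that cosine-similarity step with random stand-in vectors; `zero_shot_classify` is a hypothetical helper, and in practice the embeddings would come from PRISM's slide encoder and its paired text encoder rather than from a random generator.

```python
import numpy as np

def zero_shot_classify(slide_embedding, prompt_embeddings):
    """Return the index of the class whose text-prompt embedding has the
    highest cosine similarity to the slide embedding, plus all scores."""
    slide = slide_embedding / np.linalg.norm(slide_embedding)
    prompts = prompt_embeddings / np.linalg.norm(
        prompt_embeddings, axis=1, keepdims=True
    )
    scores = prompts @ slide  # cosine similarities, one per class prompt
    return int(np.argmax(scores)), scores

# Toy example with random stand-in embeddings (assumed dimensionality).
rng = np.random.default_rng(0)
dim = 8
# e.g. embeddings of prompts like "benign tissue" vs. "carcinoma"
prompts = rng.normal(size=(2, dim))
# Construct a slide embedding that lies close to class 1.
slide = prompts[1] + 0.1 * rng.normal(size=dim)
label, scores = zero_shot_classify(slide, prompts)
```

Because the toy slide embedding is built near the second prompt, the classifier selects class 1; with real PRISM embeddings the same argmax-over-similarities logic yields the zero-shot cancer detection and sub-typing results the abstract reports.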