Research | multimodal LLM | cystoscopy | OpenAI o3 | in-context learning

Multimodal LLMs Evaluate Cystoscopy Image Interpretation

jmir.org | January 29, 2026

Relevance Score: 7.2

A 2026 study evaluates four multimodal LLMs (OpenAI-o3, ChatGPT-4o, Gemini 2.5 Pro, and MedGemma-27B) on two clinician-defined cystoscopy stress-test datasets: a 401-image free-text task and a 113-image, 7-class classification task. OpenAI-o3 showed the best overall balance, with 88.3% lesion-detection accuracy, 92% sensitivity, 73.1% specificity, and 73.5% biopsy-classification accuracy. The authors conclude that multimodal LLMs (MM-LLMs) offer assistive, interpretable outputs but require further optimization before clinical deployment.
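
For readers less familiar with the reported metrics, the following is a minimal sketch of how accuracy, sensitivity, and specificity are conventionally derived from binary confusion-matrix counts. The counts used here are hypothetical and chosen only for illustration; they are not taken from the study, and the names `ConfusionCounts` and `detection_metrics` are assumptions introduced for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary lesion-detection outcomes (hypothetical container)."""
    tp: int  # lesion present, model flagged a lesion
    fn: int  # lesion present, model missed it
    tn: int  # no lesion, model reported none
    fp: int  # no lesion, model flagged one anyway


def detection_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Standard definitions of accuracy, sensitivity, and specificity."""
    total = c.tp + c.fn + c.tn + c.fp
    return {
        "accuracy": (c.tp + c.tn) / total,        # all correct calls / all images
        "sensitivity": c.tp / (c.tp + c.fn),      # true-positive rate
        "specificity": c.tn / (c.tn + c.fp),      # true-negative rate
    }


# Illustrative counts only; NOT data from the study.
counts = ConfusionCounts(tp=80, fn=20, tn=90, fp=10)
metrics = detection_metrics(counts)

for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

A gap between sensitivity and specificity like the one reported for OpenAI-o3 means the model misses few lesions but flags a larger share of normal images, which is one reason a single accuracy figure does not tell the whole story.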
