ACM Conference Paper

DISSECT: Diagnostic Evaluation
of Scientific Visual Reasoning

A five-mode evaluation framework that decomposes VLM failures into perception, reasoning, and language-prior components across Biology and Chemistry visual question answering.

5
Evaluation Modes
186
Total Questions
2
Subjects
7
Prompt Templates
DISSECT_appendix.pdf
Download PDF

Dataset Explorer

0Total
0Biology
0Chemistry