Shan Chen

shanchen

·

https://shanchen.dev/

AI & ML interests

I train and eval pretty ok

Organizations

published an article 6 months ago

Article

Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments

shanchen

•

Jan 13

• 10

published an article 9 months ago

Article

Budget Alignment: Making Models Reason in the User’s Language

shanchen

•

Nov 4, 2025

• 12

published an article over 1 year ago

Article

What We Learned About LLM/VLMs in Healthcare AI Evaluation:

shanchen

•

Nov 8, 2024

• 16