AI & ML interests
I train and eval pretty ok
Organizations
Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments
Budget Alignment: Making Models Reason in the User’s Language
shanchen
Article published over 1 year ago: What We Learned About LLM/VLMs in Healthcare AI Evaluation:
shanchen