Submitted by Xinge Liu
Stargazer: A Scalable Model-Fitting Benchmark Environment for AI Agents under Astrophysical Constraints
University of Toronto