LTX-2: Efficient Joint Audio-Visual Foundation Model Paper โข 2601.03233 โข Published Jan 6 โข 149
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Paper โข 2504.17502 โข Published Apr 24, 2025 โข 55