Back into Plato's Cave: Examining Cross-modal Representational Convergence at Scale Paper • 2604.18572 • Published Apr 20 • 1
VGGSounder: Audio-Visual Evaluations for Foundation Models Paper • 2508.08237 • Published Oct 18, 2025 • 1
VGGSounder: Audio-Visual Evaluations for Foundation Models Paper • 2508.08237 • Published Oct 18, 2025 • 1
VGGSounder: Audio-Visual Evaluations for Foundation Models Paper • 2508.08237 • Published Oct 18, 2025 • 1 • 1