Catching Confident Hallucinations from the Inside: A Linear Probe on a Model's Own Activations, and Where It Stops Working assafpet • about 8 hours ago
Hidden-State Hallucination Probes Beat Confidence Baselines — And the Gap Grows with Scale sarimahsan101 • about 10 hours ago
Left vs. Right Alignment, A case study from porting LeRobot PI05's subtask prediction from JAX to PyTorch jorgemunozl • 2 days ago • 1
Building an agent that researches competitors and ships Meta ad creatives mesmertech • 3 days ago • 1
Teaching Machines to Read Silicon: An Open, Generated Dataset of SoC RTL hasankursun • 3 days ago • 1
80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your laptop hugging-science • 3 days ago • 17
🔧 L'architecture est un seuil, pas un levier — ce que j'ai appris en optimisant un LLM français de 15M de paramètres 🇫🇷 RDTvlokip • 3 days ago • 1
🔧 Architecture is a threshold, not a lever — what I learned optimizing a 15M French LLM 🇫🇷 RDTvlokip • 3 days ago • 1