AI Safety Research

AISafety
https://humanaligned.ai

AI & ML interests

LLMs, planning, EA

Recent Activity

upvoted a collection 1 day ago
Mistral Small 4
liked a model 5 days ago
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
liked a dataset about 2 months ago
LightningRodLabs/future-as-label-paper-training-dataset

Organizations

Hugging Face Discord Community

AISafety's collections (3)

Model building
  • Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

    Paper • 2511.06221 • Published Nov 9, 2025 • 133
Inference efficiency
  • The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

    Paper • 2402.17764 • Published Feb 27, 2024 • 626
Safety and transparency
  • OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

    Paper • 2504.07096 • Published Apr 9, 2025 • 77