CKeibel's Collections
Data
Updated Feb 13
HuggingFaceFW/fineweb-2 • Updated Oct 27, 2025 • 4.48B rows • 38.4k downloads • 775 likes
allenai/c4 • Updated Jan 9, 2024 • 10.4B rows • 617k downloads • 539 likes
ServiceNow-AI/R1-Distill-SFT • Updated Feb 8, 2025 • 1.85M rows • 2.13k downloads • 315 likes
PrimeIntellect/INTELLECT-2-RL-Dataset • Updated May 13, 2025 • 285k rows • 103 downloads • 66 likes
togethercomputer/RedPajama-Data-V2 • Updated Nov 21, 2024 • 6.1k downloads • 399 likes
wikimedia/wikipedia • Updated Jan 9, 2024 • 61.6M rows • 94.8k downloads • 1.17k likes
avemio/German-RAG-EMBEDDING-TRIPLES-HESSIAN-AI • Updated Oct 16, 2024 • 294k rows • 9 downloads • 1 like
urchade/synthetic-pii-ner-mistral-v1 • Updated Apr 20, 2024 • 293 downloads • 15 likes
yahma/alpaca-cleaned • Updated Apr 10, 2023 • 51.8k rows • 29.1k downloads • 802 likes