Hugging Face
CEIA Reinforcement Learning
university
Followers: 7
AI & ML interests
None defined yet.
Recent Activity
luanagbmartins updated a model about 22 hours ago: CEIA-RL/qwen3-4b-dw-lr-dpo
luanagbmartins updated a dataset 2 days ago: CEIA-RL/Safety-Preference-Energy
luanagbmartins published a dataset 2 days ago: CEIA-RL/Safety-Preference-Energy
Team members
5
spaces
1
LLMasJudgeEval 🥇 • Pinned • Sleeping • Agents
models
4
Sort: Recently updated
CEIA-RL/qwen3-4b-dw-lr-dpo • Text Generation • 4B • Updated about 4 hours ago • 1.57k
CEIA-RL/qwen3-4b-dw-lr-hf-dpo • Text Generation • 4B • Updated 10 days ago • 6.88k
CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B • Updated 10 days ago • 58
CEIA-RL/qwen3-4b-dw-lr-dpo-offline • Text Generation • 4B • Updated 23 days ago • 633
datasets
16
Sort: Recently updated
CEIA-RL/Safety-Preference-Energy • Viewer • Updated 2 days ago • 48.2k • 13
CEIA-RL/questions-Gemma4-31B • Viewer • Updated 7 days ago • 20.9k • 26
CEIA-RL/questions-GPT-OSS-120B-RL • Viewer • Updated 7 days ago • 4.3k • 60
CEIA-RL/questions-Gemma4-31B-rl-source • Viewer • Updated 7 days ago • 20.9k • 29
CEIA-RL/questions-GPT-OSS-120B-RL-source • Viewer • Updated 8 days ago • 4.3k • 22
CEIA-RL/questions-GPT-OSS-120B • Viewer • Updated 13 days ago • 21.5k • 49
CEIA-RL/Synthetic-Questions-Energy • Viewer • Updated 15 days ago • 18.2k • 33
CEIA-RL/Safety-Questions-Energy • Viewer • Updated 15 days ago • 53.1k • 70
CEIA-RL/synth_regulacao_eng_qa_v0 • Viewer • Updated 29 days ago • 2.32k • 31
CEIA-RL/QA-Energy • Viewer • Updated 29 days ago • 43 • 39