CEIA Reinforcement Learning

university

Recent Activity

luanagbmartins updated a dataset about 2 hours ago

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy-GRPO_v3

luanagbmartins updated a dataset about 2 hours ago

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energy-exp1-dpo-offline_v3

luanagbmartins updated a dataset about 2 hours ago

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-GRPO_v3

models 14

CEIA-RL/energy-gpt-regulatorio-v2-GRPO

Updated 2 days ago • 6 • 1

CEIA-RL/energyv2-dpo-offline-GRPO

4B • Updated 6 days ago • 106

CEIA-RL/qwen3-4b-dw-lr-SLERP

Text Generation • 4B • Updated 19 days ago • 86

CEIA-RL/qwen3-4b-dw-lr-GRPO-mix-preference

Updated 19 days ago • 11

CEIA-RL/qwen3-4b-dw-lr-GRPO

Updated 19 days ago • 164

CEIA-RL/energy-exp1-dpo-offline

Text Generation • 4B • Updated 22 days ago • 202

CEIA-RL/energyv2-dpo-offline

Text Generation • 4B • Updated 23 days ago • 356

CEIA-RL/qwen3-4b-dw-lr-dpo-offline-energy-GRPO

Text Generation • 4B • Updated 29 days ago • 263

CEIA-RL/qwen3-4b-dw-lr-dpo-offline-energy

Text Generation • 4B • Updated May 6 • 192

CEIA-RL/Qwen3-4B-Instruct-2507

Text Generation • 4B • Updated May 4 • 5

datasets 13

CEIA-RL/energy-eval-filtered_evaluations_v3

Updated about 2 hours ago • 107

CEIA-RL/energy-eval-filtered_responses_multichoice_Qwen_Qwen3-4B_v3

Updated about 2 hours ago • 27

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy-GRPO_v3

Updated about 2 hours ago • 35

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energy-exp1-dpo-offline_v3

Updated about 2 hours ago • 44

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-GRPO_v3

Updated about 2 hours ago • 39

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy_v3

Updated about 2 hours ago • 44

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline_v3

Updated about 3 hours ago • 49

CEIA-RL/energy-eval-filtered_responses_multichoice_cemig-nlp-releases_enregy-gpt-regulatorio_v3

Updated about 3 hours ago • 53

CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energyv2-dpo-offline-GRPO_v3

Updated about 7 hours ago • 29

CEIA-RL/energy-eval-filtered_responses_multichoice_cemig-nlp-releases_enregy-gpt-regulatorio-v2_v3

Updated about 7 hours ago • 30
