What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
updated
a dataset
1 day ago
JackBAI/jack-latest-vllm-stack
published
a dataset
1 day ago
JackBAI/jack-latest-vllm-stack
authored
a paper
7 days ago
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning