arxiv:2507.03211
Liangyu Wang
ly4096
ยท
AI & ML interests
Efficient reinforcement learning (RL) for LLMs reasoning
Distributed training and inference of LLMs
Efficient algorithm and infrastructure design for LLMs
Recent Activity
submitted
a paper
about 2 hours ago
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers
upvoted
a
paper
about 4 hours ago
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers
upvoted
a
paper
2 days ago
FlashDP: Private Training Large Language Models with Efficient DP-SGD