AI & ML interests

A one-year-long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

christopher 
in bigscience/bloom about 1 month ago

[SPAM] Deleted

3
#289 opened about 1 month ago by
sarthak-saxena
stas 
posted an update about 1 month ago
Good news! Ulysses Sequence Parallelism, from the Snowflake AI Research and DeepSpeed teams, has been integrated into
Hugging Face Trainer, Accelerate, and TRL.

For extensive details please see this writeup:
https://huggingface.co/blog/ulysses-sp

Thanks a lot to Kashif Rasul for helping make it happen, and to the others on the HF team who helped with the integration.
christopher 
in bigscience/bloom about 1 month ago

pretokenizer Regex issues?

8
#278 opened almost 2 years ago by
hpcpony
christopher 
in bigscience/bloom about 1 month ago

Test PR

#286 opened about 1 month ago by
FIRSTACCOUNT69

Test discussion

#287 opened about 1 month ago by
FIRSTACCOUNT69

Test discussion

#288 opened about 1 month ago by
FIRSTACCOUNT69