Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Nitesh Kumar Sharma's picture
2 7 2

Nitesh Kumar Sharma

carbene101
21world's profile picture cmpatino's profile picture ngsnethawarya's profile picture
ยท
  • 95-percent-ci

AI & ML interests

LLMs, OCR

Recent Activity

reacted to sergiopaniego's post with ๐Ÿ”ฅ 3 days ago
New TRL + OpenEnv example! ๐Ÿ’ฅ Fine tune an LLM for playing Sudoku using an RL env via OpenEnv Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook. Enjoy! Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py
upvoted a paper 2 months ago
Architecture Decoupling Is Not All You Need For Unified Multimodal Model
reacted to sergiopaniego's post with ๐Ÿ”ฅ 2 months ago
we've just added several example scripts to TRL showing how to train models with GRPO using some of the new OpenEnv environments train a model to interact with a browser (๐ŸŽฎ BrowserGym Env), play Wordle (๐ŸŽฎ Wordle Env) and moooore! TRL (GRPO + vLLM) + OpenEnv! โšก๏ธ ๐Ÿ“ go play with them: https://github.com/huggingface/trl/tree/main/examples/scripts/openenv ๐Ÿ“ examples list: https://huggingface.co/docs/trl/main/en/example_overview#scripts
View all activity

Organizations

Hugging Face Discord Community's profile picture

carbene101 's models

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs