FasterDecoding

community
https://github.com/FasterDecoding

AI & ML interests

Making model inference more efficient through model-system co-design.

Recent Activity

Gsunshine  authored a paper about 19 hours ago
Representation Fréchet Loss for Visual Generation
tianlecai  authored a paper 9 months ago
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
tianlecai  authored a paper about 2 years ago
SnapKV: LLM Knows What You are Looking for Before Generation

Tianle Cai, Zhengyang Geng, James Liu
Organization Card

Think deeper, decode faster

models 8

FasterDecoding/BitDelta_Mistral_combo

Updated Feb 14, 2024

FasterDecoding/medusa-1.0-vicuna-13b-v1.5

Text Generation • Updated Jan 25, 2024 • 14 • 1

FasterDecoding/medusa-1.0-vicuna-33b-v1.3

Text Generation • Updated Dec 18, 2023 • 13

FasterDecoding/medusa-1.0-zephyr-7b-beta

Text Generation • Updated Dec 18, 2023 • 46 • 1

FasterDecoding/medusa-v1.0-vicuna-7b-v1.5

Text Generation • Updated Oct 29, 2023 • 1.06k

FasterDecoding/medusa-vicuna-33b-v1.3

Updated Sep 11, 2023 • 58 • 4

FasterDecoding/medusa-vicuna-13b-v1.3

Updated Sep 11, 2023 • 138 • 5

FasterDecoding/medusa-vicuna-7b-v1.3

Updated Sep 11, 2023 • 1.7k • 17

datasets 0

None public yet