rainprob / medusa Goto Github PK
View Code? Open in Web Editor NEWThis project forked from fasterdecoding/medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Home Page: https://sites.google.com/view/medusa-llm
License: Apache License 2.0