Quick Metric hack #2
Signed-off-by: Yang Wang <[email protected]>
…uests for Corresponding Kernels (vllm-project#17146) Signed-off-by: Chih-Chieh-Yang <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: Michael Goin <[email protected]>
…ter CI (vllm-project#17717) Signed-off-by: mgoin <[email protected]>
…le head (vllm-project#17740) Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
…n RoPE (vllm-project#17726) Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: SzymonOzog <[email protected]> Signed-off-by: SzymonOzog <[email protected]> Signed-off-by: Isotr0py <[email protected]> Co-authored-by: Isotr0py <[email protected]>
…oject#17735) Signed-off-by: evian <[email protected]> Co-authored-by: evian <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
…c on-device sampling (vllm-project#16357) Signed-off-by: Satyajith Chilappagari <[email protected]> Co-authored-by: Aaron Dou <[email protected]> Co-authored-by: Shashwat Srijan <[email protected]> Co-authored-by: Chongming Ni <[email protected]> Co-authored-by: Amulya Ballakur <[email protected]> Co-authored-by: Patrick Lange <[email protected]> Co-authored-by: Elaine Zhao <[email protected]> Co-authored-by: Lin Lin Pan <[email protected]> Co-authored-by: Navyadhara Gogineni <[email protected]> Co-authored-by: Yishan McNabb <[email protected]> Co-authored-by: Mrinal Shukla <[email protected]>
…#17758) Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: Yong Hoon Shin <[email protected]>
Signed-off-by: Yong Hoon Shin <[email protected]>
Signed-off-by: reidliu41 <[email protected]> Co-authored-by: reidliu41 <[email protected]>
vllm-project#17139) Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: Christian Heimes <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
…project#17793) Signed-off-by: Isotr0py <[email protected]>
…llm-project#14238) Signed-off-by: Akshat Tripathi <[email protected]> Signed-off-by: Chengji Yao <[email protected]> Co-authored-by: Chengji Yao <[email protected]>
…llm-project#17811) Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: Wallas Santos <[email protected]>
…t#17815) Signed-off-by: Aaron Pham <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
) Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: Chanh Nguyen <[email protected]> Co-authored-by: Chanh Nguyen <[email protected]>
Signed-off-by: Vadim Markovtsev <[email protected]>
… unquantizedMethod to reenable LLama4 BF16 (vllm-project#18205) Signed-off-by: tjtanaa <[email protected]>
Signed-off-by: NickLucche <[email protected]>
Signed-off-by: Lucia Fang <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
…-project#18229) Signed-off-by: Lucas Wilkinson <[email protected]>
…attention on ROCm (vllm-project#18093) Signed-off-by: kf <[email protected]>
Signed-off-by: lisiqi23 <[email protected]> Signed-off-by: skylee-01 <[email protected]> Co-authored-by: lisiqi23 <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
…llm-project#18209) Signed-off-by: Will Eaton <[email protected]>
…ce for V1 (vllm-project#17827) Signed-off-by: Lucia Fang <[email protected]>
Signed-off-by: David Xia <[email protected]>
vllm-project#17973) Signed-off-by: Vadim Gimpelson <[email protected]>
Signed-off-by: Seiji Eicher <[email protected]>
vllm-project#18214) Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Felix Marty <[email protected]>
Signed-off-by: learner0810 <[email protected]>
Signed-off-by: reidliu41 <[email protected]> Co-authored-by: reidliu41 <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
…ject#18211) Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: Bowen Wang <[email protected]> Co-authored-by: mgoin <[email protected]>
Signed-off-by: mgoin <[email protected]>
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀
…ect#26445) Signed-off-by: Nick Hill <[email protected]>
Updates #1. Just cherry-pick the one commit of this branch and run.
Download MTBench from https://github.com/SafeAILab/EAGLE/tree/main/eagle/data and point the command below to it.