Skip to content
View Hongbosherlock's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Hongbosherlock

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Hongbosherlock/README.md

Hi, I'm Hongbosherlock.

  • 📖 I graduated from the University of Chinese Academy of Sciences
  • 🔭 I’m currently focusing on inference and compression of LLM (quantization, pruning).
  • 🌱 I’m currently learning CUDA and C++.
  • 👯 I’m looking to collaborate on LLM infra.
  • 💬 Ask me about LLM quantization and inference.
  • 📫 How to reach me: [email protected]
  • ⚡ Fun fact: I am an amateur photographer📷. My work can be found at: https://photo.leoneo.top

Hongbosherlock

Hongbosherlock's github stats

Pinned Loading

  1. sgl-project/sglang sgl-project/sglang Public

    SGLang is a fast serving framework for large language models and vision language models.

    Python 20.9k 3.6k

  2. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 64.7k 11.8k

  3. Infrasys-AI/AISystem Infrasys-AI/AISystem Public

    AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

    Jupyter Notebook 15.8k 2.3k

  4. QuantLLM QuantLLM Public

    Quantization Kernel Library for LLM Inference

    C++ 2