Skip to content
View bowling233's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@ZJUSCT @ckc-agc

Block or report bowling233

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bowling233/README.md

👋 Hi, I'm Baolin Zhu

🎓 Direct PhD Student @ Zhejiang University (ZJU)
💡 Researching Computer Architecture, DPU (SmartNIC & RDMA) and AI Infrastructure
🚀 Captain @ ZJUSCT — leading the ZJU Supercomputing Team in world HPC competitions
🛠 Founder @ ckc-agc — building the learning community @ ZJU CHU KOCHEN HONORS COLLEGE
🏢 ex-Intern @ Tencent CSIG HPN Center
✍️ Writing technical notes on Bowling's TechStack

Architecture powers intelligence. Infra builds the future.


🧩 Tech Stack

  • Parallel Computing / Performance: MPI · OpenMP · CUDA · Nsight · perf
  • Networking: RDMA · InfiniBand · UCX · libfabric · NCCL
  • System Development: C/C++ · Linux Kernel · QEMU · GDB
  • Infra / Observability: eBPF · Containers · CI/CD · Databases
  • Other Tools: Python · Flutter · OpenGL

🏆 Achievements

  • 🥈 (Asia) ASC 24/25 — International Supercomputing Contest, Team Second Prize
  • 🥇 (Europe) ISC SCC 24/25 — International Supercomputing Contest, Team 5th Place
  • 🧑‍🏫 Teaching Assistant: Operating System, Computer Systems III, Supercomputing Training
  • 🌟 ZJU Peer Learning Star, Scholarship x2, CKC Academic Guidance Center Chair

🔬 Research & Projects

  • SmartNS — A terabit-scale flexible network stack on DPU
    Reduced Host–DPU memory overhead via TX/RX decoupling; paper under SC/NSDI submission.

  • ICON Climate Model Optimization — C++17 Parallel Algorithms + GPU Offloading
    Accelerated physical computation kernels with CUDA backends.

  • m5C RNA Site Tools Optimization — Multithreading redesign, -70% runtime
    Rewrote synchronization model with conditional variables.


💬 About Me

I love low-level systems, high-performance computing, and exploring how hardware–software co-design can push AI infrastructure further.
If I’m not debugging a kernel module, I’m probably benchmarking an RDMA stack 😎

bowling233' github stats

Pinned Loading

  1. torvalds/linux torvalds/linux Public

    Linux kernel source tree

    C 208k 58.5k

  2. spack/spack spack/spack Public

    A flexible package manager that supports multiple versions, configurations, platforms, and compilers.

    Python 4.9k 2.4k

  3. open-telemetry/opentelemetry-collector-contrib open-telemetry/opentelemetry-collector-contrib Public

    Contrib repository for the OpenTelemetry Collector

    Go 4.2k 3.2k

  4. rafaelkallis/adaptive-radix-tree rafaelkallis/adaptive-radix-tree Public

    An adaptive radix tree for efficient indexing in main memory.

    C++ 166 33

  5. ZJUSCT/HPC101 ZJUSCT/HPC101 Public

    Course Lab documents for HPC 101 (a.k.a. 课程综合实践 Ⅰ / CS1030M)of Zhejiang University

    TeX 75 19

  6. ckc-agc/study-assist ckc-agc/study-assist Public

    浙江大学竺可桢学院辅学计划仓库

    65 11