🎓 Direct PhD Student @ Zhejiang University (ZJU)
💡 Researching Computer Architecture, DPU (SmartNIC & RDMA) and AI Infrastructure
🚀 Captain @ ZJUSCT — leading the ZJU Supercomputing Team in world HPC competitions
🛠 Founder @ ckc-agc — building the learning community @ ZJU CHU KOCHEN HONORS COLLEGE
🏢 ex-Intern @ Tencent CSIG HPN Center
✍️ Writing technical notes on Bowling's TechStack
Architecture powers intelligence. Infra builds the future.
- Parallel Computing / Performance: MPI · OpenMP · CUDA · Nsight · perf
- Networking: RDMA · InfiniBand · UCX · libfabric · NCCL
- System Development: C/C++ · Linux Kernel · QEMU · GDB
- Infra / Observability: eBPF · Containers · CI/CD · Databases
- Other Tools: Python · Flutter · OpenGL
- 🥈 (Asia) ASC 24/25 — International Supercomputing Contest, Team Second Prize
- 🥇 (Europe) ISC SCC 24/25 — International Supercomputing Contest, Team 5th Place
- 🧑‍🏫 Teaching Assistant: Operating System, Computer Systems III, Supercomputing Training
- 🌟 ZJU Peer Learning Star, Scholarship x2, CKC Academic Guidance Center Chair
- SmartNS — A terabit-scale flexible network stack on DPU
  Reduced Host–DPU memory overhead via TX/RX decoupling; paper under submission to SC/NSDI.
- ICON Climate Model Optimization — C++17 Parallel Algorithms + GPU Offloading
  Accelerated physical computation kernels with CUDA backends.
- m5C RNA Site Tools Optimization — Multithreading redesign, -70% runtime
  Rewrote the synchronization model with condition variables.
I love low-level systems, high-performance computing, and exploring how hardware–software co-design can push AI infrastructure further.
If I’m not debugging a kernel module, I’m probably benchmarking an RDMA stack 😎



