Popular repositories
- vllm (Python; forked from vllm-project/vllm): A high-throughput and memory-efficient inference and serving engine for LLMs
- RULER (Python; forked from NVIDIA/RULER): Source code for RULER: What's the Real Context Size of Your Long-Context Language Models?
- flash-attention (Python; forked from vllm-project/flash-attention): Fast and memory-efficient exact attention
- x-attention (Python; forked from mit-han-lab/x-attention): XAttention: Block Sparse Attention with Antidiagonal Scoring
- Block-Sparse-Attention (C++; forked from mit-han-lab/Block-Sparse-Attention): A sparse attention kernel supporting mixed sparse patterns