Pull requests: ServiceNow/Fast-LLM


Pull requests list

Distributed unit tests, newer transformers and trainer fixes
#387 opened Nov 13, 2025 by bigximik
1 of 8 tasks
Vision model
#384 opened Nov 7, 2025 by jlamypoirier Draft
Vision dataset and preprocessing
#383 opened Oct 30, 2025 by jlamypoirier
New memmap dataset format
#381 opened Oct 18, 2025 by jlamypoirier
Cleanup modeling file for Apriel-H
#379 opened Oct 16, 2025 by nitsanluke
Language model sample
#378 opened Oct 16, 2025 by jlamypoirier
Dataset interface
#377 opened Oct 15, 2025 by jlamypoirier
Add stochastic mixer for supernet training
#373 opened Oct 12, 2025 by tscholak
Nemotron-H mamba2
#355 opened Aug 21, 2025 by oleksost
2 of 26 tasks
[Dev] Hybrid dev branch
#347 opened Aug 7, 2025 by RaymondLi0
fix loss masking
#345 opened Aug 6, 2025 by RaymondLi0 Draft
1 of 26 tasks
[WIP] Multimodal SSM + TP
#338 opened Jul 29, 2025 by RaymondLi0 Draft
25 tasks
WIP: Hybrid Multimodal
#332 opened Jul 21, 2025 by RaymondLi0 Draft
26 tasks
[work in progress] support for kv_cache
#322 opened Jul 4, 2025 by bigximik Draft
25 tasks
Masked Diffusion Training with Shift
#294 opened Jun 10, 2025 by nitsanluke Draft
1 of 25 tasks
[Prototype] Multimodal Audio
#272 opened May 15, 2025 by tobyzl2 Draft
25 tasks
[Prototype] Multimodal (vision) support
#227 opened Apr 8, 2025 by sohamparikh
8 tasks
[inactive] Track entropy and MI of routing distribution for topk MoE (label: enhancement)
#188 opened Mar 14, 2025 by oleksost Draft
9 of 22 tasks