Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
167 commits
Select commit Hold shift + click to select a range
5748352
Update expected values for one more `test_speculative_generation` aft…
ydshieh Sep 18, 2025
564fde1
FIX(trainer): ensure final checkpoint is saved when resuming training…
rangehow Sep 18, 2025
c532575
Add new model LFM2-VL (#40624)
zucchini-nlp Sep 18, 2025
f610418
Fix outdated version checks of accelerator (#40969)
cyyever Sep 18, 2025
7cf1f5c
Use `skip_predictor=True` in vjepa2 `get_vision_features` (#40966)
hamishs Sep 18, 2025
9378f87
[Trainer] Fix DP loss (#40799)
SunMarc Sep 18, 2025
6e51ac3
[timm_wrapper] better handling of "Unknown model" exception in timm (…
harshaljanjani Sep 18, 2025
2ce35a2
Fix Issue #39030: AutoTokenizer.from_pretrained does not propagate to…
brandenkmurray Sep 18, 2025
dd7ac4c
[tests] Really use small models in all fast tests (#40945)
Cyrilvallez Sep 18, 2025
738b223
Add captured actual outputs to CI artifacts (#40965)
ydshieh Sep 18, 2025
d9d7f6a
Revert change in `compile_friendly_resize` (#40645)
qubvel Sep 18, 2025
5ac3c51
Track the CI (model) jobs that don't produce test output files (proce…
ydshieh Sep 18, 2025
5c2f566
Remove `set_model_tester_for_less_flaky_tests` (#40982)
Cyrilvallez Sep 18, 2025
47c1a1b
Benchmarking v2 GH workflows (#40716)
ahadnagy Sep 19, 2025
5a24613
ENH: Enable readline support for transformers chat (#40911)
BenjaminBossan Sep 19, 2025
103fe0d
[testing] test `num_hidden_layers` being small in model tester (#40992)
ydshieh Sep 19, 2025
a5ffae6
blt wip (#38579)
itazap Sep 19, 2025
78f3e08
[`RMSNorm`] Fix rms norm init for models that center around 1 (#40796)
vasqu Sep 19, 2025
a89ed71
Make `EfficientLoFTRModelTest` faster (#41000)
ydshieh Sep 19, 2025
662ea95
Fix typoes in src and tests (#40845)
cyyever Sep 19, 2025
f73f73d
Fix more dates in model cards and wrong modalities in _toctree.yml (#…
yonigozlan Sep 19, 2025
6e1270d
RUFF fix on CI scripts (#40805)
cyyever Sep 19, 2025
251825a
fix dict like init for ModelOutput (#41002)
SunMarc Sep 19, 2025
f47c651
[tests] update `test_left_padding_compatibility` (and minimize overwr…
gante Sep 19, 2025
b164209
Patch more `unittest.case.TestCase.assertXXX` methods (#41008)
ydshieh Sep 19, 2025
d6d2d03
🚨 [lightglue] fix: matches order changed because of early stopped ind…
sbucaille Sep 19, 2025
b2b5044
Fix `PhimoeIntegrationTest` (#41007)
ydshieh Sep 19, 2025
e5a9a1d
Fix Glm4v test (#41011)
Cyrilvallez Sep 19, 2025
9de898e
Update after #41007 (#41014)
ydshieh Sep 19, 2025
c1cf8de
Fix benchmark runner argument name (#41012)
ahadnagy Sep 20, 2025
41813d3
Adding support for Qwen3Omni (#41025)
BakerBunker Sep 21, 2025
71f768b
Making compute_loss_func always take priority in Trainer (#40632)
Flakes342 Sep 22, 2025
23d0c62
Modify Qwen3Omni parameter name since VL changed it (#41045)
BakerBunker Sep 22, 2025
f1a8aff
Fix Qwen video tests (#41049)
zucchini-nlp Sep 22, 2025
c6d3d0b
[testing] Fix `qwen2_audio` (#41018)
ydshieh Sep 22, 2025
30dadfd
Fix typing of tuples (#41028)
cyyever Sep 22, 2025
c931992
Remove optax (#41030)
cyyever Sep 22, 2025
8460053
Fix typos in English/Chinese documentation (#41031)
cyyever Sep 22, 2025
e6f5f94
Use torch.autocast (#40975)
cyyever Sep 22, 2025
1ca9181
docs: improved RoPE function Docstrings (#41004)
RyanMullins Sep 22, 2025
7425f6d
Fix condition for emitting warning when generation exceeds max model …
yannicks1 Sep 22, 2025
9b221a8
Fix outdated torch version check (#40925)
cyyever Sep 22, 2025
c2c9074
Add Whole Word Masking and Padding Strategy to DataCollatorForLanguag…
rjgleaton Sep 22, 2025
5fb3b35
[testing] Fix `seed_oss` (#41052)
ydshieh Sep 22, 2025
3691102
Remove repeated import (#40937)
cyyever Sep 22, 2025
d43b73c
Simplify unnecessary Optional typing (#40839)
cyyever Sep 22, 2025
9de77d7
Add write token for uploading benchmark results to the Hub (#41047)
ahadnagy Sep 22, 2025
98e87db
Ci utils (#40978)
remi-or Sep 22, 2025
bdbe987
Fix CI jobs being all red 🔴 (false positive) (#41059)
ydshieh Sep 22, 2025
abbf0ed
Update quantization CI (#41068)
SunMarc Sep 22, 2025
a9266c9
[i18n-bn] Add Bengali language README file (#40935)
saidurpulok Sep 22, 2025
ed8d3aa
Improve documentation and errors in Mamba2-based models (#41063)
mapmeld Sep 22, 2025
fc974a9
Update team member list for some CI workflows (#41094)
ydshieh Sep 23, 2025
dca053d
fix crash when using chat to send 2+ request to gptoss (#40536)
sywangyi Sep 23, 2025
ea92b1a
Minor addition, no split modules for VideoMAEE (#41051)
DuyguA Sep 23, 2025
722be9f
Switch to `python:3.10-slim` for CircleCI docker images (#41067)
ydshieh Sep 23, 2025
e140ee3
Fix argument name in benchmarking script (#41086)
ahadnagy Sep 23, 2025
9957b44
Fix typos in documentation (#41087)
cyyever Sep 23, 2025
281b8b6
Fix typing (#40788)
cyyever Sep 23, 2025
72e7f34
Remove unused arguments (#40916)
cyyever Sep 23, 2025
93655f3
fix wrong height and width when read video use torchvision (#41091)
Juude Sep 23, 2025
c42b27b
docs: Fix Tool Use links and remove dead RAG links (#41104)
RyanMullins Sep 23, 2025
9d9177f
[tests] gpt2 + `CausalLMModelTester` (#41003)
gante Sep 23, 2025
8291a7f
Fix `_get_test_info` for inherited tests (#41106)
ydshieh Sep 23, 2025
7bf0c7d
Remove bad test skips (#41109)
Cyrilvallez Sep 23, 2025
1f7c653
Format empty lines and white space in markdown files. (#41100)
cyyever Sep 23, 2025
a5a8882
Update ruff to 0.13.1 + target Python 3.10 + apply fixes (#37809)
cyyever Sep 24, 2025
38c30bb
Support loading LFM2 GGUF (#41111)
HaroldBenoit Sep 24, 2025
f212a0b
[torchao safetensors] integrate torchao safetensors support with tran…
liangel-02 Sep 24, 2025
957b556
[Qwen3-next] Fix dimension mismatch in torch_chunk_gated_delta_rule a…
notkisk Sep 24, 2025
7fde975
Fix the error where a keyword argument appearing before *args (#41099)
cyyever Sep 24, 2025
c6f31ab
Fix broken `` expressions in markdown files (#41113)
cyyever Sep 24, 2025
48c8c8d
Remove self-assignment (#41062)
cyyever Sep 24, 2025
25c8ac5
Fixed MXFP4 model storage issue (#41118)
YangKai0616 Sep 24, 2025
0bc795f
Fixed loading LongT5 from legacy checkpoints (#40724)
Szustarol Sep 24, 2025
99630b8
dummy commit (#41133)
ydshieh Sep 24, 2025
6e913fc
Fix loading logic flaw with regards to unexpected and missing keys (#…
LysandreJik Sep 24, 2025
477b7a3
Fix: align Qwen2.5-VL inference rope index with training by passing s…
Xqle Sep 25, 2025
287652a
Fix single quotes in markdown (#41154)
cyyever Sep 25, 2025
174a5c4
extend gemma3n integration ut cases on XPU (#41071)
yao-matrix Sep 25, 2025
53ce2f8
Add Parakeet (#39062)
nithinraok Sep 25, 2025
bd77d70
Fix format of compressed_tensors.md (#41155)
cyyever Sep 25, 2025
6566998
Simplify and improve model loading logic (#41103)
Cyrilvallez Sep 25, 2025
83fc0ee
Force new vision models addition to include a fast image processor (#…
yonigozlan Sep 25, 2025
d3d0292
Add language specifiers to code blocks of markdown files (#41114)
cyyever Sep 25, 2025
46a138c
Improve `add_dates` script (#41167)
yonigozlan Sep 25, 2025
2e49826
Fix flash-attn for paged_attention when no kernels (#41078)
remi-or Sep 26, 2025
ac8703d
Remove data from examples (#41168)
LysandreJik Sep 26, 2025
14b4558
Enable fa in amd docker (#41069)
remi-or Sep 26, 2025
d815261
handle flash slow tests (#41072)
itazap Sep 26, 2025
cd154ae
Modernbert fix (#41056)
remi-or Sep 29, 2025
56a74c3
CI Runners - move amd runners mi355 and 325 to runner group (#41193)
glegendre01 Sep 29, 2025
9a76ebf
[XPU] Add MXFP4 support for XPU (#41117)
YangKai0616 Sep 29, 2025
97ee50f
[tests] `CausalLMTester` automatically infers other test classes from…
gante Sep 29, 2025
deac453
More typing fixes (#41102)
cyyever Sep 29, 2025
9b7c343
enable flex attention ut cases on XPU (#40989)
yao-matrix Sep 29, 2025
469336d
fix(trainer): Avoid moving model with device_map (#41032)
The5cheduler Sep 29, 2025
01d8cc0
Fix attention sink implementation in flex attention (#41083)
SamuelBarryCS Sep 29, 2025
fae2d67
Separate docker images for Nvidia and AMD in benchmarking (#41119)
ahadnagy Sep 29, 2025
3017f04
Make quantizers good citizens loading-wise (#41138)
Cyrilvallez Sep 29, 2025
389115c
[`Kernels Attention`] Change fallback logic to error out on explicit …
vasqu Sep 29, 2025
ec368a2
Add EdgeTAM (#39800)
yonigozlan Sep 29, 2025
068e709
Fix EXAONE-4.0 dummy id (#41089)
lkm2835 Sep 29, 2025
a2b6ccf
Fix 8bit bnb loading (#41200)
SunMarc Sep 29, 2025
a2cdccc
Fix docker quantization (#41201)
SunMarc Sep 29, 2025
be826ba
Embed interactive timeline in docs (#41015)
yonigozlan Sep 30, 2025
090ad5d
[docs] Fix links (#41110)
stevhliu Sep 30, 2025
e097370
Remove unnecessary Optional typing (#41198)
cyyever Sep 30, 2025
5c0fd10
docs/examples(speech): pin CTC commands to Hub datasets; add Windows …
tayo4christ Sep 30, 2025
f588aa8
Fix Qwen3-Omni audio_token_id serialization issue (#41192)
eun2ce Sep 30, 2025
4c54a98
Wait for main process in _save_checkpoint to ensure best checkpoint e…
ssharpe42 Sep 30, 2025
7e698ed
Avoid assumption that model has config attribute in deepspeed (#41207)
tomaarsen Sep 30, 2025
4886248
Trainer: Pass `num_items_in_batch` to `compute_loss` in `prediction_s…
pramodith Sep 30, 2025
d1fd30d
[ESM] add accepts_loss_kwargs=False to EsmPreTrainedModel (#41006)
pstjohn Sep 30, 2025
9f7da26
Align pull request template to bug report template (#41220)
tomaarsen Sep 30, 2025
8a23f34
[generate] cache missing custom generate file (#41216)
gante Sep 30, 2025
99fbb87
Remove old Python code (#41226)
cyyever Sep 30, 2025
4c6f26e
Adapt to the SDPA interface to enable the NPU to call FlashAttentionS…
frozenleaves Sep 30, 2025
3b03f55
update code owners (#41221)
ydshieh Sep 30, 2025
12c4e6a
Unify is_torchvision_v2_available with is_torchvision_available (#41227)
cyyever Sep 30, 2025
b0427d6
Fix typing of train_args (#41142)
cyyever Sep 30, 2025
50907d3
Fix sliding window attn mask (#41228)
remi-or Sep 30, 2025
7db6284
Revert "Fix DeepSpeed mixed precision precedence over Accelerate defa…
SunMarc Sep 30, 2025
86982f2
[docs] Fix tp_plan (#41205)
stevhliu Sep 30, 2025
ac69a8f
Fix white space in documentation (#41157)
cyyever Sep 30, 2025
186a357
fix qwen text config (#41158)
zucchini-nlp Sep 30, 2025
46954e4
Video processor accepts single frames on cuda (#41218)
zucchini-nlp Oct 1, 2025
25e2641
Use math.log2 (#41241)
cyyever Oct 1, 2025
7076da6
fix TrainerIntegrationDeepSpeed UT failures (#41236)
yao-matrix Oct 1, 2025
9e60961
[repo utils] Update `models_to_deprecate.py` (#41231)
gante Oct 1, 2025
7aca328
Use removeprefix and removesuffix (#41240)
cyyever Oct 1, 2025
c6af1ca
Fix pylint warnings (#41222)
cyyever Oct 1, 2025
b7757de
Remove all instances of `is_safetensors_available` (#41233)
SunMarc Oct 1, 2025
c7616fd
FP-Quant NVFP4 and Python 3.9 support (#39876)
BlackSamorez Oct 1, 2025
f672ee0
[`FA3`] Fix masking and loading logic in same process (#41217)
vasqu Oct 1, 2025
066ca8e
[t5gemma] fix `get_text_config` and related fixes (#40939)
gante Oct 1, 2025
e49d3d6
Don't convert to `safetensors` on the fly if the call is from testing…
ydshieh Oct 1, 2025
9ab2d57
Resolve remote custom module path warnings (#41243)
XuehaiPan Oct 1, 2025
a6f470f
add peft team members to issue/pr template (#41262)
ydshieh Oct 1, 2025
1982692
docs: update bitsandbytes platform support (#41266)
matthewdouglas Oct 1, 2025
9e34b40
add more activation kernels, follow up (#40944)
MekkCyber Oct 2, 2025
e80da3a
fix asr pipeline ut failures (#41275)
yao-matrix Oct 2, 2025
d8566bc
Use regex defailed flags (#41264)
cyyever Oct 2, 2025
d88a0fb
Fix multi-video timestamp bug in Qwen-3-VL and GLM4V (#41229)
tim120526 Oct 2, 2025
54c026e
Fix binding of video frames to video placeholder in `InternVL` model …
daskol Oct 2, 2025
03d976d
Deprecate Trackio environment variables and deploy to Spaces by defau…
qgallouedec Oct 2, 2025
37f1f5d
Allow private Space id for Trackio (#40948)
qgallouedec Oct 2, 2025
247d21a
fix async client for transformers chat (#41255)
SunMarc Oct 2, 2025
26c57ef
Unify is_torchvision_v2_available with is_torchvision_available (#41259)
cyyever Oct 2, 2025
91e1bdd
Use max/min (#41280)
cyyever Oct 2, 2025
4f1faa0
Biogptlogits (#41270)
0x-avi Oct 2, 2025
9d67585
Fix unnecessary single-item container checks (#41279)
cyyever Oct 2, 2025
89d5349
Fix pylint generator warnings (#41258)
cyyever Oct 2, 2025
f8ec172
feat: use `aws-highcpu-32-priv` for amd docker img build (#41285)
McPatate Oct 2, 2025
a2de293
Add processor and intergration test for qwen3vl (#41277)
JJJYmmm Oct 2, 2025
27b9c79
Remove `test_initialization` (#41261)
Cyrilvallez Oct 2, 2025
0995a48
Remove some previous team members from allow list of triggering Githu…
ydshieh Oct 2, 2025
41eae7a
Build doc in 2 jobs: `en` and `other languages` (#41290)
ydshieh Oct 2, 2025
aca2380
Fix mxfp4 dequantization (#41292)
Cyrilvallez Oct 2, 2025
531bb75
[`Flex Attn`] Fix lse x attention sinks logic (#41249)
vasqu Oct 2, 2025
cf88fbb
FIX: Bug in PEFT integration delete_adapter method (#41252)
BenjaminBossan Oct 2, 2025
40329a8
Italian translation for README.md (#41269)
fedtti Oct 2, 2025
e656e26
Fix README.md error when installing from source (#41303)
TKONIY Oct 2, 2025
a6e9ec4
download and use HF Hub Cache (#41181)
ydshieh Oct 3, 2025
010896e
fix some merge issues
ArthurZucker Oct 3, 2025
8270a0f
[test_all]
ArthurZucker Oct 3, 2025
e6d8087
[test-all]
ArthurZucker Oct 3, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
31 changes: 19 additions & 12 deletions .circleci/create_circleci_config.py
Original file line number Diff line number Diff line change
Expand Up @@ -16,10 +16,9 @@
import argparse
import copy
import os
import random
from dataclasses import dataclass
from typing import Any, Dict, List, Optional
import glob
from typing import Any, Optional

import yaml


Expand All @@ -30,6 +29,7 @@
"RUN_PIPELINE_TESTS": False,
# will be adjust in `CircleCIJob.to_dict`.
"RUN_FLAKY": True,
"DISABLE_SAFETENSORS_CONVERSION": True,
}
# Disable the use of {"s": None} as the output is way too long, causing the navigation on CircleCI impractical
COMMON_PYTEST_OPTIONS = {"max-worker-restart": 0, "vvv": None, "rsfE":None}
Expand Down Expand Up @@ -82,15 +82,15 @@ def to_dict(self):
@dataclass
class CircleCIJob:
name: str
additional_env: Dict[str, Any] = None
docker_image: List[Dict[str, str]] = None
install_steps: List[str] = None
additional_env: dict[str, Any] = None
docker_image: list[dict[str, str]] = None
install_steps: list[str] = None
marker: Optional[str] = None
parallelism: Optional[int] = 0
pytest_num_workers: int = 8
pytest_options: Dict[str, Any] = None
pytest_options: dict[str, Any] = None
resource_class: Optional[str] = "xlarge"
tests_to_run: Optional[List[str]] = None
tests_to_run: Optional[list[str]] = None
num_test_files_per_worker: Optional[int] = 10
# This should be only used for doctest job!
command_timeout: Optional[int] = None
Expand Down Expand Up @@ -130,6 +130,12 @@ def __post_init__(self):

def to_dict(self):
env = COMMON_ENV_VARIABLES.copy()
if self.job_name != "tests_hub":
# fmt: off
# not critical
env.update({"HF_TOKEN": "".join(["h", "f", "_", "H", "o", "d", "V", "u", "M", "q", "b", "R", "m", "t", "b", "z", "F", "Q", "O", "Q", "A", "J", "G", "D", "l", "V", "Q", "r", "R", "N", "w", "D", "M", "V", "C", "s", "d"])})
# fmt: on

# Do not run tests decorated by @is_flaky on pull requests
env['RUN_FLAKY'] = os.environ.get("CIRCLE_PULL_REQUEST", "") == ""
env.update(self.additional_env)
Expand All @@ -149,7 +155,7 @@ def to_dict(self):
# Examples special case: we need to download NLTK files in advance to avoid cuncurrency issues
timeout_cmd = f"timeout {self.command_timeout} " if self.command_timeout else ""
marker_cmd = f"-m '{self.marker}'" if self.marker is not None else ""
junit_flags = f" -p no:warning -o junit_family=xunit1 --junitxml=test-results/junit.xml"
junit_flags = " -p no:warning -o junit_family=xunit1 --junitxml=test-results/junit.xml"
joined_flaky_patterns = "|".join(FLAKY_TEST_FAILURE_PATTERNS)
repeat_on_failure_flags = f"--reruns 5 --reruns-delay 2 --only-rerun '({joined_flaky_patterns})'"
parallel = f' << pipeline.parameters.{self.job_name}_parallelism >> '
Expand Down Expand Up @@ -180,6 +186,7 @@ def to_dict(self):
# During the CircleCI docker images build time, we might already (or not) download the data.
# If it's done already, the files are inside the directory `/test_data/`.
{"run": {"name": "fetch hub objects before pytest", "command": "cp -r /test_data/* . 2>/dev/null || true; python3 utils/fetch_hub_objects_for_ci.py"}},
{"run": {"name": "download and unzip hub cache", "command": 'curl -L -o huggingface-cache.tar.gz https://huggingface.co/datasets/hf-internal-testing/hf_hub_cache/resolve/main/huggingface-cache.tar.gz && apt-get install pigz && tar --use-compress-program="pigz -d -p 8" -xf huggingface-cache.tar.gz && mv -n hub/* /root/.cache/huggingface/hub/ && ls -la /root/.cache/huggingface/hub/'}},
{"run": {
"name": "Run tests",
"command": f"({timeout_cmd} python3 -m pytest {marker_cmd} -n {self.pytest_num_workers} {junit_flags} {repeat_on_failure_flags} {' '.join(pytest_flags)} $(cat splitted_tests.txt) | tee tests_output.txt)"}
Expand All @@ -200,9 +207,9 @@ def to_dict(self):
fi"""
},
},
{"run": {"name": "Expand to show skipped tests", "when": "always", "command": f"python3 .circleci/parse_test_outputs.py --file tests_output.txt --skip"}},
{"run": {"name": "Failed tests: show reasons", "when": "always", "command": f"python3 .circleci/parse_test_outputs.py --file tests_output.txt --fail"}},
{"run": {"name": "Errors", "when": "always", "command": f"python3 .circleci/parse_test_outputs.py --file tests_output.txt --errors"}},
{"run": {"name": "Expand to show skipped tests", "when": "always", "command": "python3 .circleci/parse_test_outputs.py --file tests_output.txt --skip"}},
{"run": {"name": "Failed tests: show reasons", "when": "always", "command": "python3 .circleci/parse_test_outputs.py --file tests_output.txt --fail"}},
{"run": {"name": "Errors", "when": "always", "command": "python3 .circleci/parse_test_outputs.py --file tests_output.txt --errors"}},
{"store_test_results": {"path": "test-results"}},
{"store_artifacts": {"path": "test-results/junit.xml"}},
{"store_artifacts": {"path": "reports"}},
Expand Down
3 changes: 2 additions & 1 deletion .circleci/parse_test_outputs.py
Original file line number Diff line number Diff line change
@@ -1,5 +1,6 @@
import re
import argparse
import re


def parse_pytest_output(file_path):
skipped_tests = {}
Expand Down
1 change: 1 addition & 0 deletions .github/ISSUE_TEMPLATE/bug-report.yml
Original file line number Diff line number Diff line change
Expand Up @@ -61,6 +61,7 @@ body:
- Big Model Inference: @SunMarc
- quantization (bitsandbytes, autogpt): @SunMarc @MekkCyber
- kernels: @MekkCyber @drbh
- peft: @BenjaminBossan @githubnemo

Devices/Backends:

Expand Down
38 changes: 19 additions & 19 deletions .github/PULL_REQUEST_TEMPLATE.md
Original file line number Diff line number Diff line change
Expand Up @@ -39,41 +39,41 @@ members/contributors who may be interested in your PR.

Models:

- text models: @ArthurZucker
- vision models: @amyeroberts, @qubvel
- speech models: @eustlb
- text models: @ArthurZucker @Cyrilvallez
- vision models: @yonigozlan @molbap
- audio models: @eustlb @ebezzam @vasqu
- multimodal models: @zucchini-nlp
- graph models: @clefourrier

Library:

- flax: @gante and @Rocketknight1
- generate: @zucchini-nlp (visual-language models) or @gante (all others)
- continuous batching: @remi-or @ArthurZucker @McPatate
- pipelines: @Rocketknight1
- tensorflow: @gante and @Rocketknight1
- tokenizers: @ArthurZucker
- trainer: @zach-huggingface, @SunMarc and @qgallouedec
- chat templates: @Rocketknight1
- tokenizers: @ArthurZucker and @itazap
- trainer: @zach-huggingface @SunMarc
- attention: @vasqu @ArthurZucker @CyrilVallez
- model loading (from pretrained, etc): @CyrilVallez
- distributed: @3outeille @ArthurZucker @S1ro1
- CIs: @ydshieh

Integrations:

- deepspeed: HF Trainer/Accelerate: @SunMarc @zach-huggingface
- ray/raytune: @richardliaw, @amogkam
- Big Model Inference: @SunMarc
- quantization (bitsandbytes, autogpt): @SunMarc @MekkCyber
- kernels: @MekkCyber @drbh
- peft: @BenjaminBossan @githubnemo

Documentation: @stevhliu

HF projects:
Devices/Backends:

- accelerate: [different repo](https://github.com/huggingface/accelerate)
- datasets: [different repo](https://github.com/huggingface/datasets)
- diffusers: [different repo](https://github.com/huggingface/diffusers)
- rust tokenizers: [different repo](https://github.com/huggingface/tokenizers)
- AMD ROCm: @ivarflakstad
- Intel XPU: @IlyasMoutawwakil
- Ascend NPU: @ivarflakstad

Maintained examples (not research project or legacy):
Documentation: @stevhliu

- Flax: @Rocketknight1
- PyTorch: See Models above and tag the person corresponding to the modality of the example.
- TensorFlow: @Rocketknight1
Research projects are not maintained and should be taken as is.

-->
8 changes: 5 additions & 3 deletions .github/scripts/assign_reviewers.py
Original file line number Diff line number Diff line change
Expand Up @@ -13,14 +13,16 @@
# See the License for the specific language governing permissions and
# limitations under the License.

import os
import github
import json
from github import Github
import os
import re
from collections import Counter
from pathlib import Path

import github
from github import Github


def pattern_to_regex(pattern):
if pattern.startswith("/"):
start_anchor = True
Expand Down
130 changes: 65 additions & 65 deletions .github/scripts/codeowners_for_review_action
Original file line number Diff line number Diff line change
Expand Up @@ -7,8 +7,8 @@ docs/ @stevhliu
/docker/ @ydshieh @ArthurZucker

# More high-level globs catch cases when specific rules later don't apply
/src/transformers/models/*/processing* @molbap @yonigozlan @qubvel
/src/transformers/models/*/image_processing* @qubvel
/src/transformers/models/*/processing* @molbap @yonigozlan
/src/transformers/models/*/image_processing* @yonigozlan
/src/transformers/models/*/image_processing_*_fast* @yonigozlan

# Owners of subsections of the library
Expand Down Expand Up @@ -186,65 +186,65 @@ trainer_utils.py @zach-huggingface @SunMarc
/src/transformers/models/zamba/mod*_zamba* @ArthurZucker

# Vision models
/src/transformers/models/beit/mod*_beit* @amyeroberts @qubvel
/src/transformers/models/bit/mod*_bit* @amyeroberts @qubvel
/src/transformers/models/conditional_detr/mod*_conditional_detr* @amyeroberts @qubvel
/src/transformers/models/convnext/mod*_convnext* @amyeroberts @qubvel
/src/transformers/models/convnextv2/mod*_convnextv2* @amyeroberts @qubvel
/src/transformers/models/cvt/mod*_cvt* @amyeroberts @qubvel
/src/transformers/models/deformable_detr/mod*_deformable_detr* @amyeroberts @qubvel
/src/transformers/models/deit/mod*_deit* @amyeroberts @qubvel
/src/transformers/models/depth_anything/mod*_depth_anything* @amyeroberts @qubvel
/src/transformers/models/depth_anything_v2/mod*_depth_anything_v2* @amyeroberts @qubvel
/src/transformers/models/deta/mod*_deta* @amyeroberts @qubvel
/src/transformers/models/detr/mod*_detr* @amyeroberts @qubvel
/src/transformers/models/dinat/mod*_dinat* @amyeroberts @qubvel
/src/transformers/models/dinov2/mod*_dinov2* @amyeroberts @qubvel
/src/transformers/models/dinov2_with_registers/mod*_dinov2_with_registers* @amyeroberts @qubvel
/src/transformers/models/dit/mod*_dit* @amyeroberts @qubvel
/src/transformers/models/dpt/mod*_dpt* @amyeroberts @qubvel
/src/transformers/models/efficientformer/mod*_efficientformer* @amyeroberts @qubvel
/src/transformers/models/efficientnet/mod*_efficientnet* @amyeroberts @qubvel
/src/transformers/models/focalnet/mod*_focalnet* @amyeroberts @qubvel
/src/transformers/models/glpn/mod*_glpn* @amyeroberts @qubvel
/src/transformers/models/hiera/mod*_hiera* @amyeroberts @qubvel
/src/transformers/models/ijepa/mod*_ijepa* @amyeroberts @qubvel
/src/transformers/models/imagegpt/mod*_imagegpt* @amyeroberts @qubvel
/src/transformers/models/levit/mod*_levit* @amyeroberts @qubvel
/src/transformers/models/mask2former/mod*_mask2former* @amyeroberts @qubvel
/src/transformers/models/maskformer/mod*_maskformer* @amyeroberts @qubvel
/src/transformers/models/mobilenet_v1/mod*_mobilenet_v1* @amyeroberts @qubvel
/src/transformers/models/mobilenet_v2/mod*_mobilenet_v2* @amyeroberts @qubvel
/src/transformers/models/mobilevit/mod*_mobilevit* @amyeroberts @qubvel
/src/transformers/models/mobilevitv2/mod*_mobilevitv2* @amyeroberts @qubvel
/src/transformers/models/nat/mod*_nat* @amyeroberts @qubvel
/src/transformers/models/poolformer/mod*_poolformer* @amyeroberts @qubvel
/src/transformers/models/pvt/mod*_pvt* @amyeroberts @qubvel
/src/transformers/models/pvt_v2/mod*_pvt_v2* @amyeroberts @qubvel
/src/transformers/models/regnet/mod*_regnet* @amyeroberts @qubvel
/src/transformers/models/resnet/mod*_resnet* @amyeroberts @qubvel
/src/transformers/models/rt_detr/mod*_rt_detr* @amyeroberts @qubvel
/src/transformers/models/segformer/mod*_segformer* @amyeroberts @qubvel
/src/transformers/models/seggpt/mod*_seggpt* @amyeroberts @qubvel
/src/transformers/models/superpoint/mod*_superpoint* @amyeroberts @qubvel
/src/transformers/models/swiftformer/mod*_swiftformer* @amyeroberts @qubvel
/src/transformers/models/swin/mod*_swin* @amyeroberts @qubvel
/src/transformers/models/swinv2/mod*_swinv2* @amyeroberts @qubvel
/src/transformers/models/swin2sr/mod*_swin2sr* @amyeroberts @qubvel
/src/transformers/models/table_transformer/mod*_table_transformer* @amyeroberts @qubvel
/src/transformers/models/textnet/mod*_textnet* @amyeroberts @qubvel
/src/transformers/models/timm_wrapper/mod*_timm_wrapper* @amyeroberts @qubvel
/src/transformers/models/upernet/mod*_upernet* @amyeroberts @qubvel
/src/transformers/models/van/mod*_van* @amyeroberts @qubvel
/src/transformers/models/vit/mod*_vit* @amyeroberts @qubvel
/src/transformers/models/vit_hybrid/mod*_vit_hybrid* @amyeroberts @qubvel
/src/transformers/models/vitdet/mod*_vitdet* @amyeroberts @qubvel
/src/transformers/models/vit_mae/mod*_vit_mae* @amyeroberts @qubvel
/src/transformers/models/vitmatte/mod*_vitmatte* @amyeroberts @qubvel
/src/transformers/models/vit_msn/mod*_vit_msn* @amyeroberts @qubvel
/src/transformers/models/vitpose/mod*_vitpose* @amyeroberts @qubvel
/src/transformers/models/yolos/mod*_yolos* @amyeroberts @qubvel
/src/transformers/models/zoedepth/mod*_zoedepth* @amyeroberts @qubvel
/src/transformers/models/beit/mod*_beit* @yonigozlan @molbap
/src/transformers/models/bit/mod*_bit* @yonigozlan @molbap
/src/transformers/models/conditional_detr/mod*_conditional_detr* @yonigozlan @molbap
/src/transformers/models/convnext/mod*_convnext* @yonigozlan @molbap
/src/transformers/models/convnextv2/mod*_convnextv2* @yonigozlan @molbap
/src/transformers/models/cvt/mod*_cvt* @yonigozlan @molbap
/src/transformers/models/deformable_detr/mod*_deformable_detr* @yonigozlan @molbap
/src/transformers/models/deit/mod*_deit* @yonigozlan @molbap
/src/transformers/models/depth_anything/mod*_depth_anything* @yonigozlan @molbap
/src/transformers/models/depth_anything_v2/mod*_depth_anything_v2* @yonigozlan @molbap
/src/transformers/models/deta/mod*_deta* @yonigozlan @molbap
/src/transformers/models/detr/mod*_detr* @yonigozlan @molbap
/src/transformers/models/dinat/mod*_dinat* @yonigozlan @molbap
/src/transformers/models/dinov2/mod*_dinov2* @yonigozlan @molbap
/src/transformers/models/dinov2_with_registers/mod*_dinov2_with_registers* @yonigozlan @molbap
/src/transformers/models/dit/mod*_dit* @yonigozlan @molbap
/src/transformers/models/dpt/mod*_dpt* @yonigozlan @molbap
/src/transformers/models/efficientformer/mod*_efficientformer* @yonigozlan @molbap
/src/transformers/models/efficientnet/mod*_efficientnet* @yonigozlan @molbap
/src/transformers/models/focalnet/mod*_focalnet* @yonigozlan @molbap
/src/transformers/models/glpn/mod*_glpn* @yonigozlan @molbap
/src/transformers/models/hiera/mod*_hiera* @yonigozlan @molbap
/src/transformers/models/ijepa/mod*_ijepa* @yonigozlan @molbap
/src/transformers/models/imagegpt/mod*_imagegpt* @yonigozlan @molbap
/src/transformers/models/levit/mod*_levit* @yonigozlan @molbap
/src/transformers/models/mask2former/mod*_mask2former* @yonigozlan @molbap
/src/transformers/models/maskformer/mod*_maskformer* @yonigozlan @molbap
/src/transformers/models/mobilenet_v1/mod*_mobilenet_v1* @yonigozlan @molbap
/src/transformers/models/mobilenet_v2/mod*_mobilenet_v2* @yonigozlan @molbap
/src/transformers/models/mobilevit/mod*_mobilevit* @yonigozlan @molbap
/src/transformers/models/mobilevitv2/mod*_mobilevitv2* @yonigozlan @molbap
/src/transformers/models/nat/mod*_nat* @yonigozlan @molbap
/src/transformers/models/poolformer/mod*_poolformer* @yonigozlan @molbap
/src/transformers/models/pvt/mod*_pvt* @yonigozlan @molbap
/src/transformers/models/pvt_v2/mod*_pvt_v2* @yonigozlan @molbap
/src/transformers/models/regnet/mod*_regnet* @yonigozlan @molbap
/src/transformers/models/resnet/mod*_resnet* @yonigozlan @molbap
/src/transformers/models/rt_detr/mod*_rt_detr* @yonigozlan @molbap
/src/transformers/models/segformer/mod*_segformer* @yonigozlan @molbap
/src/transformers/models/seggpt/mod*_seggpt* @yonigozlan @molbap
/src/transformers/models/superpoint/mod*_superpoint* @yonigozlan @molbap
/src/transformers/models/swiftformer/mod*_swiftformer* @yonigozlan @molbap
/src/transformers/models/swin/mod*_swin* @yonigozlan @molbap
/src/transformers/models/swinv2/mod*_swinv2* @yonigozlan @molbap
/src/transformers/models/swin2sr/mod*_swin2sr* @yonigozlan @molbap
/src/transformers/models/table_transformer/mod*_table_transformer* @yonigozlan @molbap
/src/transformers/models/textnet/mod*_textnet* @yonigozlan @molbap
/src/transformers/models/timm_wrapper/mod*_timm_wrapper* @yonigozlan @molbap
/src/transformers/models/upernet/mod*_upernet* @yonigozlan @molbap
/src/transformers/models/van/mod*_van* @yonigozlan @molbap
/src/transformers/models/vit/mod*_vit* @yonigozlan @molbap
/src/transformers/models/vit_hybrid/mod*_vit_hybrid* @yonigozlan @molbap
/src/transformers/models/vitdet/mod*_vitdet* @yonigozlan @molbap
/src/transformers/models/vit_mae/mod*_vit_mae* @yonigozlan @molbap
/src/transformers/models/vitmatte/mod*_vitmatte* @yonigozlan @molbap
/src/transformers/models/vit_msn/mod*_vit_msn* @yonigozlan @molbap
/src/transformers/models/vitpose/mod*_vitpose* @yonigozlan @molbap
/src/transformers/models/yolos/mod*_yolos* @yonigozlan @molbap
/src/transformers/models/zoedepth/mod*_zoedepth* @yonigozlan @molbap

# Audio models
/src/transformers/models/audio_spectrogram_transformer/mod*_audio_spectrogram_transformer* @eustlb
Expand Down Expand Up @@ -304,7 +304,7 @@ trainer_utils.py @zach-huggingface @SunMarc
/src/transformers/models/donut/mod*_donut* @zucchini-nlp
/src/transformers/models/flava/mod*_flava* @zucchini-nlp
/src/transformers/models/git/mod*_git* @zucchini-nlp
/src/transformers/models/grounding_dino/mod*_grounding_dino* @qubvel
/src/transformers/models/grounding_dino/mod*_grounding_dino* @yonigozlan
/src/transformers/models/groupvit/mod*_groupvit* @zucchini-nlp
/src/transformers/models/idefics/mod*_idefics* @zucchini-nlp
/src/transformers/models/idefics2/mod*_idefics2* @zucchini-nlp
Expand All @@ -326,10 +326,10 @@ trainer_utils.py @zach-huggingface @SunMarc
/src/transformers/models/mgp_str/mod*_mgp_str* @zucchini-nlp
/src/transformers/models/mllama/mod*_mllama* @zucchini-nlp
/src/transformers/models/nougat/mod*_nougat* @NielsRogge
/src/transformers/models/omdet_turbo/mod*_omdet_turbo* @qubvel @yonigozlan
/src/transformers/models/omdet_turbo/mod*_omdet_turbo* @yonigozlan
/src/transformers/models/oneformer/mod*_oneformer* @zucchini-nlp
/src/transformers/models/owlvit/mod*_owlvit* @qubvel
/src/transformers/models/owlv2/mod*_owlv2* @qubvel
/src/transformers/models/owlvit/mod*_owlvit* @yonigozlan
/src/transformers/models/owlv2/mod*_owlv2* @yonigozlan
/src/transformers/models/paligemma/mod*_paligemma* @zucchini-nlp @molbap
/src/transformers/models/perceiver/mod*_perceiver* @zucchini-nlp
/src/transformers/models/pix2struct/mod*_pix2struct* @zucchini-nlp
Expand Down
Loading
Loading