
Conversation

@red-hat-konflux

This PR contains the following updates:

Package: huggingface-hub
Change: ==0.15.1 -> ==0.36.0

Warning

Some dependencies could not be looked up. Check the warning logs for more information.


Release Notes

huggingface/huggingface_hub (huggingface-hub)

v0.36.0: [v0.36.0] Last Stop Before 1.0

Compare Source

This is the final minor release before v1.0.0. This release focuses on performance optimizations to HfFileSystem and adds a new get_organization_overview API endpoint.

We'll continue to release security patches as needed, but v0.37 will not happen. The next release will be 1.0.0. We’re also deeply grateful to the entire Hugging Face community for their feedback, bug reports, and suggestions that have shaped this library.

Full Changelog: huggingface/huggingface_hub@v0.35.0...v0.36.0

📁 HfFileSystem

Major optimizations have been implemented in HfFileSystem:

  • Cache is kept when pickling an fs instance. This is particularly useful when streaming datasets in a distributed training environment: each worker no longer has to rebuild its cache (see the sketch below).
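
As a minimal sketch of what this enables (the repository path is borrowed from the glob example below; whether the second listing hits the network depends on what is cached):

import pickle

from huggingface_hub import HfFileSystem

fs = HfFileSystem()
fs.ls("datasets/HuggingFaceFW/fineweb-edu/data")  # populates the directory cache

# e.g. what a DataLoader worker receives in distributed training
fs_worker = pickle.loads(pickle.dumps(fs))
fs_worker.ls("datasets/HuggingFaceFW/fineweb-edu/data")  # can reuse the carried-over cache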

Listing files with .glob() has been greatly optimized:

from huggingface_hub import HfFileSystem

HfFileSystem().glob("datasets/HuggingFaceFW/fineweb-edu/data/*/*")

# Before: ~100 /tree calls (one per subdirectory)
# Now: 1 /tree call

Minor updates:

🌍 HfApi

It is now possible to get high-level information about an organization, just as was already possible for users:

>>> from huggingface_hub import get_organization_overview
>>> get_organization_overview("huggingface")
Organization(
    avatar_url='https://cdn-avatars.huggingface.co/v1/production/uploads/1583856921041-5dd96eb166059660ed1ee413.png',
    name='huggingface',
    fullname='Hugging Face',
    details='The AI community building the future.',
    is_verified=True,
    is_following=True,
    num_users=198,
    num_models=164,
    num_spaces=96,
    num_datasets=1043,
    num_followers=64814
)

🛠️ Small fixes and maintenance

🐛 Bug and typo fixes
🏗️ internal

Community contributions

The following contributors have made changes to the library over the last release. Thank you!

v0.35.3: [v0.35.3] Fix image-to-image target size parameter mapping & tiny agents allow tools list bug

Compare Source

This release includes two bug fixes:

Full Changelog: huggingface/huggingface_hub@v0.35.2...v0.35.3

v0.35.2: [v0.35.2] Welcoming Z.ai as Inference Providers!

Compare Source

Full Changelog: huggingface/huggingface_hub@v0.35.1...v0.35.2

New inference provider! 🔥

Z.ai is now officially an Inference Provider on the Hub. See full documentation here: https://huggingface.co/docs/inference-providers/providers/zai-org.

from huggingface_hub import InferenceClient

client = InferenceClient(provider="zai-org")
completion = client.chat.completions.create(
    model="zai-org/GLM-4.5",
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)

print("\nThinking:")
print(completion.choices[0].message.reasoning_content)
print("\nOutput:")
print(completion.choices[0].message.content)
Thinking:
Okay, the user is asking about the capital of France. That's a pretty straightforward geography question. 

Hmm, I wonder if this is just a casual inquiry or if they need it for something specific like homework or travel planning. The question is very basic though, so probably just general knowledge. 

Paris is definitely the correct answer here. It's been the capital for centuries, since the Capetian dynasty made it the seat of power. Should I mention any historical context? Nah, the user didn't ask for details - just the capital. 

I recall Paris is also France's largest city and major cultural hub. But again, extra info might be overkill unless they follow up. Better keep it simple and accurate. 

The answer should be clear and direct: "Paris". No need to overcomplicate a simple fact. If they want more, they'll ask.

Output:
The capital of France is **Paris**.  

Paris has been the political and cultural center of France for centuries, serving as the seat of government, the residence of the President (Élysée Palace), and home to iconic landmarks like the Eiffel Tower, the Louvre Museum, and Notre-Dame Cathedral. It is also France's largest city and a global hub for art, fashion, gastronomy, and history.

Misc:

v0.35.1: [v0.35.1] Do not retry on 429 and skip forward ref in strict dataclass

Compare Source

  • Do not retry on 429 (only on 5xx) #3377
  • Skip unresolved forward ref in strict dataclasses #3376

Full Changelog: huggingface/huggingface_hub@v0.35.0...v0.35.1

v0.35.0: [v0.35.0] Announcing Scheduled Jobs: run cron jobs on GPU on the Hugging Face Hub!

Compare Source

Scheduled Jobs

In the v0.34.0 release, we announced Jobs, a new way to run compute on the Hugging Face Hub. In this release, we are announcing Scheduled Jobs to run Jobs on a regular basis. Think "cron jobs running on GPU".

This comes with a fully-fledged CLI:

hf jobs scheduled run @hourly ubuntu echo hello world
hf jobs scheduled run "0 * * * *" ubuntu echo hello world
hf jobs scheduled ps -a
hf jobs scheduled inspect <id>
hf jobs scheduled delete <id>
hf jobs scheduled suspend <id>
hf jobs scheduled resume <id>
hf jobs scheduled uv run @weekly train.py

It is now possible to run a command with uv run:

hf jobs uv run --with lighteval -s HF_TOKEN lighteval endpoint inference-providers "model_name=openai/gpt-oss-20b,provider=groq" "lighteval|gsm8k|0|0"

Some other improvements have been added to the existing Jobs API for a better UX.

And finally, Jobs documentation has been updated with new examples (and some fixes):

CLI updates

In addition to the Scheduled Jobs, some improvements have been added to the hf CLI.

Inference Providers

Welcome Scaleway and PublicAI!

Two new partners have been integrated into Inference Providers: Scaleway and PublicAI! (as part of releases 0.34.5 and 0.34.6).

Image-to-video

Image to video is now supported in the InferenceClient:

from huggingface_hub import InferenceClient

client = InferenceClient(provider="fal-ai")

video = client.image_to_video(
    "cat.png",
    prompt="The cat starts to dance",
    model="Wan-AI/Wan2.2-I2V-A14B",
)
Miscellaneous

The content-type header is now correctly set when sending an image or audio request (e.g. for the image-to-image task). It is inferred from the filename or URL provided by the user. If the user passes raw bytes directly, the content-type header has to be set manually.

  • [InferenceClient] Add content-type header whenever possible + refacto by @Wauplin in #3321

A .reasoning field has been added to the Chat Completion output. This is used by some providers to return reasoning tokens separated from the .content stream of tokens.
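
A minimal sketch of reading that field (the model and prompt are illustrative; not all providers populate it):

from huggingface_hub import InferenceClient

client = InferenceClient()
completion = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-R1-0528",  # illustrative reasoning model
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)

message = completion.choices[0].message
if getattr(message, "reasoning", None):  # only set when the provider returns reasoning tokens separately
    print("Reasoning:", message.reasoning)
print("Answer:", message.content)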

MCP & tiny-agents updates

tiny-agents now handles the AGENTS.md instruction file (see https://agents.md/).

Tools filtering has also been improved to avoid loading irrelevant tools from an MCP server:

🛠️ Small fixes and maintenance

🐛 Bug and typo fixes
🏗️ internal

Community contributions

The following contributors have made changes to the library over the last release. Thank you!

v0.34.6: [v0.34.6]: Welcoming PublicAI as Inference Providers!

Compare Source

Full Changelog: huggingface/huggingface_hub@v0.34.5...v0.34.6

⚡ New provider: PublicAI

Tip: All supported PublicAI models can be found here.

Public AI Inference Utility is a nonprofit, open-source project building products and organizing advocacy to support the work of public AI model builders like the Swiss AI Initiative, AI Singapore, AI Sweden, and the Barcelona Supercomputing Center. Think of a BBC for AI, a public utility for AI, or public libraries for AI.

from huggingface_hub import InferenceClient

client = InferenceClient(provider="publicai")
completion = client.chat.completions.create(
    model="swiss-ai/Apertus-70B-Instruct-2509",
    messages=[{"role": "user", "content": "What is the capital of Switzerland?"}],
)

print(completion.choices[0].message.content)

v0.34.5: [v0.34.5]: Welcoming Scaleway as Inference Providers!

Compare Source

Full Changelog: huggingface/huggingface_hub@v0.34.4...v0.34.5

⚡ New provider: Scaleway

Tip: All supported Scaleway models can be found here. For more details, check out its documentation page.

Scaleway is a European cloud provider, serving the latest LLMs through its Generative APIs alongside a complete cloud ecosystem.

from huggingface_hub import InferenceClient

client = InferenceClient(provider="scaleway")

completion = client.chat.completions.create(
    model="Qwen/Qwen3-235B-A22B-Instruct-2507",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ],
)

print(completion.choices[0].message)

v0.34.4: [v0.34.4] Support Image to Video inference + QoL in jobs API, auth and utilities

Compare Source

The biggest update is support for the image-to-video task with the Fal AI inference provider:

>>> from huggingface_hub import InferenceClient
>>> client = InferenceClient()
>>> video = client.image_to_video("cat.jpg", model="Wan-AI/Wan2.2-I2V-A14B", prompt="turn the cat into a tiger")
>>> with open("tiger.mp4", "wb") as f:
...     f.write(video)

And some quality of life improvements:

Full Changelog: huggingface/huggingface_hub@v0.34.3...v0.34.4

v0.34.3: [v0.34.3] Jobs improvements and whoami user prefix

Compare Source

Full Changelog: huggingface/huggingface_hub@v0.34.2...v0.34.3

v0.34.2: [v0.34.2] Bug fixes: Windows path handling & resume download size fix

Compare Source

Full Changelog: huggingface/huggingface_hub@v0.34.1...v0.34.2

v0.34.1: [v0.34.1] [CLI] print help if no command provided

Compare Source

Full Changelog: huggingface/huggingface_hub@v0.34.0...v0.34.1

v0.34.0: [v0.34.0] Announcing Jobs: a new way to run compute on Hugging Face!

Compare Source

🔥🔥🔥 Announcing Jobs: a new way to run compute on Hugging Face!

We're thrilled to introduce a powerful new command-line interface for running and managing compute jobs on Hugging Face infrastructure! With the new hf jobs command, you can now seamlessly launch, monitor, and manage jobs using a familiar Docker-like experience. Run any command in Docker images (from Docker Hub, Hugging Face Spaces, or your own custom images) on a variety of hardware including CPUs, GPUs, and TPUs - all with simple, intuitive commands.

Key features:

  • 🐳 Docker-like CLI: Familiar commands (run, ps, logs, inspect, cancel) to run and manage jobs
  • 🔥 Any Hardware: Instantly access CPUs, T4/A10G/A100 GPUs, and TPUs with a simple flag
  • 📦 Run Anything: Use Docker images, HF Spaces, or custom containers
  • 📊 Live Monitoring: Stream logs in real-time, just like running locally
  • 💰 Pay-as-you-go: Only pay for the seconds you use
  • 🧬 UV Runner: Run Python scripts with inline dependencies using uv (experimental)

All features are available both from Python (run_job, list_jobs, etc.) and the CLI (hf jobs).
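
For instance, a rough Python equivalent of the first CLI example below (run_job and list_jobs are named in the release notes, but the exact parameter names here are assumptions mirroring the CLI flags):

from huggingface_hub import run_job, list_jobs

# Run a Python script on the cloud (parameters assumed to mirror `hf jobs run`)
job = run_job(
    image="python:3.12",
    command=["python", "-c", "print('Hello from the cloud!')"],
)

# List your jobs
for job_info in list_jobs():
    print(job_info)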

Example usage:

# Run a Python script on the cloud
hf jobs run python:3.12 python -c "print('Hello from the cloud!')"

# Use a GPU
hf jobs run --flavor=t4-small --namespace=huggingface ubuntu nvidia-smi

# List your jobs
hf jobs ps

# Stream logs from a job
hf jobs logs <job-id>

# Inspect job details
hf jobs inspect <job-id>

# Cancel a running job
hf jobs cancel <job-id>

# Run a UV script (experimental)
hf jobs uv run my_script.py --flavor=a10g-small --with=trl

You can also pass environment variables and secrets, select hardware flavors, run jobs in organizations, and use the experimental uv runner for Python scripts with inline dependencies.

Check out the Jobs guide for more examples and details.

🚀 The CLI is now hf! (formerly huggingface-cli)

We're glad to announce a long-awaited quality-of-life improvement: the Hugging Face CLI has been officially renamed from huggingface-cli to hf! The legacy huggingface-cli remains available without any breaking change, but is officially deprecated. We took the opportunity to update the syntax to a more modern command format hf <resource> <action> [options] (e.g. hf auth login, hf repo create, hf jobs run).

Run hf --help to learn more about the CLI options.

$ hf --help
usage: hf <command> [<args>]

positional arguments:
  {auth,cache,download,jobs,repo,repo-files,upload,upload-large-folder,env,version,lfs-enable-largefiles,lfs-multipart-upload}
                        hf command helpers
    auth                Manage authentication (login, logout, etc.).
    cache               Manage local cache directory.
    download            Download files from the Hub
    jobs                Run and manage Jobs on the Hub.
    repo                Manage repos on the Hub.
    repo-files          Manage files in a repo on the Hub.
    upload              Upload a file or a folder to the Hub. Recommended for single-commit uploads.
    upload-large-folder
                        Upload a large folder to the Hub. Recommended for resumable uploads.
    env                 Print information about the environment.
    version             Print information about the hf version.

options:
  -h, --help            show this help message and exit

⚡ Inference

🖼️ Image-to-image

Added support for image-to-image task in the InferenceClient for Replicate and fal.ai providers, allowing quick image generation using FLUX.1-Kontext-dev:

from huggingface_hub import InferenceClient

client = InferenceClient(provider="fal-ai")
# client = InferenceClient(provider="replicate")  # alternatively, use the Replicate provider

with open("cat.png", "rb") as image_file:
    input_image = image_file.read()

# output is a PIL.Image object
image = client.image_to_image(
    input_image,
    prompt="Turn the cat into a tiger.",
    model="black-forest-labs/FLUX.1-Kontext-dev",
)

In addition to this, it is now possible to directly pass a PIL.Image as input to the InferenceClient.
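
For example, a small sketch reusing the file name above (requires Pillow):

from PIL import Image
from huggingface_hub import InferenceClient

client = InferenceClient(provider="fal-ai")

# a PIL.Image can be passed directly, no need to read raw bytes first
input_image = Image.open("cat.png")

image = client.image_to_image(
    input_image,
    prompt="Turn the cat into a tiger.",
    model="black-forest-labs/FLUX.1-Kontext-dev",
)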

🤖 Tiny-Agents

tiny-agents got a nice update to deal with environment variables and secrets. We've also changed its input format to follow the VSCode config format more closely. Here is an up-to-date config to run the GitHub MCP Server with a token:

{
  "model": "Qwen/Qwen2.5-72B-Instruct",
  "provider": "nebius",
  "inputs": [
    {
      "type": "promptString",
      "id": "github-personal-access-token",
      "description": "Github Personal Access Token (read-only)",
      "password": true
    }
  ],
  "servers": [
    {
     "type": "stdio",
     "command": "docker",
     "args": [
       "run",
       "-i",
       "--rm",
       "-e",
       "GITHUB_PERSONAL_ACCESS_TOKEN",
       "-e",
       "GITHUB_TOOLSETS=repos,issues,pull_requests",
       "ghcr.io/github/github-mcp-server"
     ],
     "env": {
       "GITHUB_PERSONAL_ACCESS_TOKEN": "${input:github-personal-access-token}"
     }
    }
  ]
}
🐛 Bug fixes

InferenceClient and tiny-agents got a few quality of life improvements and bug fixes:

📤 Xet

Integration of Xet is now stable and production-ready. A majority of file transfers are now handled using this protocol on new repos. A few improvements have been shipped to ease the developer experience during uploads:

Documentation has also been written to better explain the protocol and its options:

🛠️ Small fixes and maintenance

🐛 Bug and typo fixes
🏗️ internal

v0.33.5: [v0.33.5] [Inference] Fix a UserWarning when streaming with AsyncInferenceClient

Compare Source

  • Fix: "UserWarning: ... sessions are still open..." when streaming with AsyncInferenceClient #3252

Full Changelog: huggingface/huggingface_hub@v0.33.4...v0.33.5

v0.33.4: [v0.33.4] [Tiny-Agent]: Fix schema validation error for default MCP tools

Compare Source

  • Omit parameters in default tools of tiny-agent #3214

Full Changelog: huggingface/huggingface_hub@v0.33.3...v0.33.4

v0.33.3: [v0.33.3] [Tiny-Agent]: Update tiny-agents example

Compare Source

Full Changelog: huggingface/huggingface_hub@v0.33.2...v0.33.3

v0.33.2: [v0.33.2] [Tiny-Agent]: Switch to VSCode MCP format

Compare Source

Full Changelog: huggingface/huggingface_hub@v0.33.1...v0.33.2

Breaking changes:

  • no more config nested mapping => everything at root level
  • headers at root level instead of inside options.requestInit
  • updated the way values are pulled from ENV (based on input id)

Example of agent.json:

{
  "model": "Qwen/Qwen2.5-72B-Instruct",
  "provider": "nebius",
  "inputs": [
    {
      "type": "promptString",
      "id": "hf-token",
      "description": "Token for Hugging Face API access",
      "password": true
    }
  ],
  "servers": [
    {
      "type": "http",
      "url": "https://huggingface.co/mcp",
      "headers": {
        "Authorization": "Bearer ${input:hf-token}"
      }
    }
  ]
}

Find more examples in https://huggingface.co/datasets/tiny-agents/tiny-agents

v0.33.1: [v0.33.1]: Inference Providers Bug Fixes, Tiny-Agents Message handling Improvement, and Inference Endpoints Health Check Update

Compare Source

Full Changelog: huggingface/huggingface_hub@v0.33.0...v0.33.1

This release introduces bug fixes for chat completion type compatibility and feature extraction parameters, enhanced message handling in tiny-agents, and an updated inference endpoint health check:

v0.33.0: [v0.33.0]: Welcoming Featherless.AI and Groq as Inference Providers!

Compare Source

⚡ New provider: Featherless.AI

Featherless AI is a serverless AI inference provider with unique model loading and GPU orchestration abilities that make an exceptionally large catalog of models available to users. Providers often offer either low-cost access to a limited set of models, or an unlimited range of models with users managing servers and the associated operating costs. Featherless provides the best of both worlds, offering unmatched model range and variety with serverless pricing. Find the full list of supported models on the models page.

from huggingface_hub import InferenceClient

client = InferenceClient(provider="featherless-ai")

completion = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-R1-0528", 
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ], 
)

print(completion.choices[0].message)

⚡ New provider: Groq

At the heart of Groq's technology is the Language Processing Unit (LPU™), a new type of end-to-end processing unit system that provides the fastest inference for computationally intensive applications with a sequential component, such as Large Language Models (LLMs). LPUs are designed to overcome the limitations of GPUs for inference, offering significantly lower latency and higher throughput. This makes them ideal for real-time AI applications.

Groq offers fast AI inference for openly-available models. They provide an API that allows developers to easily integrate these models into their applications. It offers an on-demand, pay-as-you-go model for accessing a wide range of openly-available LLMs.

from huggingface_hub import InferenceClient

client = InferenceClient(provider="groq")

completion = client.chat.completions.create(
    model="meta-llama/Llama-4-Scout-17B-16E-Instruct",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image in one sentence."},
                {
                    "type": "image_url",
                    "image_url": {"url": "https://vagabundler.com/wp-content/uploads/2019/06/P3160166-Copy.jpg"},
                },
            ],
        }
    ],
)

print(completion.choices[0].message)

🤖 MCP and Tiny-agents

It is now possible to run tiny-agents against a local server, e.g. llama.cpp. 100% local agents are right around the corner!
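
As a rough sketch only, a config pointing at a local OpenAI-compatible server could be generated like this (the endpointUrl field name and the local URL are assumptions, not confirmed by these release notes):

import json

local_agent = {
    "model": "Qwen/Qwen2.5-72B-Instruct",       # model name is illustrative
    "endpointUrl": "http://localhost:8080/v1",  # assumed field name for a local llama.cpp server
    "servers": [],                              # optional MCP servers, as in the examples above
}

with open("agent.json", "w") as f:
    json.dump(local_agent, f, indent=2)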

Some DX issues in the tiny-agents CLI have also been fixed.

📚 Documentation

New translation from the Hindi-speaking community, for the community!

🛠️ Small fixes and maintenance

😌 QoL improvements
🐛 Bug and typo fixes

Configuration

📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

To execute skipped test pipelines, write the comment /ok-to-test.


Documentation

Find out how to configure dependency updates in MintMaker documentation or see all available configuration options in Renovate documentation.

Signed-off-by: red-hat-konflux <126015336+red-hat-konflux[bot]@users.noreply.github.com>
red-hat-konflux bot changed the title from "Update dependency huggingface-hub to v0.36.0" to "Update dependency huggingface-hub to v0.36.0 - abandoned" on Nov 17, 2025
@red-hat-konflux
Author

Autoclosing Skipped

This PR has been flagged for autoclosing. However, it is being skipped due to the branch being already modified. Please close/delete it manually or report a bug if you think this is in error.

red-hat-konflux bot changed the title from "Update dependency huggingface-hub to v0.36.0 - abandoned" to "Update dependency huggingface-hub to v0.36.0" on Nov 17, 2025
red-hat-konflux bot changed the title from "Update dependency huggingface-hub to v0.36.0" to "Update dependency huggingface-hub to v0.36.0 - abandoned" on Nov 18, 2025
red-hat-konflux bot changed the title from "Update dependency huggingface-hub to v0.36.0 - abandoned" to "Update dependency huggingface-hub to v0.36.0" on Nov 18, 2025