Support custom http headers in inference requests by achandrasekar · Pull Request #192 · kubernetes-sigs/inference-perf

achandrasekar · 2025-08-21T06:23:22Z

Fixes #181

k8s-ci-robot · 2025-08-21T06:23:28Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: achandrasekar

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [achandrasekar]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

achandrasekar · 2025-08-21T06:25:18Z

Tested with the config below. The vllm_client log lines show the custom headers attached to each request.

None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and only tokenizers, configuration and file/data utilities can be used.
2025-08-21 06:07:16,957 - inference_perf.config - INFO - Using configuration from: config.yml
2025-08-21 06:07:16,964 - inference_perf.config - INFO - Benchmarking with the following config:

api:
  type: completion
  streaming: true
  headers:
    x-something-header: something-header-value
    x-something-else-header: something-else-header-value
data:
...

2025-08-21 06:07:16,965 - inference_perf.client.filestorage.local - INFO - Report files will be stored at: ./reports
2025-08-21 06:07:19,617 - inference_perf.client.modelserver.vllm_client - INFO - Inferred model google/gemma-3-27b-it
2025-08-21 06:07:21,886 - inference_perf.loadgen.load_generator - INFO - Stage 0 - run started
2025-08-21 06:07:23,122 - inference_perf.client.modelserver.vllm_client - INFO - Headers in request {'Content-Type': 'application/json', 'x-something-header': 'something-header-value', 'x-something-else-header': 'something-else-header-value'}
2025-08-21 06:07:24,191 - inference_perf.client.modelserver.vllm_client - INFO - Headers in request {'Content-Type': 'application/json', 'x-something-header': 'something-header-value', 'x-something-else-header': 'something-else-header-value'}
2025-08-21 06:07:26,727 - inference_perf.client.modelserver.vllm_client - INFO - Headers in request {'Content-Type': 'application/json', 'x-something-header': 'something-header-value', 'x-something-else-header': 'something-else-header-value'}
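
The header merge visible in the log above (the default Content-Type plus the configured `x-...` headers) can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `build_request_headers` is a hypothetical helper, and letting configured headers override the default is an assumption.

```python
from typing import Dict, Optional


def build_request_headers(custom_headers: Optional[Dict[str, str]] = None) -> Dict[str, str]:
    """Hypothetical helper: combine the default JSON header with user-supplied ones."""
    # Default header sent with every request, as seen in the log output.
    headers = {"Content-Type": "application/json"}
    # Assumption: configured headers are layered on top and win on key collisions.
    if custom_headers:
        headers.update(custom_headers)
    return headers


# Mirrors the `api.headers` section of the config shown above.
api_config = {
    "type": "completion",
    "streaming": True,
    "headers": {
        "x-something-header": "something-header-value",
        "x-something-else-header": "something-else-header-value",
    },
}

print(build_request_headers(api_config.get("headers")))
```

With no `headers` section in the config, the sketch falls back to the default Content-Type header alone.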

jjk-g · 2025-08-21T23:05:36Z

/lgtm

Support custom http headers in inference requests

e06bc2a

k8s-ci-robot added the "cncf-cla: yes" label (indicates the PR's author has signed the CNCF CLA) Aug 21, 2025

k8s-ci-robot requested review from ArangoGutierrez and jjk-g August 21, 2025 06:23

k8s-ci-robot added the "approved" label (indicates a PR has been approved by an approver from all required OWNERS files) and the "size/XS" label (denotes a PR that changes 0-9 lines, ignoring generated files) Aug 21, 2025

k8s-ci-robot assigned jjk-g Aug 21, 2025

k8s-ci-robot added the "lgtm" label ("Looks good to me", indicates that a PR is ready to be merged) Aug 21, 2025

achandrasekar merged commit 64eb360 into kubernetes-sigs:main Aug 21, 2025
3 of 4 checks passed

achandrasekar merged 1 commit into kubernetes-sigs:main from achandrasekar:headers
