Skip to content

Conversation

@archana-ramalingam
Copy link
Collaborator

@archana-ramalingam archana-ramalingam commented Oct 16, 2025

Standardize toy Llama IREE test to more effectively monitor regressions in the production model runs.

  • Add production compile flags to toy llama test
  • Move TestToyLlamaIree to GPU
  • attn_head_dim must be at least 64 in toy llama

TODO:
Regenerate the inputs IDs in test_llama.py and toy_llama_test.py, as toy llama has changed

@github-actions
Copy link
Contributor

github-actions bot commented Oct 16, 2025

Coverage report

Click to see where and how coverage changed

FileStatementsMissingCoverageCoverage
(new stmts)
Lines missing
  sharktank/sharktank/examples
  export_paged_llm_v1.py
  sharktank/sharktank/layers
  paged_attention.py
  sharktank/sharktank/layers/configs
  llm_configs.py
  sharktank/sharktank/models/llm
  export.py
  sharktank/sharktank/utils
  llm_artifacts.py
  llm_utils.py
  sharktank/tests/models/llama
  toy_llama_test.py 34-42
Project Total  

This report was generated by python-coverage-comment-action

@archana-ramalingam archana-ramalingam changed the title [sharktank] Add compile flags to toy llama [sharktank] Standardize the Toy Llama test to track regressions Oct 17, 2025
@archana-ramalingam archana-ramalingam changed the title [sharktank] Standardize the Toy Llama test to track regressions [sharktank] Standardize Toy Llama IREE test to track regressions Oct 17, 2025
@archana-ramalingam archana-ramalingam marked this pull request as draft October 21, 2025 20:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants