Conversation
- Use cryptographically secure ID generation with node:crypto - Implement runtime validation for JSON parsing using Zod - Optimize circular dependency validation to avoid N+1 file reads
Summary of ChangesHello @anj-s, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request introduces a comprehensive suite of behavioral evaluations for the Highlights
Changelog
Activity
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here. You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension. Footnotes
|
|
Hi @anj-s, thank you so much for your contribution to Gemini CLI! We really appreciate the time and effort you've put into this. We're making some updates to our contribution process to improve how we track and review changes. Please take a moment to review our recent discussion post: Improving Our Contribution Process & Introducing New Guidelines. Key Update: Starting January 26, 2026, the Gemini CLI project will require all pull requests to be associated with an existing issue. Any pull requests not linked to an issue by that date will be automatically closed. Thank you for your understanding and for being a part of our community! |
|
Hi there! Thank you for your contribution to Gemini CLI. To improve our contribution process and better track changes, we now require all pull requests to be associated with an existing issue, as announced in our recent discussion and as detailed in our CONTRIBUTING.md. This pull request is being closed because it is not currently linked to an issue. Once you have updated the description of this PR to link an issue (e.g., by adding How to link an issue: Thank you for your understanding and for being a part of our community! |
|
Size Change: -4 B (0%) Total Size: 26.2 MB ℹ️ View Unchanged
|
There was a problem hiding this comment.
Code Review
This pull request introduces valuable behavioral evaluation tests for the task tracker feature, covering both explicit and implicit tool usage scenarios. The tests are well-structured and the prompts are clear. The identified logical issue in an assertion was valid, and a suggestion has been provided to correct it.
|
FYI: Ran these through the nightly run: https://github.com/google-gemini/gemini-cli/actions/runs/22870896466 Looks like they pass 66-100% of the time, depending on model, though it fails at 0% for some models. Not necessarily blocking. |
|
The report lists the results from the last ~7 runs. The rightmost column is the current run. The columns left of that are from previous runs. No numbers means the test did not run in that run, in this case, because it didn't exist. |

Summary
This PR introduces behavioral evaluations for the Task Tracker in
evals/tracker.eval.ts. These tests ensure the model correctly utilizes tracker tools (tracker_create_task,tracker_update_task) in both explicit and implicit scenarios whenApprovalMode.YOLOis enabled.Details
experimental.taskTracker = trueinto the model settings.Related Issues
Fixes #19965
How to Validate
Run the evaluation tests locally:
Pre-Merge Checklist