feat(py): Add type safety to Python SDK #4309 #4310

huangjeff5 · 2026-01-28T19:13:00Z

Summary

This PR adds full type safety to the Python SDK to match what we have in JS. The core goal: when you call generate(), define_prompt(), or use a flow, the return types should be known at dev time so your IDE can autocomplete and catch errors before runtime.

What was broken

Type information was getting lost at key boundaries:

# Types lost - IDE shows `Any` or `object`
response = await ai.generate(output_schema=Recipe)
response.output.ingredients  # No autocomplete, no error checking

result = await recipe_prompt(input)
result.output.name  # Also untyped

What this PR does

Generic Action class

Made Action generic so types flow through:

class Action(Generic[InputT, OutputT, ChunkT]):
    async def arun(self, input: InputT) -> ActionResponse[OutputT]: ...
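
To illustrate how the three type parameters let a checker follow a value through an action, here is a minimal, self-contained sketch. It is not the SDK's implementation: `ActionResponse` is reduced to a single `response` field, and the `fn` constructor argument and the `shout` action are hypothetical names chosen for the example.

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable, Generic, TypeVar

InputT = TypeVar("InputT")
OutputT = TypeVar("OutputT")
ChunkT = TypeVar("ChunkT")


@dataclass
class ActionResponse(Generic[OutputT]):
    """Simplified stand-in: carries only the typed result."""

    response: OutputT


class Action(Generic[InputT, OutputT, ChunkT]):
    """Stand-in action wrapping an async function (hypothetical constructor)."""

    def __init__(self, fn: Callable[[InputT], Awaitable[OutputT]]) -> None:
        self._fn = fn

    async def arun(self, input: InputT) -> ActionResponse[OutputT]:
        return ActionResponse(response=await self._fn(input))


async def shout(text: str) -> str:
    return text.upper()


# ChunkT is unused in this sketch; a streaming action would use it for chunks.
shout_action: Action[str, str, None] = Action(shout)


async def main() -> str:
    result = await shout_action.arun("hello")
    # A checker infers ActionResponse[str] here, so .response is a str.
    return result.response
```

With this shape, passing an `int` to `shout_action.arun()` would be reported by a type checker rather than failing at runtime.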

Typed Output[T] for generate()

from genkit import Output

class Recipe(BaseModel):
    name: str
    ingredients: list[str]

response = await ai.generate(
    prompt="Give me a pasta recipe",
    output=Output(schema=Recipe),
)

# Now these are typed
response.output.name         # str
response.output.ingredients  # list[str]
response.output.typo         # IDE error - caught before runtime
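
The mechanism behind this can be sketched without the SDK: a generic `Output[T]` remembers the schema class, and the function that consumes it returns a response parameterized by the same `T`. Everything below is an assumption for illustration. `fake_generate` and `Response` are hypothetical, and a dataclass stands in for the Pydantic model so the example has no third-party dependencies.

```python
import json
from dataclasses import dataclass
from typing import Generic, TypeVar

T = TypeVar("T")


@dataclass
class Recipe:
    name: str
    ingredients: list[str]


@dataclass
class Output(Generic[T]):
    """Stand-in for the typed output config: remembers the schema class."""

    schema: type[T]


@dataclass
class Response(Generic[T]):
    output: T


def fake_generate(raw_json: str, output: Output[T]) -> Response[T]:
    """Hypothetical helper: parse model JSON into the requested schema."""
    data = json.loads(raw_json)
    return Response(output=output.schema(**data))


response = fake_generate(
    '{"name": "Carbonara", "ingredients": ["pasta", "egg"]}',
    Output(schema=Recipe),
)
# A checker infers Response[Recipe], so response.output.name is a str.
```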

Typed generate_stream()

Same pattern works for streaming:

stream, future = ai.generate_stream(
    prompt="Give me a recipe",
    output=Output(schema=Recipe),
)

response = await future
response.output.name  # Typed as str

Typed prompts with Input[T] and Output[T]

Prompts (including dotprompt files) get full type safety:

from genkit import Input, Output

class RecipeInput(BaseModel):
    dish: str
    servings: int

class Recipe(BaseModel):
    name: str
    ingredients: list[str]

recipe_prompt = ai.define_prompt(
    name="recipe",
    prompt="Create a recipe for {{dish}} serving {{servings}} people",
    input=Input(schema=RecipeInput),
    output=Output(schema=Recipe),
)

# Input is type-checked
response = await recipe_prompt(RecipeInput(dish="pizza", servings=4))

# Output is typed
response.output.name  # str
response.output.ingredients  # list[str]

# Errors caught at dev time
await recipe_prompt("pizza")  # Wrong type - IDE flags this
response.output.wrong_field   # No such attribute - IDE flags this

Typed flows

Flows preserve types through the decorator:

@ai.flow()
async def analyze_document(doc: Document) -> Analysis:
    ...

analysis = await analyze_document(my_doc)
analysis.summary  # IDE knows this exists

Breaking change

output_schema has been removed from generate() and generate_stream(). Use output=Output(schema=...) instead:

# Before
response = await ai.generate(output_schema=Recipe, output_format="json")

# After
response = await ai.generate(output=Output(schema=Recipe))

Other changes

Added typed logger wrapper
Type-safe registry lookups
Reduced Any usage across codebase
Added @override decorators where needed
Zero pyright errors in strict mode

Testing

Pyright verification tests in tests/typing/ - all pass
Negative tests verify errors are caught
Existing unit tests pass
Manually verified autocomplete in VS Code

Add TypeVar generics (InputT, OutputT, ChunkT) to the Action class for improved type inference in flows and tools.

- Action[InputT, OutputT, ChunkT] now properly types inputs/outputs
- FlowWrapper preserves callable signature for correct return types
- Uses typing_extensions for Python 3.10+ compatibility
- Adds CI type checking with pyright, mypy, and ty

This enables IDE autocomplete and type checking for:

- Flow return types: result = await my_flow() -> typed
- Tool return types: result = await my_tool() -> typed
- Streaming chunks: async for chunk in stream -> typed

Improve public API by consolidating exports:

- genkit/__init__.py: Export Genkit, GenkitError, Message, Part, etc. Users can now: from genkit import Genkit, Message, Part
- genkit/ai/__init__.py: Add explicit __all__ with Genkit properly exported
- genkit/types/__init__.py:
  - Remove internal types (ActionRunContext, *Wrapper, Constrained)
  - Add ToolInterruptError for user error handling
  - Organize exports by category (Message, Document, Generation, etc.)

Aligns Python API surface with JS/Go patterns for better DX.

Update internal imports to use specific module paths instead of re-export modules, satisfying basedpyright's reportPrivateImportUsage:

- Channel, ensure_async: from genkit.aio.* internal modules
- find_free_port_sync: from genkit.web.manager._ports
- GenkitSpan, init_telemetry_server_exporter: from genkit.core.trace.*
- FormatDef, Formatter: from genkit.blocks.formats.types

No behavior change - purely import path updates for stricter type checking.

- Add @override decorator to 35 methods that override parent classes (formats, trace exporters, session stores, web adapters)
- Add _ = to ~50 function calls where return values are intentionally ignored (satisfies basedpyright reportUnusedCallResult)

This improves type safety by:

- Making method overrides explicit (catches typos and broken inheritance)
- Documenting intentional ignored return values

This commit adds comprehensive type safety improvements:

1. Output[T] class for type-safe output configuration:
   - `response = await ai.generate(output=Output(schema=Recipe))`
   - `response.output` is now typed as `Recipe`
2. GenerateResponseWrapper[T] generic:
   - The response wrapper is now generic over the output type
   - Full end-to-end type inference from Output[T] to response.output
3. Fixed reportUnannotatedClassAttribute warnings (196 fixes):
   - Added type annotations to all class instance attributes
   - Fixed schema generator to produce ClassVar[ConfigDict] annotations
4. Fixed reportMissingTypeArgument warnings (59 fixes):
   - Added type arguments to Formatter, Channel, Callable, etc.
   - Added type arguments to PromptFunction, PromptMetadata
   - Added type arguments to RetrieverFn, IndexerFn, RerankerFn, etc.
5. Export improvements:
   - Exported GenerateResponseWrapper from genkit package
   - Users can now type hint with GenerateResponseWrapper[T]

Total warnings fixed: ~340 across 40+ files

1. Fix schema generator to use Field(default=None) instead of Field(None):
   - Pyright doesn't recognize Field(None) as providing a default value
   - Changed 70+ occurrences in auto-generated typing.py
   - Also handles Field(None, alias=...) pattern
2. Fix ParamSpec issues in tool decorator (_registry.py):
   - Added pyright: ignore comments for dynamic dispatch code
   - ParamSpec can't be statically verified with runtime arg inspection
3. Fix callable check in prompt.py:
   - Added callable(factory) guard before calling dynamic factory

Total reportCallIssue fixes: 39 → 0

1. tracing.py: Fixed actual bug where `span` could be unbound
   - Moved GenkitSpan creation before try block
   - Previously would crash if GenkitSpan() threw in except handler
2. _info.py: Fixed optional psutil import pattern
   - Changed from HAS_PSUTIL flag to `psutil = None` pattern
   - Pyright can now track the None check for type narrowing
3. typing.py: Fixed optional litestar/starlette imports
   - Changed from HAVE_* flags to `module = None` pattern
   - Pyright can now verify conditional type aliases

Total reportPossiblyUnboundVariable fixes: 38 → 0

Fixed 25 reportUnusedParameter warnings by prefixing unused parameters with `_` to indicate they are intentionally unused.

Files modified:

- _registry.py: kwargs in flow wrappers
- generate.py: preamble, raw_request, model, registry
- prompt.py: dir parameter
- retriever.py: ctx in wrapper functions
- _action.py: telemetry_labels, input_spec
- _util.py: chunk in noop callback
- flows.py: request in health_check
- reflection.py: encoding, request params, action_input
- testing.py: ctx in model_fn
- _ports.py: host parameter
- signals.py: frame in signal handler

Added super().__init__() calls (3 fixes):

- GenerationResponseError: pass message to Exception base
- ToolInterruptError: call Exception.__init__
- RedactedSpan: call ReadableSpan.__init__

Suppressed reportUnreachable for intentional code (13 fixes):

- Python 3.10 compatibility branches (sys.version_info < 3.11)
- Defensive null checks that type narrowing makes unreachable
- Exhaustive match/isinstance patterns with fallback branches
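
As a small illustration of why the base initializer matters for exception subclasses, consider the sketch below. `ResponseError` is a hypothetical class, not one of the SDK's errors. Passing only the message to `Exception.__init__` keeps `str(exc)` and `exc.args` limited to the message even though the constructor takes extra data.

```python
class ResponseError(Exception):
    """Hypothetical error type that also carries the offending response."""

    def __init__(self, message: str, response: object) -> None:
        # Forward only the message so str(exc) stays human-readable.
        super().__init__(message)
        self.response = response


err = ResponseError("model returned no candidates", {"candidates": []})
```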

Fixed 43 warnings across 7 categories:

- reportImplicitStringConcatenation (3): Added explicit '+' for multi-line f-string concatenation
- reportInvalidCast (3): Used cast(object, x) as intermediary for MatchableAction casts
- reportUnsupportedDunderAll (6): Converted __name__ to literal strings in __all__ exports
- reportUnnecessaryIsInstance (6): Suppressed defensive runtime type checks
- reportUnnecessaryComparison (6): Suppressed defensive null checks that type narrowing makes unnecessary
- reportPrivateUsage (13): Suppressed internal access to _private members within SDK code
- reportGeneralTypeIssues (6):
  - Fixed dict unpacking with proper isinstance checks
  - Suppressed complex TypeVar issues in FlowWrapper

Phase 3a: Create typed Logger protocol wrapper for structlog

- Added genkit.core.logging module with Logger protocol and get_logger()
- Updated 17 files to use typed logger instead of structlog.get_logger()
- Export Logger and get_logger from genkit.core
- Eliminates ~100 reportAny warnings from structlog's dynamic methods

Phase 3b: Add typed action lookup methods to Registry

- Added resolve_retriever(), resolve_embedder(), resolve_reranker(), resolve_model(), resolve_evaluator() methods with proper type casts
- Updated callers in _aio.py, generate.py, reranker.py to use typed lookups
- Eliminates ~10 reportAny warnings from dynamic registry lookups

Also includes:

- Design docs for Phase 3: phase3-typed-internals.md
- Implementation tasks: phase3-typed-internals-tasks.md
- Updated mock registry in embedding_test.py for new method

Total reduction: ~110 reportAny warnings eliminated

- Logger protocol: Use `object` for **kwargs and `None` return type instead of `Any` - eliminates 35+ warnings
- Loop utilities: Make run_async, iter_over_async, run_loop generic with TypeVar instead of Any - eliminates 11 warnings

These changes improve type safety while maintaining compatibility with structlog and asyncio patterns.
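
A typed logger protocol along these lines might look like the sketch below. It is an assumption-laden illustration: the real module wraps structlog, whereas this version adapts the standard `logging` module so it runs without extra dependencies, and `_StdlibLogger` is a hypothetical adapter. What it shares with the description above is the signature shape, with `**kwargs: object` and a `None` return type in place of `Any`.

```python
import logging
from typing import Protocol


class Logger(Protocol):
    """Structural type for loggers: keyword fields are `object`, not `Any`."""

    def info(self, event: str, **kwargs: object) -> None: ...

    def warning(self, event: str, **kwargs: object) -> None: ...

    def error(self, event: str, **kwargs: object) -> None: ...


class _StdlibLogger:
    """Hypothetical adapter that satisfies Logger via the logging module."""

    def __init__(self, name: str) -> None:
        self._log = logging.getLogger(name)

    def _emit(self, level: int, event: str, fields: dict[str, object]) -> None:
        # Render keyword fields as key=value pairs after the event text.
        suffix = " ".join(f"{key}={value!r}" for key, value in fields.items())
        self._log.log(level, f"{event} {suffix}".strip())

    def info(self, event: str, **kwargs: object) -> None:
        self._emit(logging.INFO, event, kwargs)

    def warning(self, event: str, **kwargs: object) -> None:
        self._emit(logging.WARNING, event, kwargs)

    def error(self, event: str, **kwargs: object) -> None:
        self._emit(logging.ERROR, event, kwargs)


def get_logger(name: str) -> Logger:
    """Return a logger whose methods are fully typed for checkers."""
    return _StdlibLogger(name)
```

Callers then write `get_logger(__name__).info("flow started", flow="recipe")` and a checker sees concrete method signatures instead of dynamically resolved attributes.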

- Use typed logger (get_logger) instead of structlog.get_logger
- Fix ActionRunContext to be Optional and add None checks
- Add type arguments to bare dict return types
- Prefix unused parameters with underscore
- Fix implicit string concatenation
- Add pyright ignore for Python version compatibility check

Reduces from 6 errors + 30 warnings to 0 errors + 14 warnings. Remaining warnings are from namespace package resolution for plugins.

Common fixes applied:

- Change `ctx: ActionRunContext = None` to `ctx: ActionRunContext | None = None`
- Add null checks before accessing ctx.is_streaming and ctx.send_chunk
- Add type arguments to bare `dict` and `list` return types
- Prefix unused parameters with underscore
- Fix relative imports in evaluator-demo
- Use typed logger (get_logger) in chat-demo
- Fix ActionRunContext import path in anthropic-hello

Reduces total errors across samples from 48+ to ~24. Remaining errors are complex type issues (method overrides, Streamlit types, etc.) that need deeper investigation.

Shows the Output[T] pattern for getting typed responses from ai.generate():

    response = await ai.generate(
        prompt='...',
        output=Output(schema=Recipe),  # The magic!
    )
    response.output  # Typed as Recipe, not Any!

This enables full IDE autocomplete on response.output fields.

- Replace structlog.get_logger with genkit.core.logging.get_logger in all 18 samples for proper type hints
- Fix ctx null checks in compat-oai-hello
- Make pyrightconfig.json portable (relative venvPath)
- Add reportMissingTypeStubs: false to suppress harmless warnings

… instead

BREAKING CHANGE: The `output_schema` parameter has been removed from `ai.generate()` and `ai.generate_stream()`. Use `output=Output(schema=YourSchema)` instead, which provides full type inference on `response.output`.

Before:

    response = await ai.generate(prompt='...', output_schema=Recipe)
    result = cast(Recipe, response.output)  # Manual cast needed

After:

    response = await ai.generate(prompt='...', output=Output(schema=Recipe))
    result = response.output  # Typed as Recipe automatically!

This aligns with the JS SDK which uses `output: { schema: ... }`. Updated all samples to use the new pattern.

Channel[T] is now Channel[T, R] where:

- T = type of items streamed through the channel
- R = type of the close future result

This fixes the type mismatch where streaming chunks (GenerateResponseChunkWrapper) and the final response (GenerateResponseWrapper) are different types.
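
To make the two-parameter design concrete, here is a simplified channel built on `asyncio.Queue` and a future. It is a sketch under stated assumptions, not the SDK's `Channel`: the `send`, `close`, and `closed` names are hypothetical, and error handling and timeouts are omitted.

```python
import asyncio
from typing import AsyncIterator, Generic, TypeVar, cast

T = TypeVar("T")  # type of items streamed through the channel
R = TypeVar("R")  # type of the close-future result


class Channel(Generic[T, R]):
    """Simplified stand-in showing separate item and result types."""

    _DONE = object()  # sentinel marking the end of the stream

    def __init__(self) -> None:
        self._queue: asyncio.Queue[object] = asyncio.Queue()
        self.closed: asyncio.Future[R] = asyncio.get_running_loop().create_future()

    def send(self, item: T) -> None:
        self._queue.put_nowait(item)

    def close(self, result: R) -> None:
        self._queue.put_nowait(self._DONE)
        self.closed.set_result(result)

    async def __aiter__(self) -> AsyncIterator[T]:
        while (item := await self._queue.get()) is not self._DONE:
            yield cast(T, item)


async def main() -> tuple[list[str], int]:
    channel: Channel[str, int] = Channel()
    for word in ("a", "b"):
        channel.send(word)
    channel.close(2)  # final result has a different type than the chunks
    chunks = [chunk async for chunk in channel]
    return chunks, await channel.closed
```

Here the chunks are `str` and the final result is `int`, mirroring how streamed chunks and the final response are different types in the SDK.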

Key files fixed:

- aio/channel.py: Fixed unbound 'pending' variable in timeout handler, fixed set_exception type narrowing with walrus operator
- ai/_aio.py: Added pyright ignores for list invariance (Document vs DocumentData)
- blocks/generate.py: Added explicit type params to Action[Any, Any, Any]
- blocks/model.py: Fixed message override with ignore, added None check
- blocks/prompt.py: Fixed PromptMetadata dict typing, added ignores for dynamic Action attributes (_executable_prompt, _async_factory)
- core/action/_action.py: Fixed telemetry_labels parameter name, added Channel type params, fixed stream callback type
- core/flows.py: Fixed 'eerror' typo to 'aerror', added return type ignore
- session/chat.py: Suppressed import cycle warning (TYPE_CHECKING guarded)

Reduced errors from 50+ to 0 in the 10 key genkit files.

Changes:

- Remove unused import (EmbedResponse)
- Fix unused call result (task.cancel())
- Fix import locations (Action, ActionKind)
- Remove unnecessary casts and isinstance checks
- Add pyright config to suppress intentional Any usage:
  - reportExplicitAny, reportAny (intentional dynamic typing)
  - reportUnknown* (external library types)

Result: 0 errors, 0 warnings on the 8 key genkit files.

ExecutablePrompt is now generic: ExecutablePrompt[OutputT]

When defining a prompt with output=Output(schema=T), the returned prompt is typed as ExecutablePrompt[T], and all calls return GenerateResponseWrapper[T] with typed .output property.

Example:

```python
class Recipe(BaseModel):
    name: str
    ingredients: list[str]

recipe_prompt = ai.define_prompt(
    name='recipe',
    prompt='Create a recipe for {food}',
    output=Output(schema=Recipe),  # Type captured here
)

response = await recipe_prompt({'food': 'pizza'})
response.output.name  # ✓ Typed as str, autocomplete works!
```

Changes:

- Make ExecutablePrompt[OutputT] generic
- Make GenerateStreamResponse[OutputT] generic
- Add overloads to define_prompt() for type inference
- Add overloads to GenkitRegistry.define_prompt()
- Add typing tests for ExecutablePrompt

This matches the JS SDK pattern where the output type is captured at prompt definition time.

Add comprehensive examples showing all Output fields with define_prompt:

- Basic usage with just schema
- Full usage with format, content_type, instructions, constrained
- Streaming example
- Type checking demo

Files:

- typing-manual-test/main.py: added full Output fields example
- typing-evaluation/src/typed_prompt_example.py: new comprehensive example

ExecutablePrompt is now ExecutablePrompt[InputT, OutputT], matching JS SDK.

When defining a prompt with both input=Input(schema=I) and output=Output(schema=O), the returned prompt is typed as ExecutablePrompt[I, O]:

- Input is type-checked when calling the prompt
- Output is typed on response.output

Example:

```python
class RecipeInput(BaseModel):
    dish: str
    servings: int

class Recipe(BaseModel):
    name: str
    ingredients: list[str]

recipe_prompt = ai.define_prompt(
    name='recipe',
    prompt='Create a recipe for {dish} serving {servings}',
    input=Input(schema=RecipeInput),  # ← Input typed!
    output=Output(schema=Recipe),     # ← Output typed!
)

# Input type-checked:
response = await recipe_prompt(RecipeInput(dish='pizza', servings=4))
# response = await recipe_prompt({'dish': 'pizza'})  # ❌ Type error!

response.output.name  # ✓ Typed as str
```

Changes:

- Add Input[T] class in _aio.py
- Update ExecutablePrompt to Generic[InputT, OutputT]
- Update __call__ and stream to accept InputT
- Add 4 overloads to define_prompt for all input/output combinations
- Convert Pydantic models to dicts for template rendering
- Export Input from genkit.ai
- Update typed_prompt_example.py with full examples

JS/Python parity table:

| Feature             | JS  | Python |
|---------------------|-----|--------|
| Prompt typed input  | ✅  | ✅     |
| Prompt typed output | ✅  | ✅     |
| generate output     | ✅  | ✅     |

- Add generate_stream() overloads for typed Output[T]
- Update docs and tests to use Output() instead of output_schema
- Add streaming output type verification test
- Fix evaluators plugin to use new Output API
- Remove internal design docs and test samples

gemini-code-assist · 2026-01-28T19:16:17Z

Summary of Changes

Hello @huangjeff5, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly upgrades the Python SDK's type safety, providing a more robust and developer-friendly experience. By introducing generic Input and Output classes, and extensively applying modern Python typing features, the changes enable static analysis tools to provide richer feedback, improve code maintainability, and reduce runtime errors. The refactoring touches core components like action definitions, flow wrappers, and generation methods, ensuring that type information is preserved and utilized effectively across the library.

Highlights

Enhanced Type Safety: Introduced generic Input and Output classes for generate and define_prompt methods, enabling static type checking of input and output schemas. This significantly improves type inference and reduces the need for Any.
Comprehensive Type Annotations: Extensively applied modern Python typing features such as TypeVar, Generic, overload, ClassVar, and override across the SDK. Core components like Action, FlowWrapper, and Channel are now generic, preserving type information throughout the library.
Improved Developer Experience: A new typing-guide.md documentation file has been added to explain how to leverage the new type safety features for better IDE autocomplete, inline documentation, and early error detection, fostering a more robust development workflow.
Pyright Compatibility & Typed Logging: Specific adjustments and pyright: ignore comments were added to ensure compatibility and suppress warnings from the Pyright static type checker. Additionally, a new genkit.core.logging module with a Logger Protocol provides type-safe logging throughout the SDK.

Ignored Files

Ignored by pattern: .github/workflows/** (1)
- .github/workflows/python.yml

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This is an excellent and extensive pull request that brings comprehensive type safety to the Python SDK. The introduction of generic types for core components like Action, ExecutablePrompt, and FlowWrapper, along with the new Input[T] and Output[T] classes, is a significant improvement for developer experience and code correctness. The changes are consistently applied across the codebase, including updates to samples and the addition of typing verification tests. The new typed logger and pyright configuration are also great additions. I have one minor suggestion for code clarity.

py/samples/compat-oai-hello/src/main.py

…o jh-py-typing

- Fix streaming tests to use .response property (ActionResponse change)
- Fix RedactedSpan by removing incorrect super().__init__() call
- Fix Channel TypeVar default for backward compatibility
- Fix Channel timeout to not cancel external close_future
- Add per-file-ignores for typing tests in pyproject.toml
- Fix missing docstring args in _action.py and _util.py
- Fix imports and formatting to pass lint checks

Consolidates version-specific imports (StrEnum, override) into a single compatibility module to eliminate code duplication and improve maintainability.

Changes:

- Created genkit/core/_compat.py with centralized version checks
- Refactored StrEnum imports (was duplicated in 4 files)
- Refactored override decorator imports (was duplicated in 12 files)
- Updated schema generator to use _compat module
- All compatibility logic now in one place for easier updates
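
A compatibility module of this kind typically branches on `sys.version_info`. The sketch below shows one plausible shape and is not the contents of `genkit/core/_compat.py`. The minimal `StrEnum` backport and the no-op `override` fallback are assumptions added so the example runs even where `typing_extensions` is unavailable, and `Role`, `Base`, and `Child` are hypothetical.

```python
import sys

if sys.version_info >= (3, 11):
    from enum import StrEnum
else:
    from enum import Enum

    class StrEnum(str, Enum):
        """Minimal backport: members are also plain strings."""

if sys.version_info >= (3, 12):
    from typing import override
else:
    try:
        from typing_extensions import override
    except ImportError:  # fallback for this sketch only

        def override(method):
            """No-op marker used when no real decorator is available."""
            return method


class Role(StrEnum):
    USER = "user"
    MODEL = "model"


class Base:
    def describe(self) -> str:
        return "base"


class Child(Base):
    @override  # a checker flags this if Base.describe is renamed or removed
    def describe(self) -> str:
        return "child"
```

Other modules can then import `StrEnum` and `override` from the one compatibility module instead of repeating the version checks.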

huangjeff5 added 28 commits January 27, 2026 14:53

fix(py): fix typo google_geai → google_genai in sample

4517b57

Merge branch 'main' into jh-py-typing

7a7660c

fix(py): add type annotation to Channel in generate_stream

46d8b21

Merge branch 'main' into jh-py-typing

52e91b6

github-project-automation bot added this to Genkit Backlog Jan 28, 2026

github-actions bot added the docs Improvements or additions to documentation label Jan 28, 2026

github-actions bot added python Python config root labels Jan 28, 2026

huangjeff5 added 3 commits January 28, 2026 13:13

Delete .cursor/rules/slow-multi-s.mdc

79a53a0

Delete py/docs/typing-guide.md

6958d15

Delete pyrightconfig.json

26fd493

huangjeff5 added 2 commits January 28, 2026 13:18

Delete py/samples/realtime-tracing-demo/src/test_audit.py

f4470f0

Delete tests/__pycache__/test_typing_verification.cpython-312-pytest-…

f9c1784

…9.0.2.pyc

gemini-code-assist bot reviewed Jan 28, 2026

py/samples/compat-oai-hello/src/main.py (resolved)

huangjeff5 added 11 commits January 28, 2026 13:24

move tests to python tree

780d3dc

Merge branch 'jh-py-typing' of https://github.com/firebase/genkit int…

88dd52a

…o jh-py-typing

fix comment

1776d51

ruff format

2fd367c

remove test, fix workflow

9585961

remove extra tests and stuff

80898c7

fix ty type errors

ba3c8c6

Merge branch 'main' into jh-py-typing

b051b3e

more ty fixes

d5f185a

huangjeff5 marked this pull request as ready for review January 29, 2026 04:16

huangjeff5 added 2 commits January 28, 2026 22:17

fix async def

6da2ece

remove async change

7028cf5

yesudeep approved these changes Jan 29, 2026

huangjeff5 merged commit d0764f8 into main Jan 29, 2026
22 checks passed

huangjeff5 deleted the jh-py-typing branch January 29, 2026 16:21

github-project-automation bot moved this to Done in Genkit Backlog Jan 29, 2026
