refactor(streaming): remove LangChain callback dependencies from StreamingHandler #1547

Pouyanpi · 2025-12-16T14:49:05Z

PR Description

Removes LangChain callback dependencies from StreamingHandler. (one callback down)

Key Changes:

StreamingHandler no longer inherits from AsyncCallbackHandler
Streaming now uses llm.astream() with direct push_chunk() calls
Removed LangChain-specific type handling (GenerationChunk, AIMessageChunk, etc.)
Added explicit streaming_handler parameter to llm_call()
Simplified streaming interface (string-only chunks)

…amingHandler Refactored StreamingHandle by removing dependencies on LangChain callback interfaces (AsyncCallbackHandler, LLMResult, etc.). - Remove AsyncCallbackHandler inheritance from StreamingHandler - Replace callback-based streaming with direct push_chunk() interface - Add streaming_handler parameter to llm_call() for explicit streaming - Update llm_call to use llm.astream() instead of callbacks - Simplify push_chunk() to accept only strings (remove LangChain type conversions) - Remove on_chat_model_start, on_llm_new_token, on_llm_end callback methods - Update tests to use push_chunk() directly instead of mocking callbacks

codecov · 2025-12-16T14:58:28Z

Codecov Report

❌ Patch coverage is 94.28571% with 2 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
nemoguardrails/actions/llm/utils.py	93.10%	2 Missing ⚠️

📢 Thoughts on this report? Let us know!

greptile-apps · 2025-12-16T15:02:31Z

Greptile Overview

Greptile Summary

This PR refactors the streaming functionality in NeMo Guardrails by removing LangChain callback dependencies from the StreamingHandler class. The key architectural change moves away from LangChain's AsyncCallbackHandler pattern toward a more direct streaming approach. The StreamingHandler no longer inherits from AsyncCallbackHandler and instead accepts only string chunks through direct push_chunk() calls. Streaming now uses llm.astream() calls with explicit streaming_handler parameters passed to llm_call() functions throughout the codebase, replacing the previous custom_callback_handlers pattern. This refactoring simplifies the streaming interface, reduces external dependencies, and makes the system more provider-agnostic while maintaining all existing streaming functionality.

Important Files Changed

Filename	Score	Overview
`nemoguardrails/streaming.py`	4/5	Removed `AsyncCallbackHandler` inheritance and LangChain-specific chunk types, simplified to string-only chunks
`nemoguardrails/actions/llm/utils.py`	4/5	Added `streaming_handler` parameter to `llm_call()` and implemented new `_stream_llm_call()` function
`nemoguardrails/actions/llm/generation.py`	4/5	Updated multiple `llm_call()` invocations to use `streaming_handler` parameter instead of `custom_callback_handlers`
`nemoguardrails/actions/v2_x/generation.py`	4/5	Replaced `custom_callback_handlers` with `streaming_handler` parameter in passthrough LLM action
`tests/test_streaming_handler.py`	4/5	Refactored tests to remove LangChain callback testing and use direct `push_chunk()` calls
`tests/utils.py`	5/5	Added `_astream` method to `FakeLLM` class to support new streaming architecture in tests
`tests/runnable_rails/test_streaming.py`	4/5	Modified `StreamingFakeLLM` to yield `GenerationChunk` objects with proper error handling

Confidence score: 4/5

This PR requires careful review due to architectural changes in critical streaming functionality
Score reflects significant refactoring of streaming implementation across multiple core files, though changes appear well-structured and consistent
Pay close attention to the streaming handler implementation and LLM call patterns across action files

Sequence Diagram

sequenceDiagram
  participant User
  participant RunnableRails
  participant StreamingHandler as "StreamingHandler"
  participant LLM
  participant llm_call as "llm_call"

  User->>RunnableRails: stream(input)
  RunnableRails->>RunnableRails: _prepare_streaming()
  RunnableRails->>StreamingHandler: new StreamingHandler()
  RunnableRails->>RunnableRails: streaming_handler_var.set()
  RunnableRails->>RunnableRails: generate_async()
  RunnableRails->>llm_call: llm_call(streaming_handler=handler)
  llm_call->>LLM: astream(prompt)
  
  loop For each chunk
    LLM-->>llm_call: yield chunk
    llm_call->>StreamingHandler: push_chunk(chunk.content)
    StreamingHandler->>StreamingHandler: _process(chunk)
    StreamingHandler->>StreamingHandler: queue.put(chunk)
  end
  
  llm_call->>StreamingHandler: push_chunk(END_OF_STREAM)
  StreamingHandler->>StreamingHandler: streaming_finished_event.set()
  llm_call-->>RunnableRails: return completion_text
  
  loop Stream to user
    RunnableRails->>StreamingHandler: __anext__()
    StreamingHandler->>StreamingHandler: queue.get()
    StreamingHandler-->>RunnableRails: chunk
    RunnableRails-->>User: yield chunk
  end

greptile-apps

_{7 files reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

tests/test_streaming_handler.py

nemoguardrails/actions/llm/utils.py

Pouyanpi force-pushed the refactor/drop-streaming-callback branch from 040ba3c to a11a88f Compare December 16, 2025 14:49

Pouyanpi force-pushed the refactor/drop-streaming-callback branch from a11a88f to bb7f0a3 Compare December 16, 2025 14:52

Pouyanpi changed the title ~~refactor(streaming): remove LangChain callback dependencies from Stre…~~ refactor(streaming): remove LangChain callback dependencies from StreamingHandler Dec 16, 2025

Pouyanpi marked this pull request as draft December 16, 2025 14:53

greptile-apps bot reviewed Dec 16, 2025

View reviewed changes

tests/test_streaming_handler.py Outdated Show resolved Hide resolved

tests/test_streaming_handler.py Outdated Show resolved Hide resolved

nemoguardrails/actions/llm/utils.py Outdated Show resolved Hide resolved

Pouyanpi added 3 commits December 16, 2025 16:23

remove alias test

721831e

fix

43e04ca

remove unused first_token instance variable

714066b

Pouyanpi mentioned this pull request Jan 6, 2026

refactor(streaming)!: drop streaming field from config #1538

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor(streaming): remove LangChain callback dependencies from StreamingHandler #1547

refactor(streaming): remove LangChain callback dependencies from StreamingHandler #1547

Uh oh!

Pouyanpi commented Dec 16, 2025

Uh oh!

codecov bot commented Dec 16, 2025

Uh oh!

greptile-apps bot commented Dec 16, 2025

Confidence score: 4/5

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

refactor(streaming): remove LangChain callback dependencies from StreamingHandler #1547

Are you sure you want to change the base?

refactor(streaming): remove LangChain callback dependencies from StreamingHandler #1547

Uh oh!

Conversation

Pouyanpi commented Dec 16, 2025

PR Description

Uh oh!

codecov bot commented Dec 16, 2025

Codecov Report

Uh oh!

greptile-apps bot commented Dec 16, 2025

Greptile Overview

Greptile Summary

Important Files Changed

Confidence score: 4/5

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants