

@igordayen igordayen commented Nov 13, 2025

Streaming Converter Implementation for LLM JSONL Processing

Overview

This PR introduces a reactive streaming converter for processing JSONL
(JSON Lines) responses from LLMs, enabling real-time streaming of
structured objects and thinking content.

Key Components

🔧 Core Implementation

StreamingJacksonOutputConverter

  • Extends JacksonOutputConverter to inherit schema injection capabilities
  • Converts JSONL input to reactive Flux streams
  • Supports mixed content with both objects and thinking blocks via
    StreamingEvent
  • Provides resilient error handling with proper logging

StreamingEvent

  • Sealed class supporting Object and Thinking event types
  • Includes Either-like fold() method for functional composition
  • Located in embabel-common-core for cross-product reusability
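The bullets above can be sketched as a small sealed hierarchy along these lines; the subclass names (in particular `ObjectEvent`) and the `fold()` signature are illustrative assumptions based on this description, not the actual source:

```kotlin
// Hypothetical sketch of StreamingEvent; names and signatures are inferred
// from the PR description, not copied from the real implementation.
sealed class StreamingEvent<T> {
    // A thinking block emitted by the LLM (free-form reasoning text).
    data class Thinking<T>(val content: String) : StreamingEvent<T>()

    // A structured object parsed from a JSONL line.
    data class ObjectEvent<T>(val value: T) : StreamingEvent<T>()

    // Either-like fold: thinking maps to `left`, objects to `right`.
    fun <R> fold(left: (String) -> R, right: (T) -> R): R = when (this) {
        is Thinking -> left(content)
        is ObjectEvent -> right(value)
    }
}
```

`fold()` collapses both cases into a single result value, mirroring the `Either` convention from functional programming.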

🛡️ Error Handling & Resilience

  • Graceful degradation: Continues processing valid lines after individual
    failures
  • Fail-fast on critical errors: Terminates stream on JSON parsing
    exceptions
  • Comprehensive logging: DEBUG/WARN/ERROR levels with structured messages
  • Production-ready: Uses inherited logger from parent converter
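As an illustration of the graceful-degradation strategy described above (a sketch, not the converter's actual code), per-line parsing can skip malformed lines with a warning and continue with the rest:

```kotlin
// Illustrative sketch of resilient per-line JSONL parsing: each line is
// parsed independently; failures are logged and skipped rather than
// aborting the whole batch. The `parse` lambda stands in for something
// like { objectMapper.readValue(it, clazz) }.
fun <T> parseLinesResiliently(
    jsonl: String,
    parse: (String) -> T,
): List<T> =
    jsonl.lineSequence()
        .map { it.trim() }
        .filter { it.isNotEmpty() }
        .mapNotNull { line ->
            runCatching { parse(line) }
                .onFailure { e -> System.err.println("WARN skipping malformed line: ${e.message}") }
                .getOrNull()
        }
        .toList()
```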

Configuration Framework

StreamingConfigProperties

  • Spring Boot configuration properties with sensible defaults
  • Supports buffer, timeout, retry, and error handling configuration
  • Uses embabel.platform.streaming namespace for consistency
  • Ready for externalization when production tuning is needed
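A minimal sketch of what such a properties class might look like; the property names and defaults here are assumptions, and in the real class this would carry Spring Boot's `@ConfigurationProperties(prefix = "embabel.platform.streaming")` annotation:

```kotlin
// Hypothetical shape of StreamingConfigProperties. Property names and
// default values are illustrative assumptions, not the actual class.
data class StreamingConfigProperties(
    val bufferSize: Int = 256,                 // max lines buffered before backpressure
    val timeoutMillis: Long = 30_000,          // per-stream timeout
    val maxRetries: Int = 3,                   // retries on transient failures
    val failFastOnParseError: Boolean = false, // terminate stream on JSON errors
)
```

With Spring Boot's constructor binding, each default would be overridable via `embabel.platform.streaming.*` properties.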

Package Structure

embabel-common-core/
└── com/embabel/common/core/streaming/
    ├── StreamingEvent.kt                    # Core event types
    └── config/
        └── StreamingConfigProperties.kt     # Configuration properties

embabel-common-ai/
└── com/embabel/common/ai/converters/streaming/
    └── StreamingJacksonOutputConverter.kt   # JSONL converter implementation

Usage Examples

  Basic Object Streaming

  val converter = StreamingJacksonOutputConverter(MyObject::class.java, objectMapper)
  val objectStream: Flux<MyObject> = converter.convertStream(jsonlResponse)

  Mixed Content with Thinking

  val eventStream: Flux<StreamingEvent<MyObject>> =
      converter.convertStreamWithThinking(mixedResponse)

  eventStream.subscribe { event ->
      event.fold(
          left = { thinking -> logger.info("LLM reasoning: {}", thinking) },
          right = { obj -> processObject(obj) }
      )
  }

Benefits

  1. Real-time LLM Streaming - Process objects as they arrive instead of
    waiting for complete response
  2. Thinking Support - Capture LLM reasoning process alongside structured
    outputs
  3. Schema Validation - Inherits JSON schema injection for proper LLM
    formatting
  4. Production Ready - Robust error handling and logging for operational
    environments
  5. Extensible - Configuration framework ready for future tuning needs

Next Steps

This foundation enables the next phase of LLM streaming implementation:

  • SPI streaming interfaces
  • PromptRunner integration
  • ChatClient streaming support
  • End-to-end integration testing

@igordayen igordayen changed the title Streaming Converter Implementation for LLM JSONL Processing Streaming Converter Implementation for LLM JSON Processing Nov 13, 2025

poutsma commented Nov 13, 2025

Here are my thoughts:

By extending JacksonOutputConverter, the StreamingJacksonOutputConverter loses the filtering capabilities that the FilteringJacksonOutputConverter introduces. Unless this is by design, my suggestion would be to extend from FilteringJacksonOutputConverter instead.

More in general, I can see how the StreamingJacksonOutputConverter could potentially contribute to streaming support in Embabel. However, without having the complete picture, it is hard to determine the context in which the StreamingJacksonOutputConverter will be used, and therefore hard to determine which parts of the code are complete, which are lacking, and which are unnecessary.

If we look at streaming from the perspective of the embabel-agent module, I would expect that any streaming solution Embabel offers will ultimately rely on the streaming capabilities of Spring AI's ChatClient, i.e. ChatClient.StreamResponseSpec. This means that, somewhere in ChatClientLlmOperations, we would have to do something like:

chatClient
    .prompt(springAiPrompt)
    .toolCallbacks(interaction.toolCallbacks)
    .options(chatOptions)
    .stream()

offering us a Flux of response chunks.
It is hard to find definitive documentation on this, but AFAICT Spring AI simply surfaces whatever the underlying model streams, and does not "tokenize" or re-chunk the text itself. This means that we would have to tokenize the response chunks into lines ourselves, by buffering them until an EOL character appears.
However, the StreamingJacksonOutputConverter assumes that the input text is a multi-line string, since the first thing it does is split the string into lines. That seems odd to me, and potentially unnecessary.
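The buffering described here can be sketched as follows, shown over a plain list of chunks for clarity; a reactive version would perform the same accumulation over the Flux of response chunks:

```kotlin
// Sketch of chunk-to-line tokenization: model chunks arrive with no
// relation to line boundaries, so we buffer characters until an EOL
// appears and emit each completed line.
fun tokenizeChunksIntoLines(chunks: List<String>): List<String> {
    val lines = mutableListOf<String>()
    val buffer = StringBuilder()
    for (chunk in chunks) {
        for (ch in chunk) {
            if (ch == '\n') {
                lines += buffer.toString()
                buffer.setLength(0)
            } else {
                buffer.append(ch)
            }
        }
    }
    if (buffer.isNotEmpty()) lines += buffer.toString() // trailing partial line
    return lines
}
```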

So in short, I don't think we can merge this PR until we have an idea of what the integration with PromptRunner and ChatClient will look like.


igordayen commented Nov 13, 2025

Thanks for the thorough review, Arjen.

The converter in common represents the foundation that gets consumed (through a follow-up PR) in embabel-agent.

Here is, schematically, the chain of invocations in the agent repo:

OperationContextPromptRunner.stream() →
  StreamingPromptRunnerOperationsImpl.createObjectList() →
  ChatClientLlmOperations.streamingMethod() → SpringAI.ChatClient.stream()
  → StreamingJacksonOutputConverter.convertStream() → Flux<T>

This PR is WIP and cannot be committed prior to merging the artifacts in common, as it would break the GitHub build.

Here is a usage example:

context.ai()
      .withAutoLlm()
      .stream()
      .createObjectList(ItemClass::class.java)
      .subscribe { item -> println("Received: $item") }

As for the missing filtering capabilities, I will address this gap accordingly.

Regarding formatting: the prompt follows the JSON Lines (JSONL) convention (cf. RFC 7464, JSON Text Sequences).

See the test `convertStream should delegate to parent convert method for each line` for multi-object processing.

@igordayen igordayen changed the title Streaming Converter Implementation for LLM JSON Processing Streaming Converter Implementation for LLM JSONL Processing Nov 13, 2025
@igordayen
Contributor Author

Added filtering support to the streaming converter per Arjen's recommendation.

Please refer to the comprehensive test `streaming converter should handle filtering with actual streaming for multiple objects`.

@poutsma poutsma left a comment


I have left several comments that aim to simplify this PR by reducing its public interface. When we have resolved those, I think it's ready to be merged.

Comment on lines +33 to +37
* val runner: PromptRunner = context.ai().autoLlm()
* if (runner.supportsStreaming()) {
* val capability: StreamingCapability = runner.stream()
* val operations = capability as StreamingPromptRunnerOperations (or use asStreaming extension function)
* // Use streaming operations...

I am not sure what the point of StreamingCapability is.
Looking at this example, it appears to me that runner.stream() could return StreamingPromptRunnerOperations directly, saving us the extra cast on line 36.
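A sketch of the suggested simplification, with illustrative signatures (the real `createObjectList` would return a `Flux<T>` rather than a `List<T>`):

```kotlin
// Sketch of the suggested API: stream() returns the operations type
// directly, so no intermediate StreamingCapability marker or cast is
// needed. Signatures are illustrative assumptions based on the discussion.
interface StreamingPromptRunnerOperations {
    fun <T> createObjectList(clazz: Class<T>): List<T> // Flux<T> in the real API
}

interface PromptRunner {
    fun supportsStreaming(): Boolean
    // Returns the operations directly, avoiding the cast from a capability type.
    fun stream(): StreamingPromptRunnerOperations
}
```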

Contributor Author

This is the part I was not happy with either.

PromptRunner::stream() throws an UnsupportedOperationException, while
StreamingPromptRunnerOperations provides the actual streaming.


As far as I can tell, the only benefit of the StreamingEvent abstraction is so that we can distinguish between objects and thinking blocks. Elsewhere in Embabel, in the SuppressThinkingConverter, we filter out the latter completely, so what's the reason we don't do that here for streaming? Is there a case where the user is interested in thinking blocks?
If there is not, then I would suggest dropping StreamingEvent as well as convertStreamWithThinking in StreamingJacksonOutputConverter.

Contributor Author

Yes, I'm aware of suppressing thinking, although the thinking marker format appears to vary, which is addressed in the streaming utils.
I thought that in the streaming use case, especially keeping potential UI integration in mind, it would be nice to see the intermediate response from the LLM; it adds some flexibility. I'm not marking this as resolved since my thought still requires validation on your side.

* Supports both object lines and thinking blocks.
* Uses resilient error handling - logs warnings for individual line failures but continues processing.
*/
fun convertStreamWithThinking(text: String): Flux<StreamingEvent<T>> {

I have my doubts about the usability of this method, see my comments on StreamingEvent below.

Contributor Author

Replied above, thanks.

simplified error handling
conversion to object stream reimplemented through a more generic method that creates StreamingEvent
removed unnecessary logging
updated tests