Clarify LanguageModelResponseSchema interface #102

michaelwasserman · 2025-04-17T23:45:04Z

I'm confused about the utility and use of the LanguageModelResponseSchema interface.
Can the explainer add some clarity about this interface?

Is it mainly intended to validate that objects passed to its ctor are valid JSON Schema, rather than arbitrary JSON objects or other incompatible objects (i.e. before passing the object to prompt)? How will callers verify their schema is valid? Does the LanguageModelResponseSchema ctor throw an exception, or should it have some validation method or validity property?
The explainer alludes to reusability; elaborating would be helpful. Maybe show an example of re-use/copying, especially if it highlights utility over just deep-copying the underlying object that would be passed into the ctor.
Maybe the interface is useful for other schema formats that might be added in the future, e.g. regex, as in "RegExp shortcut for structured outputs" (#91)?
Anything else I'm missing?


domenic · 2025-04-21T03:33:31Z

The main intention here is to centralize the preprocessing of the schema up front. If I understand correctly, this includes both validation (which you mention) and transforming the schema from a JS object into something more native to the language model's representation. @sushraja-msft can give more details.

In other words, my understanding is that this:

function getBooleanFromSession(session, question) {
  return session.prompt(question, { responseJSONSchema: { type: "boolean" } });
}

will be slightly slower than this:

const booleanSchema = new LanguageModelResponseSchema({ type: "boolean" });
function getBooleanFromSession(session, question) {
  return session.prompt(question, { responseJSONSchema: booleanSchema });
}

because the first version requires processing the schema object into something suitable for the language model, whereas the latter version does that once and then reuses the result each time.
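To make the difference concrete, here is a minimal sketch that counts how often preprocessing runs under each pattern. Everything in it is hypothetical: `preprocessSchema` is only a stand-in for whatever the browser does internally, and the two `promptWith...` helpers are illustrative names, not part of the API.

```javascript
// Hypothetical stand-in for the browser's internal schema preprocessing.
// It counts how often it runs so the two call patterns can be compared.
let preprocessCount = 0;
function preprocessSchema(schema) {
  preprocessCount++;
  // Placeholder for validation plus conversion into a model-native form.
  return Object.freeze({ source: JSON.stringify(schema) });
}

// Pattern 1: a raw object is passed on every call, so it is preprocessed every time.
function promptWithRawSchema(schema) {
  return preprocessSchema(schema);
}

// Pattern 2: preprocessing happens once up front and the result is reused.
const preprocessedBoolean = preprocessSchema({ type: "boolean" });
function promptWithPreprocessedSchema() {
  return preprocessedBoolean;
}
```

Calling `promptWithRawSchema` repeatedly increments `preprocessCount` each time, while repeated calls to `promptWithPreprocessedSchema` leave it unchanged.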

With regard to other schema formats, #91 discusses how that would impact the API. I think we would not use LanguageModelResponseSchema for that, and indeed might want to rename LanguageModelResponseSchema to LanguageModelResponseJSONSchema to be clear.

clarkduvall · 2025-04-23T17:13:11Z

In my testing, creating the schema object takes something like 1-5 ms, so it may not be worth having an API just to avoid this cost (on the impl side we can also do things like an LRU cache for schemas, which should catch this case anyway). I tend to think we may want to get rid of LanguageModelResponseSchema altogether, and just pass the schema as a raw JS object. We can always add LanguageModelResponseSchema later if needed, right?
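For readers unfamiliar with the idea, a schema LRU cache could look roughly like the sketch below. This is only an illustration in JavaScript, not the actual implementation: the class name `SchemaLRUCache`, the `preprocess` callback, and the choice of the schema's JSON text as the cache key are all assumptions made for the example.

```javascript
// Illustrative only: a small LRU cache keyed by the schema's JSON text.
// `preprocess` is a hypothetical stand-in for the engine's real schema processing.
class SchemaLRUCache {
  constructor(capacity, preprocess) {
    this.capacity = capacity;
    this.preprocess = preprocess;
    this.entries = new Map(); // Map iterates in insertion order, oldest first.
  }

  get(schema) {
    // Note: JSON.stringify is sensitive to key order, so two equivalent schemas
    // with differently ordered keys would be cached separately.
    const key = JSON.stringify(schema);
    if (this.entries.has(key)) {
      const hit = this.entries.get(key);
      // Re-insert to mark this entry as most recently used.
      this.entries.delete(key);
      this.entries.set(key, hit);
      return hit;
    }
    const processed = this.preprocess(schema);
    this.entries.set(key, processed);
    if (this.entries.size > this.capacity) {
      // Evict the least recently used entry (the first key in the Map).
      this.entries.delete(this.entries.keys().next().value);
    }
    return processed;
  }
}
```

With a cache like this, repeatedly passing an equivalent raw schema object only pays the preprocessing cost on the first call, which is the case a dedicated schema class was meant to cover.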

domenic · 2025-04-24T01:11:22Z

Yeah, I'm supportive of that. Since it sounds like that's what our colleagues at Microsoft have implemented so far anyway, I'll post a PR to get rid of it shortly, unless @sushraja-msft or others have objections.

Closes #102. As discussed there, it probably isn't necessary, the naming is potentially confusing, and it can be added back later if we want it.

domenic added a commit that referenced this issue Apr 24, 2025

Remove LanguageModelResponseSchema class

ca958c2


domenic mentioned this issue Apr 24, 2025

Remove LanguageModelResponseSchema class #107

Merged

domenic added a commit that referenced this issue Apr 24, 2025

Remove LanguageModelResponseSchema class

4831630


domenic closed this as completed in #107 Apr 24, 2025
