Bytez Model Provider Integration #1787
Hey there all,
We're Bytez, and we're the largest inference provider on the internet! (We offer inference for 170k+ models.)
We're stuck on step 3 of the integration guide, presumably because we need to be added as a provider. (We are unable to use the model mapping API because we don't exist as a provider on your backend.)
This PR allows Bytez to be used as an inference provider (`bytez-ai`). For the most part, changes are isolated to the Bytez code, with the exception of five places:
- `packages/inference/src/tasks/audio/audioClassification.ts`
- `packages/inference/src/tasks/cv/imageClassification.ts`
- `packages/inference/src/tasks/cv/objectDetection.ts`
- `packages/inference/src/tasks/cv/imageToText.ts`
- `packages/inference/src/providers/hf-inference.ts`
The first three get a simple addition that allows the payload to be prepared asynchronously, which better aligns with how the other tasks are set up (a hedged sketch of the idea follows this paragraph).
I'd have dug deeper to bring greater consistency to all of the tasks, but opted for these simple adjustments to see what you all have to say before making any major changes.
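To make the intent concrete, here is a minimal sketch of the async-preparation pattern; the names (`PreparePayload`, `runTask`, `send`) are illustrative, not the actual identifiers in the diff:

```ts
// Illustrative sketch only; names are hypothetical, not the real diff.
// Letting preparePayload be sync OR async means a provider can, e.g.,
// read file bytes or fetch a token before building the request body.
type PreparePayload<A, P> = (args: A) => P | Promise<P>;

async function runTask<A, P, R>(
	args: A,
	preparePayload: PreparePayload<A, P>,
	send: (payload: P) => Promise<R>
): Promise<R> {
	// `await` is a no-op for plain values, so sync providers keep working.
	const payload = await preparePayload(args);
	return send(payload);
}
```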
The final task change, `packages/inference/src/tasks/cv/imageToText.ts`, follows the same pattern, but also changes how the response is passed along: the raw response now goes to the provider's `getResponse()` untouched, instead of being destructured out of a single-element array first (see the diff for the exact before/after).
There isn't a hook higher up in the call stack (AFAIK) for us to adapt our response to look like an array, so I've opted for this.
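A hedged reconstruction of that change (simplified types; the actual diff has the real ones):

```ts
// Hypothetical simplification of the imageToText change; not the literal diff.
interface ImageToTextOutput {
	generated_text: string;
}

// Before (roughly): the shared task code destructured the response itself,
// forcing every provider to return a single-element array.
function oldFlow(raw: [ImageToTextOutput]): ImageToTextOutput {
	return raw[0];
}

// After (roughly): the raw response is handed to the provider's getResponse()
// untouched, so each provider decides its own shape. Bytez can return a plain
// object, while HFInferenceImageToTextTask destructures the array internally
// (as described in the next paragraph).
function newFlow<T>(raw: T, getResponse: (raw: T) => ImageToTextOutput): ImageToTextOutput {
	return getResponse(raw);
}
```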
To ensure that the existing `hf-inference` provider code still works, I've modified the `HFInferenceImageToTextTask` to destructure the array internally, which to me seems more consistent with the general pattern, where the raw response is untouched until it is passed to the `getResponse()` handler.

Please let us know if we need to do anything else! 🙏
Also, as a question: is it normal for us to not be able to hit
https://huggingface.co/api/partners/bytez-ai/models
until this PR is accepted and we are on a `Team/Enterprise` plan? FWIW, we are on a `Team/Enterprise` plan. If so, we'd like to help you update your docs to make this more explicit!
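For reference, this is the kind of minimal probe we've been using (the endpoint path is taken verbatim from the URL above; nothing else is assumed):

```ts
// Simple probe of the partners endpoint; path copied from the question above.
const res = await fetch("https://huggingface.co/api/partners/bytez-ai/models");
// As described above, this currently fails for us, presumably because
// bytez-ai does not yet exist as a provider on the backend.
console.log(res.status, await res.text());
```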
We look forward to integrating!
Long live huggingface! 🤗