Skip to content

Updating vision cookbook to render properly #1787

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 6 commits into from
Apr 23, 2025
Merged

Conversation

rzhao-openai
Copy link
Contributor

@rzhao-openai rzhao-openai commented Apr 23, 2025

Summary

Fix #1783. The cookbook utilizes GPT4.1-mini vision capabilities to analyze a video by splitting the video into images by frames. Then, these images are interpreted by the GPT4.1-mini model to create a script narrating the video. Finally, GPT4o-mini-tts is utilized to generate an audio recording of the script.

Motivation

Updating cookbook to utilize GPT4.1-mini vision capabilities over 4o, and using GPT4o-mini-tts over older tts models.


For new content

When contributing new content, read through our contribution guidelines, and mark the following action items as completed:

  • I have added a new entry in registry.yaml (and, optionally, in authors.yaml) so that my content renders on the cookbook website.
  • I have conducted a self-review of my content based on the contribution guidelines:
    • Relevance: This content is related to building with OpenAI technologies and is useful to others.
    • Uniqueness: I have searched for related examples in the OpenAI Cookbook, and verified that my content offers new insights or unique information compared to existing documentation.
    • Spelling and Grammar: I have checked for spelling or grammatical mistakes.
    • Clarity: I have done a final read-through and verified that my submission is well-organized and easy to understand.
    • Correctness: The information I include is correct and all of my code executes successfully.
    • Completeness: I have explained everything fully, including all necessary references and citations.

We will rate each of these areas on a scale from 1 to 4, and will only accept contributions that score 3 or higher on all areas. Refer to our contribution guidelines for more details.

@rzhao-openai rzhao-openai changed the title Fix https://github.com/openai/openai-cookbook/pull/1783 Fix #1783 Apr 23, 2025
@rzhao-openai rzhao-openai changed the title Fix #1783 Updating vision cookbook to render properly Apr 23, 2025
@rzhao-openai rzhao-openai self-assigned this Apr 23, 2025
@damon-openai
Copy link
Contributor

AD, add more text to the summary please!

@rzhao-openai rzhao-openai merged commit 8e89464 into main Apr 23, 2025
@rzhao-openai rzhao-openai deleted the rz/vision-update branch April 23, 2025 18:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants