Gemini 2.0 Flash Multimodal Live API Client

A lightweight vanilla JavaScript implementation of the Gemini 2.0 Flash Multimodal Live API client. This project provides real-time interaction with Gemini's API through text, audio, video, and screen sharing capabilities.

This is a simplified version of Google's original React implementation, created in response to this issue.

Live Demo on GitHub Pages

Live Demo

Key Features

Real-time text chat with Gemini API
Audio input/output with visualization
Motion-detected video streaming
Screen sharing capabilities
Function calling support
Built with vanilla JavaScript (no dependencies)
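
This README does not describe how motion detection decides when to stream a frame. One common approach is frame differencing: compare the current frame with the previous one and send it only if enough pixels changed. The sketch below illustrates that idea under assumptions. The function names and both thresholds are hypothetical and are not taken from this repository.

```javascript
// Fraction of pixels whose summed RGB difference exceeds `pixelThreshold`.
// `prev` and `curr` are RGBA byte arrays of equal length, for example the
// `data` field of an ImageData object read from a canvas.
function changedPixelRatio(prev, curr, pixelThreshold = 30) {
  if (prev.length !== curr.length || curr.length % 4 !== 0) {
    throw new Error('Frames must be RGBA arrays of equal length');
  }
  const pixelCount = curr.length / 4;
  if (pixelCount === 0) return 0;

  let changed = 0;
  for (let i = 0; i < curr.length; i += 4) {
    // Alpha (index i + 3) is ignored; only colour channels are compared.
    const diff =
      Math.abs(curr[i] - prev[i]) +
      Math.abs(curr[i + 1] - prev[i + 1]) +
      Math.abs(curr[i + 2] - prev[i + 2]);
    if (diff > pixelThreshold) changed++;
  }
  return changed / pixelCount;
}

// Send the first frame unconditionally, then only frames with enough motion.
function shouldSendFrame(prev, curr, motionThreshold = 0.02) {
  return prev === null || changedPixelRatio(prev, curr) >= motionThreshold;
}
```

In a browser, the two arrays could come from successive `getImageData` calls on a canvas that the webcam video is drawn onto. Skipping near-identical frames reduces the amount of data sent over the connection.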

Prerequisites

Modern web browser with WebRTC, WebSocket, and Web Audio API support
Google AI Studio API key
Python 3 or Node.js with npx http-server (to run a local development server)
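
To confirm that a browser meets the requirements above, a small feature check can be run from the developer console. This is a minimal sketch, not code from this repository: `findMissingFeatures` is a hypothetical helper, and it treats `navigator.mediaDevices.getUserMedia` as the indicator for camera and microphone capture.

```javascript
// Returns the names of required browser APIs that are missing from `scope`.
// Pass `globalThis` (or `window`) in a browser.
function findMissingFeatures(scope) {
  const mediaDevices = scope.navigator && scope.navigator.mediaDevices;
  const checks = {
    'WebSocket': typeof scope.WebSocket === 'function',
    'Web Audio API':
      typeof scope.AudioContext === 'function' ||
      typeof scope.webkitAudioContext === 'function',
    'Media capture (getUserMedia)':
      Boolean(mediaDevices) && typeof mediaDevices.getUserMedia === 'function',
  };
  return Object.keys(checks).filter((name) => !checks[name]);
}

const missing = findMissingFeatures(globalThis);
if (missing.length > 0) {
  console.warn('Missing browser features: ' + missing.join(', '));
}
```

Note that browsers expose `navigator.mediaDevices` only in secure contexts, which include `http://localhost`.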

Quick Start

Clone the repository

Set up your API key:

cp js/config/config.example.js js/config/config.js
# Edit js/config/config.js with your API key

Start the development server:

python -m http.server 8000

or

npx http-server -p 8000

Access the application at http://localhost:8000

Project Structure

├── js/
│ ├── audio/ # Audio processing and management
│ ├── config/ # Configuration files
│ ├── core/ # Core functionality (WebSocket, worklets)
│ ├── tools/ # Function calling implementations
│ ├── utils/ # Utility functions
│ ├── video/ # Video and screen sharing
│ └── main.js # Application entry point
├── css/ # Styling
└── index.html # Main HTML file

Usage Guide

Click "Connect" to establish API connection
Use the interface to:
- Send text messages
- Toggle microphone for audio input
- Enable webcam for video streaming
- Share your screen
Monitor the logs panel for real-time feedback
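
For orientation, the sketch below shows roughly what connecting and sending a text message might look like at the WebSocket level. It is based on the publicly documented Live API message shapes rather than on this repository's code, so the endpoint URL, the model name, and all function names here are assumptions that may differ from what `js/core/` actually does.

```javascript
// Assumed endpoint for the Multimodal Live API (v1alpha at time of writing).
const LIVE_API_URL =
  'wss://generativelanguage.googleapis.com/ws/' +
  'google.ai.generativelanguage.v1alpha.GenerativeService.BidiGenerateContent';

// First message of a session: tells the server which model to use.
function buildSetupMessage(model = 'models/gemini-2.0-flash-exp') {
  return { setup: { model } };
}

// A single user turn containing plain text.
function buildTextMessage(text) {
  return {
    clientContent: {
      turns: [{ role: 'user', parts: [{ text }] }],
      turnComplete: true,
    },
  };
}

// Opens the socket and sends the setup message once connected.
// `onMessage` receives the raw event data, which may be a Blob in browsers.
function openSession(apiKey, onMessage) {
  const ws = new WebSocket(LIVE_API_URL + '?key=' + encodeURIComponent(apiKey));
  ws.addEventListener('open', () => {
    ws.send(JSON.stringify(buildSetupMessage()));
  });
  ws.addEventListener('message', (event) => onMessage(event.data));
  return ws;
}

// Example (not executed here):
//   const ws = openSession(API_KEY, (data) => console.log(data));
//   ws.send(JSON.stringify(buildTextMessage('Hello, Gemini')));
```

Keeping the message builders separate from the socket code makes them easy to inspect in the logs panel or to test without a network connection.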

Development

Adding Custom Tools

Custom tools can be added to extend functionality. See js/tools/README.md for implementation details.
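
As a rough illustration of the idea, a tool usually pairs a function declaration (the schema sent to the model) with the code that runs when the model calls it. The sketch below is hypothetical: `EchoTool`, `ToolRegistry`, and their method names are invented for this example, and the actual interface expected by this project is the one documented in js/tools/README.md.

```javascript
// Hypothetical tool: describes itself to the model and handles calls.
class EchoTool {
  getDeclaration() {
    return {
      name: 'echo',
      description: 'Returns the text it was given.',
      parameters: {
        type: 'object',
        properties: {
          text: { type: 'string', description: 'Text to echo back' },
        },
        required: ['text'],
      },
    };
  }

  execute(args) {
    return { echoed: String(args.text) };
  }
}

// Hypothetical registry: collects declarations and dispatches calls by name.
class ToolRegistry {
  constructor() {
    this.tools = new Map();
  }

  register(tool) {
    this.tools.set(tool.getDeclaration().name, tool);
  }

  declarations() {
    return Array.from(this.tools.values(), (tool) => tool.getDeclaration());
  }

  call(name, args) {
    const tool = this.tools.get(name);
    if (!tool) throw new Error('Unknown tool: ' + name);
    return tool.execute(args);
  }
}

const registry = new ToolRegistry();
registry.register(new EchoTool());
console.log(registry.call('echo', { text: 'hi' })); // { echoed: 'hi' }
```

In a design like this, the declarations would be sent to the API when the session is set up, and each function call the model returns would be dispatched through `call` with the result sent back as the tool response.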

Contributing

Contributions are welcome! Please feel free to submit issues and pull requests.

License

This project is licensed under the MIT License.
