GitHub - TarikToha/VLMDemo: This Android app captures a photo and sends it to the Gemini API for image understanding.

Image Understanding using Gemini on Android

This Android app captures an image and describes it using the Gemini API.

Setup

1. Requirements

Android Studio
Minimum SDK: 24
A Gemini API key from Google AI Studio

2. Add Dependencies

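Add OkHttp to the dependencies block of app/build.gradle (if the module uses the Kotlin DSL in app/build.gradle.kts, write the same coordinates as implementation("com.squareup.okhttp3:okhttp:4.12.0")):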

implementation 'com.squareup.okhttp3:okhttp:4.12.0'

3. Insert Your API Key

Inside MainActivity.java:

private static final String API_KEY = "YOUR_API_KEY_HERE";

4. Internet Permission

Add this to AndroidManifest.xml:

<uses-permission android:name="android.permission.INTERNET" />

Usage

1. Run the App

Build and install the app on a device or emulator that has a camera.

2. Capture an Image

Tap Capture to open the camera. Take a photo, and a small thumbnail will appear in the ImageView.

3. Send to Gemini

Tap Gemini. The app converts the image to Base64, builds a JSON request, sends it to the Gemini API, and receives the description.
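
The conversion and request-building steps above can be sketched in plain Java as follows. This is a minimal sketch under stated assumptions, not the code from MainActivity.java: the class and method names are hypothetical, the endpoint pattern and the contents/parts/inline_data body layout are assumed from the public Gemini REST API, the model name is left as a parameter, and java.util.Base64 stands in for android.util.Base64 (which an app with minimum SDK 24 would more likely use, with the NO_WRAP flag).

```java
import java.util.Base64;

// Hypothetical helper; names and structure are illustrative, not from the repository.
class GeminiRequestSketch {

    // Assumed endpoint pattern for the generateContent REST method.
    static String endpoint(String model, String apiKey) {
        return "https://generativelanguage.googleapis.com/v1beta/models/"
                + model + ":generateContent?key=" + apiKey;
    }

    // Encode raw JPEG bytes as Base64 without line breaks.
    static String toBase64(byte[] jpegBytes) {
        return Base64.getEncoder().encodeToString(jpegBytes);
    }

    // Escape characters that may not appear raw inside a JSON string.
    static String escapeJson(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n");  break;
                case '\r': sb.append("\\r");  break;
                case '\t': sb.append("\\t");  break;
                default:
                    if (c < 0x20) sb.append(String.format("\\u%04x", (int) c));
                    else sb.append(c);
            }
        }
        return sb.toString();
    }

    // Build the request body: one text part (the prompt) and one inline image part.
    // Base64 output contains only JSON-safe characters, so it needs no escaping.
    static String buildRequestBody(String prompt, byte[] jpegBytes) {
        return "{\"contents\":[{\"parts\":["
                + "{\"text\":\"" + escapeJson(prompt) + "\"},"
                + "{\"inline_data\":{\"mime_type\":\"image/jpeg\",\"data\":\""
                + toBase64(jpegBytes) + "\"}}"
                + "]}]}";
    }
}
```

The string returned by buildRequestBody would be posted to the URL from endpoint with an application/json content type. Inside the app, org.json's JSONObject (bundled with Android) is a more natural way to assemble the same body than string concatenation.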

4. View Results

The app parses the description from the response and displays it in the TextView. Errors (network issues, JSON parsing problems) are logged to Logcat.
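
The parsing step can be illustrated with a dependency-free sketch. It assumes the response carries the description at candidates[0].content.parts[0].text, as in the public Gemini REST API, and simply returns the first "text" string value it finds. The class and method names are hypothetical, and this hand-rolled scan is a stand-in for illustration only; in the app itself, org.json's JSONObject (bundled with Android) is the more robust choice.

```java
// Hypothetical helper; the repository's MainActivity.java may parse the response differently.
class GeminiResponseSketch {

    // Returns the first "text" string value in the response JSON, or null if none is found.
    // Assumes well-formed JSON in which the first "text" key holds a string.
    static String firstText(String json) {
        int key = json.indexOf("\"text\"");
        if (key < 0) return null;
        int colon = json.indexOf(':', key + 6);
        if (colon < 0) return null;
        int start = json.indexOf('"', colon + 1);
        if (start < 0) return null;

        StringBuilder out = new StringBuilder();
        for (int i = start + 1; i < json.length(); i++) {
            char c = json.charAt(i);
            if (c == '"') return out.toString();      // closing quote reached
            if (c != '\\') { out.append(c); continue; }
            if (++i >= json.length()) return null;    // dangling backslash
            char e = json.charAt(i);
            switch (e) {
                case 'n': out.append('\n'); break;
                case 't': out.append('\t'); break;
                case 'r': out.append('\r'); break;
                case 'b': out.append('\b'); break;
                case 'f': out.append('\f'); break;
                case 'u':                              // four hex digits follow
                    if (i + 4 >= json.length()) return null;
                    out.append((char) Integer.parseInt(json.substring(i + 1, i + 5), 16));
                    i += 4;
                    break;
                default: out.append(e);                // covers \" \\ and \/
            }
        }
        return null; // unterminated string
    }
}
```

A null result signals that no description was found (for example, when the API returns an error object), which is the case the app would report to Logcat instead of updating the TextView.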

