Audio Transcription Service

A web application that allows users to upload audio files and get transcriptions with timestamps and speaker identification.

Features

Upload audio files or record from your microphone
Automatic transcription using OpenAI's Whisper
Transcripts displayed with timestamps and speaker identification
Customizable speaker names
Modern, responsive UI

Setup

Create a .env file in the project root with your Hugging Face token:

HUGGING_FACE_TOKEN=your_hugging_face_token

Install Python dependencies:

pip install -r requirements.txt

Run the application:

python app.py

Open your browser and navigate to http://localhost:5000
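
For reference, the token from the .env file could be read at startup roughly as follows. This is a minimal standard-library sketch, not necessarily how app.py does it: the helper names load_env_file and get_hugging_face_token are hypothetical, and the project may instead rely on a package such as python-dotenv.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Parse KEY=VALUE lines from a .env-style file into a dict.

    Hypothetical helper for illustration. Blank lines, lines starting
    with '#', and lines without '=' are skipped; surrounding quotes are
    stripped from values.
    """
    values = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("'\"")
    return values


def get_hugging_face_token(path=".env"):
    """Return the token from the environment, falling back to the .env file."""
    token = os.environ.get("HUGGING_FACE_TOKEN")
    if not token:
        token = load_env_file(path).get("HUGGING_FACE_TOKEN")
    if not token:
        raise RuntimeError("HUGGING_FACE_TOKEN is not set; see the Setup section.")
    return token
```

Failing early with a clear error when the token is missing tends to be easier to debug than a later authentication failure from a model download.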

Usage

Choose how to provide audio:
- Click "Choose File" to select an existing audio file
- Click "Start Recording" to record audio from your microphone
Click "Upload and Transcribe"
Wait for the processing to complete
View the transcript with timestamps and speaker identification
Customize speaker names if needed
Click "Update Speaker Names" to apply changes
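
The speaker renaming step can be pictured as a simple label substitution over transcript segments. The sketch below is illustrative only: the segment shape (start, speaker, text), the labels such as SPEAKER_00, and the function names are all assumptions, not taken from app.py or util.py.

```python
def format_timestamp(seconds):
    """Format a time offset in seconds as HH:MM:SS (fractions are dropped)."""
    total = int(seconds)
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def rename_speakers(segments, names):
    """Return new segments with speaker labels replaced according to `names`.

    Labels that do not appear in `names` are kept unchanged, and the
    input list is not modified.
    """
    return [
        {**segment, "speaker": names.get(segment["speaker"], segment["speaker"])}
        for segment in segments
    ]


def render_transcript(segments):
    """Render segments as one '[HH:MM:SS] Speaker: text' line each."""
    return "\n".join(
        f"[{format_timestamp(s['start'])}] {s['speaker']}: {s['text']}"
        for s in segments
    )


# Assumed example data; a real transcript would come from the service.
segments = [
    {"start": 0.0, "speaker": "SPEAKER_00", "text": "Hello."},
    {"start": 75.4, "speaker": "SPEAKER_01", "text": "Hi there."},
]
print(render_transcript(rename_speakers(segments, {"SPEAKER_00": "Alice"})))
```

Returning new segments instead of editing them in place keeps the original labels available, so names can be changed again later.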

Supported Audio Formats

M4A
WAV
MP3
Any other format supported by pydub (which relies on FFmpeg or libav to decode non-WAV files)
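
As an illustration of how an upload in one of these formats might be normalized before transcription, here is a hedged sketch using pydub. It assumes pydub is installed and FFmpeg is on the PATH; the function names and the choice of WAV as the target format are assumptions, not a description of what app.py does.

```python
from pathlib import Path

# Extensions named in this README. pydub may accept more, depending on
# which decoders the local FFmpeg build provides.
LISTED_EXTENSIONS = {".m4a", ".wav", ".mp3"}


def is_listed_format(filename):
    """Return True if the file extension is one of the formats listed above."""
    return Path(filename).suffix.lower() in LISTED_EXTENSIONS


def convert_to_wav(source_path, target_path):
    """Decode a pydub-readable audio file and write it out as WAV.

    Hypothetical helper; requires pydub and an FFmpeg (or libav) install.
    """
    # Imported lazily so the extension check above works without pydub.
    from pydub import AudioSegment

    # No format is passed, so detection is left to pydub and FFmpeg.
    audio = AudioSegment.from_file(source_path)
    audio.export(target_path, format="wav")
    return target_path
```

A call such as convert_to_wav("meeting.m4a", "meeting.wav") would then produce a WAV copy of the upload; the file names here are placeholders.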
