Audio Transcription Service

A web application for uploading or recording audio and getting a transcription with timestamps and speaker identification.

Features

  • Upload audio files or record audio from your microphone
  • Automatic transcription using OpenAI's Whisper
  • Display transcriptions with timestamps
  • Speaker identification with customizable speaker names
  • Modern and responsive UI
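
As a rough illustration of the timestamp display, the sketch below formats transcript segments into readable lines. It assumes each segment is a dict with "start", "end" (seconds) and "text" keys, which is the shape the open-source whisper package returns under result["segments"]; the helper names format_timestamp and format_transcript are hypothetical and not taken from this project.

```python
def format_timestamp(seconds: float) -> str:
    """Render an offset in seconds as HH:MM:SS (fractions are truncated)."""
    total = int(seconds)
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def format_transcript(segments):
    """Turn segment dicts into '[start - end] text' lines.

    Assumption: each segment has 'start', 'end' and 'text' keys.
    """
    lines = []
    for seg in segments:
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        lines.append(f"[{start} - {end}] {seg['text'].strip()}")
    return lines


if __name__ == "__main__":
    sample = [
        {"start": 0.0, "end": 2.5, "text": " Hello there."},
        {"start": 2.5, "end": 75.4, "text": " Thanks for joining."},
    ]
    for line in format_transcript(sample):
        print(line)
```

Truncating to whole seconds keeps the display compact; an app that needs finer alignment could keep milliseconds instead.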

Setup

  1. Create a .env file in the project root with your Hugging Face token:
HUGGING_FACE_TOKEN=your_hugging_face_token
  2. Install Python dependencies:
pip install -r requirements.txt
  3. Run the application:
python app.py
  4. Open your browser and navigate to http://localhost:5000
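
For orientation, here is a minimal sketch of how a token could be read from the .env file described in step 1. How app.py actually loads the token is not shown in this README (it may well use a package such as python-dotenv), so the helpers load_env_file and get_hugging_face_token below are hypothetical and use only the standard library.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Parse KEY=VALUE lines from a .env file into a dict (hypothetical helper)."""
    values = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for raw in env_path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and anything that is not KEY=VALUE.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_hugging_face_token(path=".env"):
    """Prefer the process environment, then fall back to the .env file."""
    token = os.environ.get("HUGGING_FACE_TOKEN") or load_env_file(path).get(
        "HUGGING_FACE_TOKEN"
    )
    if not token:
        raise RuntimeError("HUGGING_FACE_TOKEN is not set; add it to .env")
    return token
```

Failing early with a clear error when the token is missing is usually easier to debug than a later authentication failure.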

Usage

  1. Choose an input method:
    • Click "Choose File" to select an existing audio file
    • Click "Start Recording" to record audio from your microphone
  2. Click "Upload and Transcribe"
  3. Wait for the processing to complete
  4. View the transcript with timestamps and speaker identification
  5. Customize speaker names if needed
  6. Click "Update Speaker Names" to apply changes
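
The speaker renaming in steps 5 and 6 can be pictured as a simple label substitution. The sketch below assumes each transcript segment carries a "speaker" label and that custom names arrive as a mapping from label to display name; the label format "SPEAKER_00" and the function rename_speakers are illustrative assumptions, not this project's actual code.

```python
def rename_speakers(segments, names):
    """Return new segments with speaker labels replaced where a custom name exists.

    Assumptions: each segment is a dict with a 'speaker' key, and `names`
    maps original labels to display names. Unmapped labels are kept as-is.
    """
    return [
        {**seg, "speaker": names.get(seg["speaker"], seg["speaker"])}
        for seg in segments
    ]


if __name__ == "__main__":
    transcript = [
        {"speaker": "SPEAKER_00", "text": "Shall we start?"},
        {"speaker": "SPEAKER_01", "text": "Yes, go ahead."},
    ]
    for seg in rename_speakers(transcript, {"SPEAKER_00": "Alice"}):
        print(f"{seg['speaker']}: {seg['text']}")
```

Building new dicts rather than mutating the originals keeps the raw labels available, so names can be changed again later.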

Supported Audio Formats

  • M4A
  • WAV
  • MP3
  • Any other format supported by pydub (which relies on FFmpeg for formats other than WAV)
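
A small sketch of how an upload might be checked against the formats named above. It only covers the three extensions listed explicitly; other formats that pydub can read would need a broader check. The names LISTED_EXTENSIONS and is_listed_format are hypothetical.

```python
from pathlib import Path

# Extensions named explicitly in this README; not an exhaustive list.
LISTED_EXTENSIONS = {".m4a", ".wav", ".mp3"}


def is_listed_format(filename):
    """Return True if the file extension is one of the explicitly listed formats."""
    return Path(filename).suffix.lower() in LISTED_EXTENSIONS
```

An extension check is only a first filter; a file can still fail to decode, so the decoding step should handle errors as well.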
