s4ff0x/speech-to-docs

speech-to-docs

A Node.js application that transcribes audio recordings into text and automatically saves them to Google Docs. Perfect for maintaining voice notes, meeting minutes, or any spoken content in a written format.

img_1.png

Environment Variables

Create a .env file with the following variables:

OPENAI_SPEECH_API_KEY=
DOC_ID=
TIMEZONE=Asia/Jerusalem
PERSONAL_AUTH_TOKEN=

# Google Service Account Credentials
TYPE=
PROJECT_ID=
PRIVATE_KEY_ID=
PRIVATE_KEY=
CLIENT_EMAIL=
CLIENT_ID=
AUTH_URI=
TOKEN_URI=
AUTH_PROVIDER_X509_CERT_URL=
CLIENT_X509_CERT_URL=
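
The Google variables above mirror the fields of a service-account key file. A minimal sketch of how they could be assembled back into a credentials object is shown below. The helper name `buildServiceAccountCredentials` is hypothetical and not taken from this repository, and the newline handling assumes `PRIVATE_KEY` is stored on a single line with literal `\n` sequences.

```javascript
// Hypothetical helper (not from the repository): maps the .env names
// listed above onto the snake_case fields of a service-account key.
function buildServiceAccountCredentials(env) {
  return {
    type: env.TYPE,
    project_id: env.PROJECT_ID,
    private_key_id: env.PRIVATE_KEY_ID,
    // Assumption: the key is stored on one line with literal "\n"
    // sequences, so real newlines are restored here.
    private_key: (env.PRIVATE_KEY || '').replace(/\\n/g, '\n'),
    client_email: env.CLIENT_EMAIL,
    client_id: env.CLIENT_ID,
    auth_uri: env.AUTH_URI,
    token_uri: env.TOKEN_URI,
    auth_provider_x509_cert_url: env.AUTH_PROVIDER_X509_CERT_URL,
    client_x509_cert_url: env.CLIENT_X509_CERT_URL,
  };
}

// Usage, once the .env file has been loaded into process.env:
// const credentials = buildServiceAccountCredentials(process.env);
```

The resulting object has the same shape as the JSON key downloaded from the Google Cloud console, so it can be handed to whichever Google auth client the application uses.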

Features

  • Audio file transcription using OpenAI's Whisper model
  • Automatic saving of transcriptions to Google Docs
  • Timestamp recording for each transcription
  • Support for M4A audio format
  • RESTful API interface
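
As an illustration of the timestamp and Google Docs features, the sketch below formats a timestamp in the configured `TIMEZONE` and builds a request body that appends text to the end of a document. The function names and the `YYYY-MM-DD HH:mm` layout are assumptions made for this example; the repository may format and insert entries differently. The body follows the `insertText` request shape of the Google Docs API `documents.batchUpdate` method.

```javascript
// Assumed timestamp layout: "YYYY-MM-DD HH:mm" in the given IANA time zone.
function formatTimestamp(date, timeZone) {
  const parts = new Intl.DateTimeFormat('en-CA', {
    timeZone,
    year: 'numeric',
    month: '2-digit',
    day: '2-digit',
    hour: '2-digit',
    minute: '2-digit',
    hourCycle: 'h23',
  }).formatToParts(date);
  const get = (type) => parts.find((p) => p.type === type).value;
  return `${get('year')}-${get('month')}-${get('day')} ${get('hour')}:${get('minute')}`;
}

// Hypothetical helper: builds a batchUpdate body that appends the
// timestamp and transcript to the end of the document body.
function buildAppendRequest(transcript, timestamp) {
  return {
    requests: [
      {
        insertText: {
          endOfSegmentLocation: {}, // empty segment ID means the document body
          text: `${timestamp}\n${transcript}\n\n`,
        },
      },
    ],
  };
}

// Usage:
// const stamp = formatTimestamp(new Date(), process.env.TIMEZONE);
// const body = buildAppendRequest('Transcribed text', stamp);
```

Appending with `endOfSegmentLocation` avoids having to read the document length first; inserting at a fixed index would instead place new entries at the top.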

Technical Stack

  • Node.js with Express.js
  • OpenAI API (Whisper model for transcription)
  • Google Docs API
  • Multer for file upload handling

Prerequisites

  • Node.js installed
  • OpenAI API key
  • Google Cloud project with the Google Docs API enabled
  • Google Service Account credentials
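
A quick way to confirm that the prerequisites are in place is to check the environment before the server starts. The sketch below is a hypothetical startup check; the helper name and the choice of which variables count as required are assumptions for illustration.

```javascript
// Assumed minimum set of variables needed to call OpenAI and Google Docs.
const REQUIRED_ENV_VARS = [
  'OPENAI_SPEECH_API_KEY',
  'DOC_ID',
  'CLIENT_EMAIL',
  'PRIVATE_KEY',
];

// Returns the names of required variables that are unset or blank.
function findMissingEnvVars(env, required = REQUIRED_ENV_VARS) {
  return required.filter((name) => !env[name] || String(env[name]).trim() === '');
}

// Usage at startup:
// const missing = findMissingEnvVars(process.env);
// if (missing.length > 0) {
//   throw new Error(`Missing environment variables: ${missing.join(', ')}`);
// }
```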
