Skip to content

Navigation Menu

Appearance settings

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

Appearance settings

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

KoljaB / RealtimeSTT Public

Notifications You must be signed in to change notification settings
Fork 657
Star 8k

Code
Issues 104
Pull requests 10
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Releases: KoljaB/RealtimeSTT

Releases · KoljaB/RealtimeSTT

v0.3.104

03 May 21:48

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.104 Latest

Latest

RealtimeSTT 0.3.104

Features & Improvements

New parameter: start_callback_in_new_thread
If set to True, all callback functions will be executed in a new thread.
This can be useful if the callback function is blocking and you want to avoid blocking the realtimestt application thread.

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

cxw620, homelab-00, NoiRC256, tychuang1211, Stefanperlarsson, Meshwa428, and tallboxdesign reacted with thumbs up emoji

tallboxdesign reacted with hooray emoji

All reactions

👍 7 reactions
🎉 1 reaction

7 people reacted

v0.3.103

19 Apr 20:36

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.103

RealtimeSTT 0.3.103

Features & Improvements

Thread‑safe IPC: Introduce SafePipe to replace mp.Pipe, hopefully ensuring robust inter-process communication (needs more tests).
Audio normalization: New normalize_audio option scales input to –0.95 dBFS for consistent transcription quality.
Callback overhaul: All event callbacks (VAD, wake‑word, turn detection, recording, realtime updates) now run asynchronously via helper threads.
Wake word & VAD: Add wakeword_backend config and faster_whisper_vad_filter flag; improved error messages when misconfigured.
Rich metadata: Embed nanosecond‑precision timestamps in both client and server, serialized as formatted strings.
CLI enhancements: --faster_whisper_vad_filter and --debug_websockets flags give finer control over server behavior.
Testing updates: Adjusted parameters in realtimestt_test and added a new type_into_textbox.py example.

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

Bastigonzales, brian316, t20n14, amrit-citrusleaf, nu-jliu, and Meshwa428 reacted with heart emoji

All reactions

❤️ 6 reactions

6 people reacted

v0.3.101

11 Apr 12:50

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.101

RealtimeSTT 0.3.101

✨ Features & Improvements

Enhanced Real-time Responsiveness: Real-time transcription processing now intelligently pauses immediately when VAD detects silence, reducing latency and unnecessary work before the final transcription.
Client Connection Robustness: Using a more accurate WebSocket-based server check.
Remote Wake Word Delay Config: Clients can now configure the wake_word_activation_delay on the server.
Updated OpenAI Example: Refreshed the openai_voice_interface.py example with the latest OpenAI API, EdgeEngine TTS, configuration flags, and graceful shutdown.

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

t20n14 reacted with thumbs up emoji

JoshuaWink, Inhishonor, Bastigonzales, and AMEERAZAM08 reacted with heart emoji

All reactions

👍 1 reaction
❤️ 4 reactions

5 people reacted

v0.3.100

23 Mar 11:03

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.100

RealtimeSTT 0.3.100

New VAD callbacks on_vad_start and on_vad_stop

triggering on VAD presence
reverted functionality of on_vad_detect_start, on_vad_detect_stop back to: triggered when the system starts/stops detecting for VAD presence

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

MoaydShagaf and Quackad reacted with heart emoji

All reactions

❤️ 2 reactions

2 people reacted

v0.3.99

21 Mar 19:10

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.99

RealtimeSTT 0.3.99

1. Enhanced Logging Configuration

Introduced a dedicated named logger realtimestt instead of using the root logger.
Added structured logging with handlers for both console (level set by user) and file (always DEBUG).
Logging no longer propagates to the root logger by default (logger.propagate = False).

2. Added possibility to disable Faster-Whisper VAD Filter

Added faster_whisper_vad_filter parameter (default: True) to enable voice activity detection (VAD) from the faster_whisper library.
Improves robustness against background noise at the cost of additional GPU resources.
Integrated into both real-time and main transcription workflows.

3. Audio Worker Improvements

Added improved, detailed debug logging for audio device initialization, sample rate handling, and resampling.

4. VAD Callback Adjustments

fixes #215
Moved on_vad_detect_start and on_vad_detect_stop callbacks to trigger directly during voice activity checks instead of state transitions.
Ensures callbacks align more accurately with actual speech/silence events.

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

Bastigonzales and Meshwa428 reacted with heart emoji

All reactions

❤️ 2 reactions

2 people reacted

v0.3.98

10 Mar 22:42

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.98

RealtimeSTT 0.3.98

minor fix for pypi wheel

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

All reactions

v0.3.97

10 Mar 20:35

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.97

RealtimeSTT 0.3.97

fix for #210

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

All reactions

v0.3.95

15 Feb 16:38

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.95

RealtimeSTT 0.3.95

better warmup (using audio file)
merged #200

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

dhruvsh-1729 reacted with hooray emoji

All reactions

🎉 1 reaction

1 person reacted

v0.3.94

23 Jan 20:26

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.94

RealtimeSTT 0.3.94

New Parameters for stop-method of AudioToTextRecorder:
- backdate_stop_seconds (float, default=0.0):
  - Description: Specifies the number of seconds to backdate the stop time when ending a recording.
  - Usage: When invoking stop() due to a wake word detection or a speaker diarization change event, this parameter compensates for any latency, ensuring that only relevant audio is included in the recording and transcription.
- backdate_resume_seconds (float, default=0.0):
  - Description: Specifies the number of seconds to backdate the resume time when restarting listening after a recording has stopped.
  - Usage: Typically set to the same value as backdate_stop_seconds, this parameter allows for fine-tuning.

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

t20n14, r1r2t3n4a5, EpipScintilla, fayizshaffaq, zengjixiang, and saineela reacted with thumbs up emoji

All reactions

👍 6 reactions

6 people reacted

v0.3.93

18 Dec 18:19

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.93

fix for stt-server (got broken by webservers dependency upgrade because of an api change)
added initial_prompt_realtime to AudioToTextRecorder to be able to give different prompts to final and realtime model
added new parameters to client/server (download root, batch sizes)

Assets 2

Loading

Uh oh!

There was an error while loading. Please reload this page.

All reactions

Previous 1 2 3 4 Next

Previous Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.