Best Open Source Windows Speech Recognition Software 2025

VideoSrt

Windows-GUI

VideoSrt is an open source Windows GUI tool that recognizes speech in video and automatically generates SRT subtitle files. It is written in Go and built on the lxn/walk Windows GUI toolkit. It is suited to workflows that need to generate Chinese/English subtitles and text files for media (video/audio) quickly and in batches. Features: recognize video/audio speech to generate subtitle files (with Chinese-English translation and bilingual subtitles); extract speech text from video/audio; batch translate, filter, and re-encode SRT subtitle files. It uses the Alibaba Cloud speech recognition interface, with a stated recognition rate above 95% for standard Mandarin and English. Recognition does not require uploading the original video, which saves time.

Downloads: 18 This Week

Last Update: 2023-01-13
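
To illustrate the kind of output VideoSrt produces, here is a minimal sketch of turning recognized speech segments into SRT text. It is not VideoSrt's own code (which is written in Go); the Segment structure and the function names srt_timestamp and to_srt are hypothetical, and the segments are assumed to arrive already recognized with start and end times in seconds.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One recognized utterance (hypothetical structure); times in seconds."""
    start: float
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render segments as SRT cues: index, time range, text, blank separator."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
            f"{seg.text}\n"
        )
    # Joining with a newline leaves one blank line between consecutive cues.
    return "\n".join(cues)
```

Calling to_srt on a list of segments returns a string that can be written to a .srt file with UTF-8 encoding.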


DeepLearning

Deep Learning (Flower Book) mathematical derivation

"Deep Learning" is a comprehensive textbook in the field of deep learning, also known as the "Deep Learning AI Bible". It was written by three well-known experts: Ian Goodfellow, Yoshua Bengio, and Aaron Courville. It covers linear algebra, probability theory, information theory, numerical optimization, and related machine learning background. It also introduces deep learning techniques used by industry practitioners, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling, and practical methodology, and surveys applications in natural language processing, speech recognition, computer vision, online recommender systems, bioinformatics, and video games. Finally, the book presents research directions covering theoretical topics such as linear factor models, autoencoders, representation learning, and structured probabilistic models.

Downloads: 1 This Week

Last Update: 2022-08-02


Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition, and object detection. In the research community, authors often open-source their code to help others replicate their results and advance deep learning further. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and compare results. Tensor2Tensor, or T2T for short, is a library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research. T2T was developed by researchers and engineers in the Google Brain team and a community of users. It is now deprecated: it is kept running and bug fixes are welcome, but users are encouraged to use the successor library, Trax.

Downloads: 1 This Week

Last Update: 2021-05-24


Polaris programing with voice in Eclipse

Polaris, programming with voice in Eclipse IDE

Polaris lets you incorporate speech into programming. This Eclipse IDE plugin shows that an environment for programming by voice is possible, and the project presents voice programming as part of the natural evolution of programming tools.

VOICE COMMANDS: eclipse task, eclipse search, eclipse skip, eclipse format, eclipse new, eclipse save, eclipse rename, eclipse cut, eclipse copy, eclipse paste, eclipse all, eclipse delete, eclipse close, eclipse get, eclipse hash, eclipse string.

The range of functionality that can be controlled by voice is being extended on a daily basis.

PREREQUISITES: Windows OS and Eclipse IDE. Headphones with a microphone are not mandatory but will improve speech recognition. The port number set on the Polaris Preference page must not be in use by any other application.

Downloads: 7 This Week

Last Update: 2019-05-12


VoxForge

VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.

Downloads: 3 This Week

Last Update: 2013-04-24


Awesome Recurrent Neural Networks

A curated list of resources dedicated to RNN

A curated list of resources dedicated to recurrent neural networks (closely related to deep learning). It links to a wide range of works and resources, such as a Recurrent Neural Network Tutorial, a Sequence-to-Sequence Model Tutorial, tutorials by nlintz, notebook examples by aymericdamien, Scikit Flow (skflow), a simplified Scikit-learn-like interface for TensorFlow, Keras, a (TensorFlow / Theano)-based modular deep learning library similar to Torch, char-rnn-tensorflow by sherjilozair (char-rnn in TensorFlow), and much more. It covers code, theory, applications, and datasets for natural language processing, robotics, computer vision, and other areas.

Downloads: 0 This Week

Last Update: 2021-09-22


High-order HMM in Matlab

Implementation of duration high-order hidden Markov model in Matlab.

Implementation of duration high-order hidden Markov model (DHO-HMM) in Matlab with application in speech recognition.

2 Reviews

Downloads: 0 This Week

Last Update: 2015-02-15


InproTK

An Incremental Spoken Dialogue Processing Toolkit

InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/

Downloads: 0 This Week

Last Update: 2015-06-16


Interactive4J

The project aims to provide simple, easy-to-use APIs that let Java developers add interactive capabilities to their Java applications, such as speech recognition, handwriting recognition, webcam access, sound recording and playback, decision trees, text-to-speech, and many others.

Downloads: 0 This Week

Last Update: 2014-07-15


Little Linguist

A learning package for children, helping them to learn a foreign language. Techniques such as speech recognition will be used.

Downloads: 0 This Week

Last Update: 2015-11-10


Scalable Language API

Scalable Language API (SLAPI) is described by its authors as the most comprehensive architecture for conversational natural-language applications, including speech recognition/synthesis, semantics, and machine translation. It is used on Android and other mobile app platforms.

Downloads: 0 This Week

Last Update: 2018-01-22


TalkMaths

TalkMaths is a speech user interface that extends the speech recognition program Dragon NaturallySpeaking by parsing spoken mathematical expressions into MathML and/or LaTeX. TalkMaths allows the user to create documents in MathML or LaTeX hands-free.

Downloads: 0 This Week

Last Update: 2015-07-02

