Skip to content

second-state/silero_vad_server

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

An API server for AI VAD

This is the VAD server that is designed to wok with the EchoKit server.

Install libtorch dependencies

Regular Linux CPU

# download libtorch
curl -LO https://download.pytorch.org/libtorch/cu124/libtorch-cxx11-abi-shared-with-deps-2.4.0%2Bcu124.zip

unzip libtorch-cxx11-abi-shared-with-deps-2.4.0+cu124.zip

# Add to ~/.zprofile or ~/.bash_profile
export LD_LIBRARY_PATH=$(pwd)/libtorch/lib:$LD_LIBRARY_PATH
export LIBTORCH=$(pwd)/libtorch 

Build the API server

git clone https://github.com/second-state/silero_vad_server

cd silero_vad_server
cargo build --release

Run the API server

VAD_LISTEN=0.0.0.0:9094 nohup target/release/silero_vad_server &

In the EchoKit server configuration, you can now use the VAD server in the [asr] section to use it together with any ASR API.

[asr]
url = "http://localhost:9092/v1/audio/transcriptions"
lang = "auto"
prompt = "Hello\n你好\n(noise)\n(bgm)\n(silence)\n"
vad_url = "http://localhost:9094/v1/audio/vad"

About

VAD (Voice Activity Detection) as a service, written in Rust

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Contributors 2

  •  
  •  

Languages