runngezhang

Starred repositories


Keras Temporal Convolutional Network. Supports Python and R.

Python 1,985 464 Updated Apr 8, 2025

MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation (TPAMI 2020)

Python 169 41 Updated Feb 26, 2023

A collection of classic e-books on history, politics, psychology, philosophy, mathematics, and computer science (about 100,000 books)

JavaScript 6,524 866 Updated Sep 28, 2023
Python 25 8 Updated Sep 30, 2019

A fast implementation of bss_eval metrics for blind source separation

Python 142 9 Updated Sep 6, 2025

Single channel speech source separation by diffusion process (ICASSP 2023)

Python 120 14 Updated Mar 15, 2024

Implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" in PyTorch

Python 203 36 Updated Oct 8, 2020

Noise Suppression Module Port From WebRTC

C 339 160 Updated Mar 3, 2021

Noise Suppression Module Port From WebRTC

C 2 Updated Jun 29, 2019

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

Python 245 31 Updated Sep 13, 2024

Official implementation of DualCycleGAN for nonparallel audio super resolution

Python 53 5 Updated Nov 1, 2022

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,730 2,506 Updated Mar 13, 2025

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,638 176 Updated Aug 27, 2025
HTML 129 23 Updated Oct 11, 2024

Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (CNN) are one of the best-performing neural network architect…

Python 30 4 Updated Dec 19, 2019

Based on sound processing and audio feature extraction

Python 1 2 Updated Apr 11, 2022

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022

Python 302 23 Updated Sep 16, 2023

An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"

Python 1 1 Updated Jun 7, 2022

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,111 607 Updated Oct 27, 2023

This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)

Python 28 2 Updated Aug 8, 2022

Unofficial Pytorch Lightning Implementation of SRGAN

Python 1 1 Updated Apr 28, 2023
Jupyter Notebook 2 1 Updated Apr 20, 2023
Python 1 1 Updated Jul 17, 2023

Unofficial Pytorch Lightning Implementation of "Towards Robust Speech Super-Resolution"

Python 10 4 Updated May 8, 2023

Unofficial Pytorch Lightning Implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"

Python 21 6 Updated May 9, 2023

This repository contains the official PyTorch implementation of the paper: "Learning Discrete Structured VAE using NES".

Python 4 4 Updated May 3, 2022

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Python 234 33 Updated May 1, 2025

This project transfers the self-supervised Wav2vec2 representation to low-resource languages

Jupyter Notebook 3 1 Updated Jul 4, 2021

This is an attempt to list interesting audio bandwidth expansion/super-resolution research works.

6 2 Updated May 24, 2022