audio-diffusion-pytorch

A fully featured audio diffusion library for PyTorch. It includes models for unconditional audio generation, text-conditional audio generation, diffusion autoencoding, upsampling, and vocoding. The provided models are waveform-based; however, the U-Net (built using a-unet), DiffusionModel, diffusion method, and diffusion samplers are all generic to any dimension and highly customizable to work on other formats. Note: no pre-trained models are provided here; this library is meant for research purposes.
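To illustrate what a dimension-agnostic diffusion method can look like, here is a minimal sketch of one common formulation, the v-objective noising step. It is not taken from this page and does not use the library's own classes: the function names `v_diffusion_noise` and `recover_clean` are hypothetical, and plain Python lists stand in for tensors. Because every operation is elementwise, the same arithmetic would apply to a signal of any shape.

```python
import math
import random


def v_diffusion_noise(x0, sigma, rng):
    """Noise a clean signal at level sigma in [0, 1] (hypothetical helper).

    x0 is a flat list of floats. Returns the noisy signal, the v target a
    network would be trained to predict, and the (alpha, beta) mixing weights.
    """
    angle = sigma * math.pi / 2
    alpha, beta = math.cos(angle), math.sin(angle)  # alpha**2 + beta**2 == 1
    noise = [rng.gauss(0.0, 1.0) for _ in x0]
    x_noisy = [alpha * x + beta * n for x, n in zip(x0, noise)]
    v_target = [alpha * n - beta * x for x, n in zip(x0, noise)]
    return x_noisy, v_target, (alpha, beta)


def recover_clean(x_noisy, v, alpha, beta):
    """Invert the mixing: x0 = alpha * x_noisy - beta * v."""
    return [alpha * xt - beta * vi for xt, vi in zip(x_noisy, v)]


rng = random.Random(0)
# A short 440 Hz sine at an assumed 16 kHz sample rate stands in for audio.
x0 = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(64)]
x_noisy, v, (alpha, beta) = v_diffusion_noise(x0, 0.3, rng)
x_rec = recover_clean(x_noisy, v, alpha, beta)
```

With the exact v target, `recover_clean` returns the original signal up to floating-point error. In a trained model, a network's prediction of v would take the place of `v`, and a sampler would step sigma from 1 toward 0.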

Features

Unconditional Generator
Text-Conditional Generator
Diffusion Upsampler
Diffusion Vocoder
Diffusion Autoencoder
Inpainting

License

MIT License

Additional Project Details

Programming Language

Python

Related Categories

Python AI Music Generators, Python Generative AI, Python Inpainting Tool

Registered

2023-03-28

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch
