Stars
Is synthetic data from generative models ready for image recognition?
Pytorch implementation of "Diversified in-domain synthesis with efficient fine-tuning for few-shot classification"
A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold N…
This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).
Magenta: Music and Art Generation with Machine Intelligence
Audio generation using diffusion models, in PyTorch.
Progressive Growing of GANs for Improved Quality, Stability, and Variation
21 Lessons, Get Started Building with Generative AI
Free MLOps course from DataTalks.Club
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
A JavaScript interface for annotating and labeling audio files.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
🏆 A ranked list of awesome Python open-source libraries and tools. Updated weekly.
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and mo…
Latex code for making neural networks diagrams
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
A baseline sample code of anomaly detection for MIMII Dataset
Audio classification Tflite package for flutter (iOS & Android). Can support Google Teachable Machine models
DOA, VAD and KWS for ReSpeaker Microphone Array
Benchmark for sound event localization task of DCASE 2019 challenge
Code and slides for the "Deep Learning (For Audio) With Python" course on TheSoundOfAI Youtube channel.
Baseline method for sound event localization task of DCASE 2021 challenge
Reference models and tools for Cloud TPUs.
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We significantly improve the systematic generalization of transf…
A toolbox to iNNvestigate neural networks' predictions!
Code for ''Explainable machine learning determines effects on the sound absorption coefficient measured in the impedance tube"
MoSQITo is a unified and modular development framework of key sound quality metrics favoring reproducible science and efficient shared scripting among engineers, teachers and researchers community.
Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, and sound event detection. Implemented using PyTorch.