NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. NeMo uses PyTorch Lightning for easy and performant multi-GPU/multi-node mixed-precision training. Supported models: Jasper, QuartzNet, CitriNet, Conformer-CTC, Conformer-Transducer, Squeezeformer-CTC, Squeezeformer-Transducer, ContextNet, LSTM-Transducer (RNNT), LSTM-CTC. NGC collection of pre-trained speech processing models.

Features

  • Python version 3.6, 3.7 or 3.8
  • Pytorch version 1.8.1
  • You must have access to an NVIDIA GPU for training
  • Text Classification (Sentiment Analysis)
  • NeMo voice swap demo
  • Build a nemo container with Dockerfile from a branch

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow NVIDIA NeMo

NVIDIA NeMo Web Site

Other Useful Business Software
Comprehensive Cybersecurity to Safeguard Your Organization | SOCRadar Icon
Comprehensive Cybersecurity to Safeguard Your Organization | SOCRadar

See what hackers already know about your organization – and stop them from getting in.

Protect your organization from cyber threats with SOCRadar’s cutting-edge threat intelligence. Gain 360° visibility into your digital assets, monitor the dark web, and stay ahead of hackers with real-time insights. Start for free and transform your cybersecurity today.
Free Trial
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of NVIDIA NeMo!