DeepSpeed
Deep learning optimization library: makes distributed training easy
DeepSpeed is an easy-to-use deep learning optimization software suite that enables unprecedented scale and speed for Deep Learning Training and Inference. With DeepSpeed you can:
1. Train/Inference dense or sparse models with billions or trillions of parameters
2. Achieve excellent system throughput and efficiently scale to thousands of GPUs
3. Train/Inference on resource constrained GPU systems
4. Achieve unprecedented low latency and high throughput for inference
5. Achieve extreme compression for an unparalleled inference latency and model size reduction with low costs
DeepSpeed offers a confluence of system innovations, that has made large scale DL training effective, and efficient, greatly improved ease of use, and redefined the DL training landscape in terms of scale that is possible. These innovations such as ZeRO, 3D-Parallelism, DeepSpeed-MoE, ZeRO-Infinity, etc. fall under the training pillar.