Ongoing research training transformer models at scale
Evals is a framework for evaluating LLMs and LLM systems
Framework that is dedicated to making neural data processing
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Implementation of model parallel autoregressive transformers on GPUs