DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Example models using DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
이 저장소에 대한 설명이 제공되지 않았습니다.
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.