TJ Solergibert
TJ Solergibert
Home
Blog
Experience
Large-Scale LLM Training
A Deep Dive into 3D Parallelism with Nanotron⚡️
In this post, we will present 3D parallelism, the technology behind large-scale LLM training. We will delve into the core details of pipeline, tensor, and data parallelism with code snippets from Nanotron⚡️, a 3D parallel trainer from Hugging Face🤗
Antoni-Joan Solergibert
Jun 10, 2024
13 min read
Cite
×