fsdp
Here are 28 public repositories matching this topic...
Best practices & guides on how to write distributed pytorch training code
-
Updated
Oct 22, 2025 - Python
Repo for Qwen Image Finetune
-
Updated
Dec 11, 2025 - Jupyter Notebook
META LLAMA3 GENAI Real World UseCases End To End Implementation Guide
-
Updated
Sep 24, 2024 - Jupyter Notebook
Llama-style transformer in PyTorch with multi-node / multi-GPU training. Includes pretraining, fine-tuning, DPO, LoRA, and knowledge distillation. Scripts for dataset mixing and training from scratch.
-
Updated
Dec 24, 2025 - Python
🦾💻🌐 distributed training & serverless inference at scale on RunPod
-
Updated
May 26, 2024 - Jupyter Notebook
A comprehensive hands-on guide to building production-grade distributed applications with Ray - from distributed training and multimodal data processing to inference and reinforcement learning.
-
Updated
Oct 28, 2025 - Python
Fast and easy distributed model training examples.
-
Updated
Nov 26, 2024 - Python
A script for training the ConvNextV2 on CIFAR10 dataset using the FSDP technique for a distributed training scheme.
-
Updated
Dec 11, 2023 - Python
Simple and efficient implementation of 671B DeepSeek V3 that trainable with FSDP+EP and minimal requirement of 256x A100/H100, targeted for HuggingFace ecosystem
-
Updated
Nov 3, 2025 - Python
Minimal yet high performant code for pretraining llms. Attempts to implement some SOTA features. Implements training through: Deepspeed, Megatron-LM, and FSDP. WIP
-
Updated
Feb 6, 2024 - Python
Framework, Model & Kernel Optimizations for Distributed Deep Learning - Data Hack Summit
-
Updated
Aug 1, 2023 - Python
Implementations of some popular approaches for efficient deep learning training and inference
-
Updated
Mar 30, 2025 - Python
Dataloading for JAX
-
Updated
Oct 3, 2024 - Python
Scalable multimodal AI system combining FSDP, RLHF, and Inferentia optimization for customer insights generation.
-
Updated
May 3, 2025 - Python
🎨 Generate high-quality images with the Qwen-Image model, a powerful text-to-image tool optimized for fast and efficient deployment on serverless architecture.
-
Updated
Nov 15, 2025
A foundational repository for setting up distributed training jobs using Kubeflow and PyTorch FSDP.
-
Updated
Jan 7, 2025 - Python
Mini-FSDP for PyTorch. Minimal single-node Fully Sharded Data Parallel wrapper with param flattening, grad reduce-scatter, AMP, and tiny GPT/BERT training examples.
-
Updated
Sep 21, 2025 - Python
Improve this page
Add a description, image, and links to the fsdp topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the fsdp topic, visit your repo's landing page and select "manage topics."