MUG-V/MUG-V-training

zyfan committed on Oct 20, 2025

Commit bba1432 · verified · 1 Parent(s): 1bd5e47

Update README.md

Files changed (1)

README.md +0 -2


@@ -20,7 +20,6 @@ Pre-trained Megatron-format checkpoints for [MUG-V 10B](https://github.com/Shope
 **Torch Distributed Checkpoint** - Flexible parallelism support
-- **Size**: ~64 GB
 - **Format**: Torch Distributed (`.distcp`)
 - **Parallelism**: Can be loaded with **any TP/PP configuration**
 - **Use Case**: Production training, flexible distributed setup
@@ -33,7 +32,6 @@ huggingface-cli download MUG-V/MUG-V-training --local-dir ./checkpoints --includ
 **Torch Format (Legacy)** - Fixed TP=4
-- **Size**: ~64 GB
 - **Format**: Torch format (`mp_rank_XX/model_optim_rng.pt`)
 - **Parallelism**: Must be loaded with **TP=4**
 - **Use Case**: Fixed TP setup or conversion to Torch Distributed

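
As a rough illustration of the two layouts the README describes, the Python sketch below guesses which format a downloaded checkpoint directory uses. The function names and the assumed directory layout (`.distcp` shards somewhere under the directory, or `mp_rank_XX/` folders that each hold `model_optim_rng.pt`) are assumptions made for this example and are not part of the repository.

```python
import re
from pathlib import Path

# The README states that the legacy Torch format must be loaded with TP=4.
LEGACY_TP = 4


def count_legacy_ranks(ckpt_dir):
    """Count distinct mp_rank_XX folders that contain model_optim_rng.pt."""
    root = Path(ckpt_dir)
    ranks = {
        p.parent.name
        for p in root.rglob("model_optim_rng.pt")
        if re.fullmatch(r"mp_rank_\d{2}", p.parent.name)
    }
    return len(ranks)


def detect_checkpoint_format(ckpt_dir):
    """Return 'torch_dist', 'torch_legacy', or 'unknown' (hypothetical labels)."""
    root = Path(ckpt_dir)
    # Torch Distributed checkpoints are stored as .distcp shard files.
    if any(root.rglob("*.distcp")):
        return "torch_dist"
    # Legacy checkpoints keep one model_optim_rng.pt per mp_rank_XX folder.
    if count_legacy_ranks(root) > 0:
        return "torch_legacy"
    return "unknown"
```

A caller could compare `count_legacy_ranks(path)` against `LEGACY_TP` before launching a job, since a legacy checkpoint with a different number of rank folders would not match the fixed TP=4 setup described above.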