Skip to main content

🚀 Day 5 of #OpenSourceWeek: 3FS, Thruster for All DeepSeek Data AccessFire-Flyer File System (3FS) - a parallel file system that utilizes the full bandwidth of modern SSDs and RDMA networks.⚡ 6.6 TiB/s aggregate read throughput in a 180-node cluster⚡ 3.66 TiB/min throughput on GraySort benchmark in a 25-node cluster⚡ 40+ GiB/s peak throughput per client node for KVCache lookup🧬 Disaggregated architecture with strong consistency semantics✅ Training data preprocessing, dataset loading, checkpoint saving/reloading, embedding vector search & KVCache lookups for inference in V3/R1📥 3FS → github.com/deepseek-ai/3FS⛲ Smallpond - data processing framework on 3FS → github.com/deepseek-ai/smallpond

  1. 🚀 Day 5 of #OpenSourceWeek: 3FS, Thruster for All DeepSeek Data Access

    Fire-Flyer File System (3FS) - a parallel file system that utilizes the full bandwidth of modern SSDs and RDMA networks.

    6.6 TiB/s aggregate read throughput in a 180-node cluster
    3.66 TiB/min throughput on GraySort benchmark in a 25-node cluster
    40+ GiB/s peak throughput per client node for KVCache lookup
    🧬 Disaggregated architecture with strong consistency semantics
    Training data preprocessing, dataset loading, checkpoint saving/reloading, embedding vector search & KVCache lookups for inference in V3/R1

    📥 3FS → github.com/deepseek-ai/3FS
    Smallpond - data processing framework on 3FS → github.com/deepseek-ai/smallpond