Training Frameworks

2026年5月26日 · 约 1 分钟阅读

深度学习训练框架和相关技术。

📚 现有文档

Stage 2 Training - 分布式训练 Stage 2 优化

🔧 主题概览

PyTorch Ecosystem

PyTorch Core - PyTorch 核心概念
PyTorch Lightning - 高级训练框架
TorchScript - PyTorch 模型优化
PyTorch XLA - TPU 支持

TensorFlow Ecosystem

TensorFlow Core - TensorFlow 基础
Keras - 高级 API
TensorFlow Extended (TFX) - 端到端 ML 平台

Emerging Frameworks

JAX - 可组合的数值计算
Flax - JAX 神经网络库
Haiku - DeepMind 的 JAX 库

Memory Optimization

Gradient Checkpointing - 梯度检查点
Mixed Precision Training - 混合精度训练
ZeRO Optimizer - 内存优化器

Compilation & Acceleration

TorchCompile - PyTorch 2.0 编译
XLA Compilation - 加速线性代数编译器
Custom CUDA Kernels - 自定义 CUDA 内核

修改历史7 次提交

docs: refresh wiki indexes and embeddings
xiaocheng·07-01·41f3817
chore(wiki): add description to 170 pages (clear missing-description lint)
xiaocheng·06-09·2be0221
fix(wiki): clean all lint errors to enable strict CI (PR-3)
xiaocheng·05-25·75375ef
feat(settings): add Astro settings configuration and update documentation
xiaocheng·01-17·d488aef
refactor: reorganize documentation structure and update Navbar component
xiaocheng·01-17·2fb8f42
chore(project): clean up obsolete configuration and build artifacts
xiaocheng·01-16·3574bd3
refactor AI post
xiaocheng·2025-08-15·a5a7637