果冻甜甜的
Hmm..! 16 posts in total so far. Keep it up.
2025
12-28
Reducing Energy Bloat in Large Model Training
12-28
Rail-only: A Low-Cost High-Performance Network for Training LLMs with Trillion Parameters
11-23
Reducing Activation Recomputation in Large Transformer Models
11-23
Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM
11-22
InstructCoder: Instruction Tuning Large Language Models for Code Editing
11-22
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
09-07
An Introduction to Tokens
09-07
Streams and Events in PyTorch
08-17
Common Shell Commands on Ubuntu
08-17
Lumos: Efficient Performance Modeling and Estimation for Large-scale LLM Training