AI Infra 工程师:https://www.infoq.cn/article/edwy1v3xy14pgkefdv1u
黄大年茶思屋:三大缩放定律: https://www.chaspark.com/#/hotspots/1174432473590185984
「预训练 Scaling 法则(Pretraining Scaling Law)」、「后训练 Scaling 法则(Post-Training Scaling Law)和推理阶段 Scaling 法则(Test-Time Scaling Law,又称 Long Thinking)」