LLM | Category | Blog de Simon🫣

SimonSun

Internet Malou, LLM Rookie, Bug Maker🤧

Latest Posts

Summary: Loss Analysis of PPO, GRPO, GSPO, and RLOO

openclaw Feishu Configuration: Pitfalls Encountered

A Primer on Reward Hacking and Entropy Collapse

Routing Replay for MoE Models → R3

GRPO → GSPO → DAPO → SAPO

Announcement

🙌README🙌

🤯There is nothing left

in my right brain,

🤯and there is nothing right

in my left brain...

⭐I wish you every success⭐