The Middle Layer Leverage: Youโve Been Wasting 90% of Your AIโs Potential
A single transformer layer can match full-parameter RL post-training. The secret is the middle layer leverage: almost all performance gains cluster in mid-depth layers, while early and late layers handle syntax and decoding. This means you can freeze 90% of your model and still get top-tier results โ saving massive compute and cost.