Robert Youssef (@rryssf_)

2025-10-22 | โค๏ธ 2278 | ๐Ÿ” 317


🚨 Holy shit… Meta just rewrote how Transformers think.

They built something called The Free Transformer and it breaks the core rule every GPT model has lived by since 2017.

For 8 years, Transformers have been blindfolded: forced to guess the next token one at a time, with no inner plan and no latent thought.

Meta gave it one.

They added random latent variables inside the decoder so the model can secretly decide how it wants to generate before it starts talking.

It's like giving GPT a hidden mind.
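
To make that concrete, here's a minimal PyTorch sketch of the idea, not the paper's exact architecture: a latent Z is sampled and injected into the residual stream halfway up the decoder stack, so the upper half generates conditioned on a "decision" fixed before any token comes out. All names, shapes, and the random-bits prior are my assumptions.

```python
import torch
import torch.nn as nn

class FreeDecoderSketch(nn.Module):
    """Hypothetical sketch of the core idea: a decoder-only Transformer
    that injects a randomly sampled latent Z halfway up the stack, so the
    upper half generates conditioned on a "plan" fixed before decoding.
    Not the paper's exact architecture."""

    def __init__(self, vocab=32000, d=256, n_layers=8, n_latent=16):
        super().__init__()
        self.embed = nn.Embedding(vocab, d)
        make = lambda: nn.TransformerEncoderLayer(d, nhead=8, batch_first=True)
        self.lower = nn.ModuleList([make() for _ in range(n_layers // 2)])
        self.upper = nn.ModuleList([make() for _ in range(n_layers // 2)])
        self.latent_proj = nn.Linear(n_latent, d)  # lift Z into the residual stream
        self.head = nn.Linear(d, vocab)
        self.n_latent = n_latent

    def forward(self, tokens):
        B, T = tokens.shape
        causal = torch.triu(torch.full((T, T), float("-inf")), diagonal=1)
        h = self.embed(tokens)
        for blk in self.lower:
            h = blk(h, src_mask=causal)
        # Inference-time behavior: Z is drawn from a fixed prior (random
        # bits here, one vector per sequence) BEFORE the upper half runs.
        z = torch.randint(0, 2, (B, 1, self.n_latent)).float()
        h = h + self.latent_proj(z)  # broadcast the latent over all positions
        for blk in self.upper:
            h = blk(h, src_mask=causal)
        return self.head(h)

logits = FreeDecoderSketch()(torch.randint(0, 32000, (2, 12)))
print(logits.shape)  # torch.Size([2, 12, 32000])
```

Training is the part this sketch skips: per the paper, Z is fit conditional-VAE style, with an encoder inferring it from the actual sequence and a regularizer keeping the latent from just smuggling in the answer. That extra machinery is roughly where the small compute-overhead figure below comes from.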

Result:

🧠 Smarter reasoning
⚡️ 3% compute overhead
📈 Outperforms larger baselines on GSM8K, MMLU, and HumanEval

It's the first Transformer that doesn't just predict; it intends.

Full paper: arxiv.org/abs/2510.17558v1


Tags

AI-ML