• Home
  • Journey
  • Projects
  • Writing
  • Reading
  • Setup
  • Contact

Tagged: attention

2 articles

ilo: A Programming Language for AI Agents, Not HumansMay 22, 2026 Three Bets on Long-Context Attention Gemma 4, Qwen 3.6, and DeepSeek V3 each take a different path to long context. A reverse-engineering test shows where the trade hurts.
llmaiattentionmodel-comparison
Read article
May 4, 2026 DeepSeek V4: Don't Look at What You Don't Need DeepSeek V4 reads a million tokens on roughly a quarter of V3.2's compute. It does this by selectively attending to the parts of context the prompt asks about, the same way humans skim a long book.
llmaiattentiondeepseek
Read article
All articles
© 2026 Daniel John Morris
Follow me on X · Linkedin · Github