The frontier moves weekly.
Forecasting AI capability used to be the hard part. It isn't anymore. The hard part is reading what shipped this week, deciding which businesses it changes, and getting that change into production before the next thing ships. This page tracks our thinking against the moving frontier.
Every business in Oman is operating against a 2026 frontier with a 2023 playbook.
Anthropic ships an upgrade every few weeks. Independent research changes monthly. The gap between “what Claude can already do” and “what most teams are using” has stopped narrowing and started widening.
That is the central thesis of Orfloat — and the reason we publish notes here regularly. Not announcements. Working drafts of how we think about the frontier from the inside of a Muscat operating business.
The capability overhang is the most consequential under-discussed force in the Gulf economy today. Reading about AI on Twitter is not the same as having a roadmap.
Forty-two agents on our own codebase, and the call they couldn't make.
We pointed Claude Opus 4.8 and Claude Code's new dynamic workflows at our own website — 42 agents, ~14 minutes, one production deploy. The workflow surfaced a genuinely high-value fix and then nearly fooled itself. The judgment that caught it is the whole point.
Read noteClaude Opus 4.8, and the discipline it asks for.
Anthropic shipped Opus 4.8 today — around four times less likely to let its own code flaws pass, paired with dynamic workflows that orchestrate hundreds of subagents from a single session. The capability stopped being the constraint a while ago. What is left is whether you can describe the work and discern the output — the disciplines, not the model.
Read noteSoftware after software, and the record so far.
Twelve theses on what software becomes when intelligence is abundant — and the empirical record from the last six months that says they are no longer speculative.
Read noteOur north star is Anthropic's own thinking.
We read anthropic.com/news and claude.com/blog end-to-end as they ship. Around that, we follow frontier research from DeepMind, OpenAI, Meta FAIR, and the academic community at NeurIPS, ICML, and ACL. And we keep watch on regional signals — Vision 2040 progress, GCC enterprise adoption data, PwC and McKinsey's MENA work.
When we publish a note, it's because we've read something that we think changes the decision a Muscat operator should be making this quarter. Not because we needed content for the site.