AI Tools Weekly: Glasswing Expands + 2 More Updates

The biggest story in AI tooling this week is not a new model or a flashy demo. It is Anthropic scaling Project Glasswing from roughly 50 organizations to approximately 200 across more than 15 countries, using Claude Mythos Preview to hunt vulnerabilities in critical infrastructure at machine speed. Meanwhile, OpenAI's Codex ecosystem pushed two quieter but meaningful updates: biometric security locks on the iOS app and a new Codex Sites capability that takes you from natural-language prompt to deployed website. Taken together, these updates mark a clear inflection point: AI coding tools are no longer productivity accessories. They are becoming core infrastructure in your security and deployment stack.

TL;DR: Glasswing is the headline because it signals that frontier-model security capabilities are being operationalized in power grids, hospitals, and telecom networks right now. Codex Sites is worth watching because prompt-to-production tooling raises real governance questions every engineering leader needs to get ahead of. The iOS lock screen update is minor polish but reflects a broader push toward making AI coding tools safe enough for sensitive work.

Anthropic: Project Glasswing Goes Global

Project Glasswing started in April as a tightly controlled cohort of about 50 organizations, including the U.S. government and Apple, using Claude Mythos Preview to scan large codebases for vulnerabilities. This week, Anthropic invited approximately 150 new organizations into the program, bringing the total to around 200 partners. The expansion is significant for two reasons. First, the new members skew heavily toward critical infrastructure: power operators, water utilities, healthcare systems, telecommunications providers, and hardware manufacturers. These sectors were barely represented in the original cohort. These are also the organizations most likely to be running legacy codebases in memory-unsafe languages on systems that cannot afford downtime or a breach. Getting Mythos-class AI into those codebases is not a nice-to-have. It is a structural security upgrade. Second, Anthropic is being unusually candid about the competitive timeline. The company warns that within 6 to 12 months, many other AI vendors are expected to reach Mythos-class capability, and that some may release those models without comparable cyber-safeguards. That is an explicit acknowledgment that the defensive window is short. Organizations that get access to vetted, safety-guided frontier models now are building institutional practices before the landscape gets crowded and chaotic.

What Glasswing Partners Are Actually Doing

The use cases are more operationally mature than most pilot programs at this stage. Current partners are using Claude Mythos Preview for:

•
Large-scale vulnerability scanning across codebases
•
Writing patches for discovered vulnerabilities
•
Pre-release security checks
•
Penetration testing and threat detection
•
Translating legacy code into memory-safe languages

That last one deserves attention. Automated translation of C or C++ into Rust or similar is one of the hardest and most labor-intensive security investments an engineering org can make. Mythos-class models doing this at scale changes the economics entirely. Rubrik has reported that Anthropic's frontier model found thousands of critical software vulnerabilities across the landscape. More alarming is the framing around exploit windows: AI has collapsed the time between vulnerability discovery and active exploitation from months to mere seconds. That is not a theoretical future threat. That is the current operating environment.

What Engineering Leaders Should Know

Anthropic has stated it is working to release Mythos-class models to all customers in the coming weeks, contingent on safeguards being in place. That means most teams will have access to these capabilities soon, whether they are ready or not. The strategic implication: the organizations that will benefit most are the ones that have already built the internal workflows, access controls, and review processes to use these models responsibly. The organizations that wait will scramble.

OpenAI Codex: Two Updates, One More Important Than It Looks

Codex Sites: Prompt to Production

Codex Sites is OpenAI's push toward natural-language-to-deployed-website generation. The pitch is straightforward: describe what you want, and the system generates and deploys a full site through an opinionated pipeline without requiring you to manually wire together frameworks, hosting, and CI/CD. This is directionally important, but it also surfaces the governance question that engineering leaders need to answer before their teams start using it: what can be auto-generated and shipped to production without a human review gate? Prompt-to-production tooling compresses the feedback loop in ways that are genuinely valuable for low-stakes services, internal tools, marketing pages, and documentation sites. It becomes a liability when it short-circuits review processes for anything touching sensitive data, authentication flows, or regulated systems. The right posture: pilot Codex Sites aggressively in low-risk surface areas. Define the tier of services where prompt-to-deploy is acceptable, and instrument those pipelines with observability and automated security checks before you expand the perimeter.

iOS App Update: Security Polish That Signals Intent

The ChatGPT/Codex iOS app update (v1.2026.146) adds optional Face ID or passcode lock, a new settings screen for configuring default follow-up behaviors, and a toggle for line wrapping in code diffs. On its own, this reads as routine polish. In context, it reflects something more deliberate. OpenAI is hardening the mobile surface for developers who are working with sensitive codebases on the go. Face ID protection for a coding assistant may seem trivial until you consider that developers are using these tools to review proprietary code, generate patches, and interact with production systems. This is OpenAI acknowledging that the threat model for coding assistants has changed. These tools are no longer playgrounds. They are part of the development workflow for systems that matter.

Comparison: Where These Updates Land

Update	Impact Level	Audience	Immediate Action Needed
Glasswing Expansion (200 orgs, 15+ countries)	High	Security teams, CTOs, infrastructure leads	Apply for access; audit legacy codebases
Codex Sites (prompt-to-deploy)	Medium	Frontend leads, platform teams	Pilot in low-risk services; define guardrails
Codex iOS Face ID lock	Low	Individual developers	Enable it; update mobile access policies

The Bigger Pattern: Security and Deployment Are Converging

What most coverage will miss this week is the through-line connecting all three updates. Frontier-model capabilities are simultaneously moving into offensive-grade security workflows (Glasswing) and into end-to-end deployment pipelines (Codex Sites). These two vectors are converging. In 12 months, the standard engineering workflow will likely involve AI that can scan your codebase for vulnerabilities, generate patches, and deploy updated code, all within a single pipeline. That is either a massive productivity unlock or a massive attack surface, depending entirely on how your team has designed the guardrails. Anthropic's 6-to-12 month warning about competitors releasing Mythos-class models without equivalent safeguards is not alarmism. It is a strategic timeline. The teams that build secure-SDLC practices around AI-assisted development now will have institutional muscle memory before the landscape becomes crowded with less safety-conscious alternatives.

What to Do This Week

Apply for Glasswing access or equivalent. If you run or support critical infrastructure, healthcare systems, or telecommunications, this is the highest-value security program available right now. Start here.

Audit your legacy codebases for Mythos-class scanning readiness. Before you have access to these tools, identify which codebases are the highest-priority candidates for vulnerability scanning and memory-safe translation. Have that list ready.

Define your prompt-to-production policy before Codex Sites becomes default behavior. Categorize your services by risk tier. Decide now which tiers can accept AI-generated-and-deployed code without a manual review gate. Document it. Share it with your team.

Update your mobile AI tool policies. If your team uses Codex or ChatGPT mobile for anything touching proprietary code, require biometric lock. Update your acceptable-use policy to reflect that these tools now carry the same security weight as a corporate device with VPN access.

Track the Mythos-class release timeline. Anthropic has signaled a general release is weeks away. Put it on your radar and plan a structured internal evaluation, not an ad-hoc rollout.

The Bottom Line

The era of AI coding tools as optional productivity boosts is ending. Glasswing's expansion into critical infrastructure signals that frontier models are being embedded into security-sensitive production environments at scale. Codex Sites signals that the gap between prompt and production deployment is closing fast. Both of these are net positives for well-prepared engineering organizations and genuine risks for ones that have not updated their practices.

The engineers who will be most valuable to your team in the next 12 months are the ones who can work fluently with these systems while maintaining rigorous judgment about what gets reviewed, what gets shipped, and what stays gated. That combination of AI fluency and engineering discipline is exactly what elite teams are hiring for right now. Finding those engineers is harder than it has ever been, which is precisely why how you hire matters as much as what you ship.

Nextdev