Hi everyone 👋
Welcome back to AI Agent Weekly. This week, Anthropic launched its newest top model, OpenAI opened cyber tools to more defenders, and Mozilla announced a new open AI client you can run yourself. Let's get into it.
ANTHROPIC LAUNCHES CLAUDE OPUS 4.7: BETTER CODING, SHARPER VISION, SMARTER WORK

What's Happening: Anthropic released Claude Opus 4.7, its newest high-end model. It is now available to everyone. The model is better at hard coding tasks, sees images in higher detail, and follows instructions more carefully.
Report Includes:
Better at complex code: Opus 4.7 solves more tough software tasks than Opus 4.6, with gains on benchmarks like SWE-bench and Terminal-Bench.
Sharper vision: It can read images up to 2,576 pixels wide, helping with diagrams, screenshots, and technical drawings.
Smarter memory: It remembers context across long sessions, so you spend less time repeating yourself.
Safer by design: Cyber features are limited on purpose. Security pros can apply for special access through the Cyber Verification Program.
Same price: $5 per million input tokens, $25 per million output tokens.
Why It Matters: Opus 4.7 is a direct upgrade for developers who need a model that can handle long, complex tasks with less supervision. The better vision and memory mean it can work more like a real teammate. The careful cyber safeguards show Anthropic is testing safety steps before releasing even more powerful models.
OPENAI EXPANDS TRUSTED ACCESS FOR CYBER DEFENSE

What's Happening: OpenAI is opening its Trusted Access for Cyber program to more verified defenders. Approved users can now use GPT-5.4-Cyber, a version of GPT-5.4 with fewer restrictions for legitimate security work.
Report Includes:
GPT-5.4-Cyber: A special model fine-tuned for defensive security tasks like reverse engineering and vulnerability research.
Three guiding ideas: Democratized access (clear rules for who gets in), iterative deployment (learn and improve over time), and ecosystem resilience (support the defender community).
Easy sign-up: Individuals verify at chatgpt.com/cyber. Teams contact their OpenAI rep.
Built on real results: Codex Security has already helped fix over 3,000 high-severity bugs.
Why It Matters: Cyber threats are growing fast. This program lets skilled defenders use powerful AI tools while keeping safeguards in place. It shows OpenAI is balancing broad access with responsible controls - a model other labs may follow.
QWEN RELEASES QWEN3.6-35B-A3B:AN OPEN-SOURCE FOR ALL YOUR NEEDS

What's Happening: Alibaba's Qwen team open-sourced Qwen3.6-35B-A3B, a lightweight MoE model with 35 billion total parameters but only 3 billion active at a time. It is built for agentic coding and repository-scale work, designed for teams that need control, efficiency, and results.
Report Includes:
Open weights for full control: Free to download, use, and modify. Teams can host on their own servers or in private clouds to meet data governance and compliance needs.
Strong coding for real workflows: Matches larger models on agentic coding benchmarks and repo-level tasks, helping teams automate code reviews, refactoring, and legacy modernization.
Multimodal support for technical docs: Includes a vision encoder to process diagrams, architecture charts, and UI mockups, enabling richer context for development pipelines.
Easy integration: Compatible with common MLOps tools and identity systems, reducing time to production.
Why It Matters: Teams need AI that fits their security, cost, and workflow requirements. Qwen3.6-35B-A3B gives groups a high-performance, open model they can customize, audit, and deploy on their own infrastructure. This reduces vendor risk, lowers long-term costs, and accelerates adoption of agentic AI in regulated or sensitive environments. For engineering leaders evaluating build versus buy, this model offers a credible path to own the stack while staying competitive.
OPENAI UPDATES CODEX: NOW WORKS ACROSS YOUR COMPUTER AND TOOLS

What's Happening: OpenAI released a major Codex update. It can now use apps on your computer, work in a built-in browser, generate images, remember your preferences, and handle repeat tasks over time.
Report Includes:
Computer use: Codex can see your screen, click, and type - helpful for testing apps or working in tools without APIs.
In-app browser: Comment on web pages to give Codex precise instructions for frontend work.
Image generation: Uses gpt-image-1.5 to create and edit visuals inside coding workflows.
90+ new plugins: Connect to Jira, GitLab, Microsoft Suite, and more for richer context.
Memory and automations: Codex can remember your preferences and schedule future work.
Why It Matters: Codex is moving beyond writing code to becoming a full development partner. By working across your whole workflow - code, browser, apps, and docs - it reduces context switching and helps developers ship faster.
GOOGLE LAUNCHES GEMINI 3.1 FLASH TTS: EXPRESSIVE AI SPEECH WITH FINE CONTROL

What's Happening: Google released Gemini 3.1 Flash TTS, a new text-to-speech model in public preview. It supports 70+ languages and lets you control tone, style, and pacing using simple audio tags.
Report Includes:
200+ audio tags: Add tags like [whispers], [fast], or [excitement] directly in your text to shape the output.
High quality: Ranks first on public TTS leaderboards with Elo score of 1211.
SynthID watermark: All output includes a hidden mark to identify AI-generated audio.
Easy start: Available now in Google AI Studio and Vertex AI.
Why It Matters: Great voice output is key for accessible apps, audiobooks, and enterprise alerts. Gemini 3.1 Flash TTS gives developers fine-grained control without complex setup - a big step toward natural, customizable AI speech.
ANTHROPIC ADDS ROUTINES TO CLAUDE CODE: AUTOMATE WORK WHILE YOU SLEEP

What's Happening: Anthropic launched Routines in Claude Code (research preview). A routine is a saved automation - prompt, repo, and connectors - that runs on a schedule, via API, or in response to events like GitHub PRs.
Report Includes:
Three trigger types: Scheduled (hourly, nightly), API-triggered, or GitHub webhook-based.
Runs in the cloud: No need to keep your laptop open.
Common uses: Triage bugs nightly, verify deploys, auto-review PRs, or port code between languages.
Usage limits: Pro users get 5 routines/day, Max gets 15, Team/Enterprise gets 25.
Why It Matters: Automation is powerful, but setting up cron jobs and infrastructure is tedious. Routines package Claude Code automations into simple, reusable workflows. This lowers the barrier to building reliable, always-on agent systems.
OPENAI INTRODUCES GPT-ROSALIND FOR LIFE SCIENCES RESEARCH

What's Happening: OpenAI launched GPT-Rosalind, a model built for biology, drug discovery, and medical research. It helps scientists review literature, plan experiments, and analyze complex data.
Report Includes:
Built for science: Better at chemistry, protein design, genomics, and using scientific tools in multi-step workflows.
Strong results: Leads on BixBench, a real-world bioinformatics benchmark. Outperforms GPT-5.4 on 6 of 11 LABBench2 research tasks.
Trusted access: Available to qualified U.S. enterprise customers through a safety review process.
Free plugin: A Life Sciences Research Plugin for Codex connects to 50+ scientific databases and tools.
Why It Matters: Drug discovery takes 10-15 years. AI that helps scientists move faster through early research could speed up breakthroughs. GPT-Rosalind shows how domain-specific models can add real value in high-stakes fields.
MOZILLA ANNOUNCES THUNDERBOLT: OPEN-SOURCE, SELF-HOSTED AI CLIENT

What's Happening: Mozilla's MZLA subsidiary announced Thunderbolt, an open-source AI client you can self-host. It gives organizations full control over their data, models, and AI infrastructure.
Report Includes:
Bring your own models: Connect to commercial APIs, open-source models, or local deployments.
Enterprise integrations: Works with deepset's Haystack for agent orchestration and RAG pipelines.
Cross-platform: Native apps for Windows, macOS, Linux, iOS, and Android.
Security first: Self-hosted option, optional end-to-end encryption, and device-level controls.
Why It Matters: Many organizations want AI benefits without giving up control. Thunderbolt offers a sovereign alternative to cloud-only AI clients. By being open-source and self-hostable, it lets teams build AI systems that fit their security and compliance needs.
Thanks for reading.
See you next week with more AI agent updates.
— Rakesh's Newsletter


