Claude Opus 4.7: Game-Changing AI Model for Harder Tasks

Hi everyone 👋

Welcome back to AI Agent Weekly. This week, Anthropic launched its newest top model, OpenAI opened cyber tools to more defenders, and Mozilla announced a new open AI client you can run yourself. Let's get into it.

ANTHROPIC LAUNCHES CLAUDE OPUS 4.7: BETTER CODING, SHARPER VISION, SMARTER WORK

What's Happening: Anthropic released Claude Opus 4.7, its newest high-end model. It is now available to everyone. The model is better at hard coding tasks, sees images in higher detail, and follows instructions more carefully.

Report Includes:

Better at complex code: Opus 4.7 solves more tough software tasks than Opus 4.6, with gains on benchmarks like SWE-bench and Terminal-Bench.
Sharper vision: It can read images up to 2,576 pixels wide, helping with diagrams, screenshots, and technical drawings.
Smarter memory: It remembers context across long sessions, so you spend less time repeating yourself.
Safer by design: Cyber features are limited on purpose. Security pros can apply for special access through the Cyber Verification Program.
Same price: $5 per million input tokens, $25 per million output tokens.

Why It Matters: Opus 4.7 is a direct upgrade for developers who need a model that can handle long, complex tasks with less supervision. The better vision and memory mean it can work more like a real teammate. The careful cyber safeguards show Anthropic is testing safety steps before releasing even more powerful models.

Read the full report

OPENAI EXPANDS TRUSTED ACCESS FOR CYBER DEFENSE

What's Happening: OpenAI is opening its Trusted Access for Cyber program to more verified defenders. Approved users can now use GPT-5.4-Cyber, a version of GPT-5.4 with fewer restrictions for legitimate security work.

Report Includes:

GPT-5.4-Cyber: A special model fine-tuned for defensive security tasks like reverse engineering and vulnerability research.
Three guiding ideas: Democratized access (clear rules for who gets in), iterative deployment (learn and improve over time), and ecosystem resilience (support the defender community).
Easy sign-up: Individuals verify at chatgpt.com/cyber. Teams contact their OpenAI rep.
Built on real results: Codex Security has already helped fix over 3,000 high-severity bugs.

Why It Matters: Cyber threats are growing fast. This program lets skilled defenders use powerful AI tools while keeping safeguards in place. It shows OpenAI is balancing broad access with responsible controls - a model other labs may follow.

Read the full report

QWEN RELEASES QWEN3.6-35B-A3B:AN OPEN-SOURCE FOR ALL YOUR NEEDS

What's Happening: Alibaba's Qwen team open-sourced Qwen3.6-35B-A3B, a lightweight MoE model with 35 billion total parameters but only 3 billion active at a time. It is built for agentic coding and repository-scale work, designed for teams that need control, efficiency, and results.

Report Includes:

Open weights for full control: Free to download, use, and modify. Teams can host on their own servers or in private clouds to meet data governance and compliance needs.
Strong coding for real workflows: Matches larger models on agentic coding benchmarks and repo-level tasks, helping teams automate code reviews, refactoring, and legacy modernization.
Multimodal support for technical docs: Includes a vision encoder to process diagrams, architecture charts, and UI mockups, enabling richer context for development pipelines.
Easy integration: Compatible with common MLOps tools and identity systems, reducing time to production.

Why It Matters: Teams need AI that fits their security, cost, and workflow requirements. Qwen3.6-35B-A3B gives groups a high-performance, open model they can customize, audit, and deploy on their own infrastructure. This reduces vendor risk, lowers long-term costs, and accelerates adoption of agentic AI in regulated or sensitive environments. For engineering leaders evaluating build versus buy, this model offers a credible path to own the stack while staying competitive.

Read the full report

OPENAI UPDATES CODEX: NOW WORKS ACROSS YOUR COMPUTER AND TOOLS

What's Happening: OpenAI released a major Codex update. It can now use apps on your computer, work in a built-in browser, generate images, remember your preferences, and handle repeat tasks over time.

Report Includes:

Computer use: Codex can see your screen, click, and type - helpful for testing apps or working in tools without APIs.
In-app browser: Comment on web pages to give Codex precise instructions for frontend work.
Image generation: Uses gpt-image-1.5 to create and edit visuals inside coding workflows.
90+ new plugins: Connect to Jira, GitLab, Microsoft Suite, and more for richer context.
Memory and automations: Codex can remember your preferences and schedule future work.

Why It Matters: Codex is moving beyond writing code to becoming a full development partner. By working across your whole workflow - code, browser, apps, and docs - it reduces context switching and helps developers ship faster.

Read the full report

GOOGLE LAUNCHES GEMINI 3.1 FLASH TTS: EXPRESSIVE AI SPEECH WITH FINE CONTROL

What's Happening: Google released Gemini 3.1 Flash TTS, a new text-to-speech model in public preview. It supports 70+ languages and lets you control tone, style, and pacing using simple audio tags.

Report Includes:

200+ audio tags: Add tags like [whispers], [fast], or [excitement] directly in your text to shape the output.
High quality: Ranks first on public TTS leaderboards with Elo score of 1211.
SynthID watermark: All output includes a hidden mark to identify AI-generated audio.
Easy start: Available now in Google AI Studio and Vertex AI.

Why It Matters: Great voice output is key for accessible apps, audiobooks, and enterprise alerts. Gemini 3.1 Flash TTS gives developers fine-grained control without complex setup - a big step toward natural, customizable AI speech.

Read the full report

ANTHROPIC ADDS ROUTINES TO CLAUDE CODE: AUTOMATE WORK WHILE YOU SLEEP

What's Happening: Anthropic launched Routines in Claude Code (research preview). A routine is a saved automation - prompt, repo, and connectors - that runs on a schedule, via API, or in response to events like GitHub PRs.

Report Includes:

Three trigger types: Scheduled (hourly, nightly), API-triggered, or GitHub webhook-based.
Runs in the cloud: No need to keep your laptop open.
Common uses: Triage bugs nightly, verify deploys, auto-review PRs, or port code between languages.
Usage limits: Pro users get 5 routines/day, Max gets 15, Team/Enterprise gets 25.

Why It Matters: Automation is powerful, but setting up cron jobs and infrastructure is tedious. Routines package Claude Code automations into simple, reusable workflows. This lowers the barrier to building reliable, always-on agent systems.

Read the full report

OPENAI INTRODUCES GPT-ROSALIND FOR LIFE SCIENCES RESEARCH

What's Happening: OpenAI launched GPT-Rosalind, a model built for biology, drug discovery, and medical research. It helps scientists review literature, plan experiments, and analyze complex data.

Report Includes:

Built for science: Better at chemistry, protein design, genomics, and using scientific tools in multi-step workflows.
Strong results: Leads on BixBench, a real-world bioinformatics benchmark. Outperforms GPT-5.4 on 6 of 11 LABBench2 research tasks.
Trusted access: Available to qualified U.S. enterprise customers through a safety review process.
Free plugin: A Life Sciences Research Plugin for Codex connects to 50+ scientific databases and tools.

Why It Matters: Drug discovery takes 10-15 years. AI that helps scientists move faster through early research could speed up breakthroughs. GPT-Rosalind shows how domain-specific models can add real value in high-stakes fields.

Read the full report

MOZILLA ANNOUNCES THUNDERBOLT: OPEN-SOURCE, SELF-HOSTED AI CLIENT

What's Happening: Mozilla's MZLA subsidiary announced Thunderbolt, an open-source AI client you can self-host. It gives organizations full control over their data, models, and AI infrastructure.

Report Includes:

Bring your own models: Connect to commercial APIs, open-source models, or local deployments.
Enterprise integrations: Works with deepset's Haystack for agent orchestration and RAG pipelines.
Cross-platform: Native apps for Windows, macOS, Linux, iOS, and Android.
Security first: Self-hosted option, optional end-to-end encryption, and device-level controls.

Why It Matters: Many organizations want AI benefits without giving up control. Thunderbolt offers a sovereign alternative to cloud-only AI clients. By being open-source and self-hostable, it lets teams build AI systems that fit their security and compliance needs.

Read the full report

Thanks for reading.

See you next week with more AI agent updates.

— Rakesh's Newsletter

Claude Opus 4.7 Just Landed and It Is Changing the Game for Hard Coding Tasks

ANTHROPIC LAUNCHES CLAUDE OPUS 4.7: BETTER CODING, SHARPER VISION, SMARTER WORK

OPENAI EXPANDS TRUSTED ACCESS FOR CYBER DEFENSE

QWEN RELEASES QWEN3.6-35B-A3B:AN OPEN-SOURCE FOR ALL YOUR NEEDS

OPENAI UPDATES CODEX: NOW WORKS ACROSS YOUR COMPUTER AND TOOLS

GOOGLE LAUNCHES GEMINI 3.1 FLASH TTS: EXPRESSIVE AI SPEECH WITH FINE CONTROL

ANTHROPIC ADDS ROUTINES TO CLAUDE CODE: AUTOMATE WORK WHILE YOU SLEEP

OPENAI INTRODUCES GPT-ROSALIND FOR LIFE SCIENCES RESEARCH

MOZILLA ANNOUNCES THUNDERBOLT: OPEN-SOURCE, SELF-HOSTED AI CLIENT

Keep Reading

Get the Free Tech & AI Newsletter

Quick Links

Subscription

Socials