The Next Wave x Mindstream

This Week in AI: Your Weekly Model, Tool & Strategy Stack

Stay ahead of the AI curve with actionable prompts and strategic analysis you can use immediately.

Explore HubSpot's AI-Powered CRM, designed to turn creative AI ideas into automated workflows across marketing, sales, and service.

Free HubSpot Tools

This Week's Quick Reference

Best Model For:

Use Case	Recommended Model/Tool	Why
AI Agents (Cost-Effective)	Claude Sonnet 4.6	Opus-level intelligence at 1/5 the price - optimized for agentic tasks, beats Opus on finance/office agents
AI Agents (Platform)	OpenClaw	The viral agent everyone's using - 100K+ GitHub stars, works via Slack/Telegram/WhatsApp while you sleep
Coding (Budget-Conscious)	Claude Sonnet 4.6	Nearly matches Opus 4.6 quality but 1/3 the cost ($3 vs $5 per million input tokens)
Science & Math Problems	Gemini 3.1 Pro	Crushed benchmarks: 77.1% on Arc AGI (10 points ahead of Opus), best at research coding
Animated Graphics (SVG)	Gemini 3.1 Pro	Creates smooth, gradient-rich SVGs - best visual generation for web graphics
Voice Customer Support	ElevenLabs Agents	Emotionally aware AI that handles angry customers with realistic empathy
Ad Campaign Automation	Meta Manus (Ads Manager)	Creates ads, monitors 24/7, auto-adjusts budgets - now integrated directly in Facebook Ads Manager

The Highlights This Week:

OpenClaw creator joined OpenAI after Anthropic fumbled by restricting Claude Max usage - biggest talent acquisition story of the month
5+ major models released in ONE week - Claude Sonnet 4.6, Gemini 3.1 Pro, Grok 4.2, ByteDance Seed 2.0, Qwen variants (possibly biggest single month in AI history)
Meta rushed Manus to Ads Manager - rolled out on Telegram first (not WhatsApp, which Meta owns - make it make sense)
Mark Cuban asked the uncomfortable question - "What happens when AI tokens cost more than your employees?" ($300/day in tokens vs $1,200/month salary)

Quote of the Week:

"This is the year agents actually feel useful. This is the year where an agent feels like an employee and not like a gimmick."

— Matt Wolfe on why 2026 is fundamentally different from 2025's "watch my browser glow" agent hype

Matt's Agent Revelation:

This week Matt discovered Matthew Berman's OpenClaw setup that works overnight while he sleeps—doing research, making thumbnails, finding video talking points—then reports back via messenger in the morning with completed tasks. The shift: agents don't need to show you what they're doing anymore. They operate in the background via APIs, check their own work, and only surface when finished. That's the difference between gimmick and employee.

Anthropic Fumbled the Bag (And OpenAI Swooped In)

The Story:

Peter Steinberger created OpenClaw - the viral AI agent platform with 100K+ GitHub stars (fastest-growing project in GitHub history). Originally called "ClaudeBot," it brought massive brand awareness to Anthropic as users signed up for $200/month Claude Max plans specifically to run it.

Anthropic's Epic Mistake:

They told users: "Claude Max is for Claude Code only. Using it in OpenClaw violates terms of service."

OpenAI's Chess Move:

"Feel free to use our $200/month plan inside OpenClaw. We welcome it."

The Result:

Steinberger (who already had a $100M exit from his PDF company) chose OpenAI over Meta and Anthropic. OpenClaw continues as an independent foundation. Steinberger now builds OpenAI's next-gen agent systems.

Why This Matters:

Anthropic had free marketing, organic customer acquisition, and total brand association with the hottest agent project in the world. They killed it by being restrictive when competitors offered openness. Platform decisions matter - openness wins developers, restrictions lose them.

The Bigger Picture:

This isn't just about one hire. It signals OpenAI's agent strategy for 2026: embrace open ecosystems, let developers use your models however they want, win through distribution not restriction.

Free Prompts

Prompts For Setting Up OpenClaw

I want to set up an AI agent for [specific task - research/scheduling/monitoring]. What permissions does it need? What should I withhold? Design a secure setup with kill switches.

This Is the Year Agents Feel Like Employees (Not Browser Puppets)

2025 Agents

"Ooh, watch my browser highlight while it slowly clicks buttons I could click faster myself."

2026 Agents

Work via APIs while you sleep, check their own work, fix their own errors, report via Slack when tasks complete.

What Changed?

Self-correction capability. Old agents: "I fixed it!" (Spoiler: they didn't). New agents: Check the output, see the error, fix it again, verify it works, THEN report completion.

Real Use Cases This Week:

Matthew Berman's overnight workflow - OpenClaw does research, creates thumbnails, finds talking points, delivers completed work via messenger by morning
Meta Manus in Ads Manager - Analyzes your website, creates ad copy, generates visuals, monitors performance 24/7, adjusts budgets based on your CPA targets
ElevenLabs support agents - Handle angry customers with emotional awareness, solve problems autonomously, escalate when needed

The Economics Question

Mark Cuban broke down the math: if one agent costs $300/day in tokens ($2,400/month) but only does $1,200/month of employee work, you're overpaying 2x.

Matt's Counter

"Very short-term problem." Models are getting better AND cheaper simultaneously. Example: Claude Sonnet 4.6 delivers near-Opus results for 1/3 the cost. In 6 months, today's $300/day agent will cost $100/day with better performance.

The Productivity Multiplier

Agents work 24/7, never need morale management, don't take vacations, and increasingly catch their own mistakes. One $2,400/month agent might replace THREE $1,200/month employees if deployed correctly.

Free Prompts

Agent ROI Calculator Prompts

I'm considering an AI agent for [task currently done by humans]. Calculate: Current monthly cost (salary + benefits), estimated monthly token cost at [usage hours/day], productivity multiplier needed to break even.

The Model Blitz: 5+ Releases in One Week (Here's What Actually Matters)

The Headline

This might be the biggest single month of model releases in AI history. This week alone: Claude Sonnet 4.6, Gemini 3.1 Pro, Grok 4.2, ByteDance Seed 2.0, and multiple Qwen variants.

For Normal People

If you're using AI to write emails, find dinner spots, or brainstorm ideas - you won't notice much difference. These models have been good at that for a while.

For Power Users

The upgrades matter in science, coding, research, and agent deployment.

The Standouts:

Claude Sonnet 4.6 - The agent workhorse

1M token context window (API only - fits entire book series)
Near-Opus coding performance for 1/3 the cost
Beats Opus on finance and office agent tasks
Now the default model in free/$20 plans

Gemini 3.1 Pro - The science/visual specialist

77.1% on Arc AGI (visual logic puzzles) - 10 points ahead of Opus
Best at competitive coding and scientific research
Dramatically improved SVG generation (gradients, textures, animations)
Available now at gemini.google.com

Grok 4.2 - The "council of experts"

Uses 4 specialized sub-models (research, reasoning, critic, writer)
They debate internally, deliver consensus answer
No big announcement - Elon just tweeted it
Similar to "mixture of experts" architecture but more transparent

Matt's Take

"Better, faster, cheaper, smarter" all happening simultaneously. For Joe Schmo users, stick with whatever LLM has the best vibes. For developers and agent builders, this week changed the economics dramatically.

Free Prompts

Model Selection Prompts

I need to [describe task]. Compare Claude Sonnet 4.6 vs Gemini 3.1 Pro vs ChatGPT 5.3 for this specific use case. Consider: cost per task, quality of output, speed, context limits. Recommend the best option.

OpenClaw | The AI Agent Platform (Finally Feels Useful)

What It Is: Open-source AI agent framework that lets you interact with your AI via Slack, Telegram, WhatsApp, or Discord - treating it like a remote team member instead of a chat interface. Give it tasks, it completes them while you're offline, reports back when done.

Why Everyone's Obsessed: Hit 100K+ GitHub stars (fastest-growing project in GitHub history). Users are running agents overnight that do research, create content, monitor systems, and deliver completed work by morning - not watching browsers slowly click buttons.

The Drama: Originally called "ClaudeBot," Anthropic restricted its use, so creator Peter Steinberger joined OpenAI instead. Now OpenAI-backed but remains open-source and model-agnostic (works with Claude, GPT, Gemini).

Real Use Case: Matthew Berman's agent finds video topics, researches them, generates thumbnails, extracts talking points - all while he sleeps. He wakes to a Slack thread of completed deliverables.

The Catch: You can give it access to emails, calendars, passwords, API keys - but with great power comes great "please don't accidentally give your agent your bank account access."

Free Prompts

Prompts to Try

Design a safe OpenClaw workflow for [low-risk task - daily news monitoring/scheduled reporting]. Include: what it can access, what requires my approval, how it reports results, kill switch conditions.

Claude Sonnet 4.6 | The Agent-Optimized Model (Opus Performance, 1/3 the Cost)

What It Does: Anthropic's latest Sonnet model delivers near-Opus-level intelligence with a 1 million token context window (entire book series fits) but costs dramatically less to run. Specifically optimized for agentic tasks - actually outperforms Opus 4.6 on finance agents and office task agents.

The Economics Breakthrough:

Input tokens: $3 per million (vs Opus $5)
Output tokens: $15 per million (vs Opus $25)
Translation: Run the same agent workflows for 1/3 the cost with nearly identical quality

Why This Matters for Agents: Remember Mark Cuban's question about token costs exceeding employee salaries? This model cuts that cost by 66% overnight. Your $300/day agent just became a $100/day agent with the same capabilities.

What's Available Where:

Free & $20/month plans: Now the default model (you're already using it)
API developers: Get the 1M token context window for complex workflows
Coding benchmarks: 79.6% (Opus is 80.9% - basically tied)

The Strategic Play: Anthropic is making premium intelligence accessible at Sonnet prices because they want developers building agents on Claude, not OpenAI. After losing OpenClaw's creator, this is their counter-move.

Free Prompts

Prompts to Try

I'm uploading [entire document/multiple reports/full conversation history]. Read everything, then: extract the 10 most critical decisions made, identify 5 contradictions or gaps, recommend 3 action items with supporting quotes.

ElevenLabs Agents for Support | Voice AI That Gets Your Customers Are Mad

What It Does: Voice-based AI customer support that's "emotionally and contextually aware" - meaning it detects when customers are frustrated and adjusts tone, offers better solutions, and knows when to show empathy vs when to move efficiently through problem-solving.

The Demo That Impressed: Customer stuck in Orlando, flight canceled 3 times, missing daughter's birthday. Agent: "3 cancellations on one trip. That's yeah, that's not the experience you should be having. I'm gonna fix this, okay?" Then honestly assessed: "I can book you tonight but that route's been a mess all week and I wouldn't trust it. The other option gets you home by noon tomorrow."

What You Can Customize:

Load your SOPs (standard operating procedures)
Define different agent personalities for different scenarios
Set guardrails for what agents can/can't promise
Integrate with your existing support systems

Matt's Prediction: "I guarantee we're gonna hear stories about people getting free cars and flights just because they used a certain word pattern that got around the guardrails." (See: the Chevy chatbot that gave away a Tahoe for free.)

Joe's Forecast: "Testing guardrails" will become people's new pastime.

Free Prompts

Prompts to Try

Design a customer support agent for [your business]. Create scenarios for: angry customer, confused customer, refund request, technical problem. For each, define: tone to use, solutions it can offer autonomously, when to escalate to humans.

Platform Openness Wins (Anthropic's $100M Lesson in How to Lose Developers)

What Happened: Anthropic had everything going for them with OpenClaw - free brand awareness, organic customer acquisition (users signing up for Claude Max specifically to run it), and total association with the hottest agent project in the AI world. Then they threatened users for using their $200/month plan "incorrectly." OpenAI said "we welcome developers," and poached both the creator and the mindshare.

The Sequence:

ClaudeBot goes viral → named after Claude → brings Anthropic customers
Anthropic restricts usage → violates ToS warnings
OpenAI says "use our platform however you want"
Creator joins OpenAI → $500B+ company just won the agent platform race

What This Reveals: Developer loyalty goes to platforms that enable, not restrict. When open alternatives exist, restrictive ToS policies backfire catastrophically. OpenAI learned from past mistakes (remember when they tried to ban competitive products?) - now they're weaponizing openness.

What This Means For You: Before building critical workflows on any platform, check their stance on third-party integrations and API usage. Platforms that welcome creative implementation scale faster than platforms that restrict it. When choosing between similar-quality tools, bet on the one that won't threaten you for using it creatively.

The Agent Economics Question Everyone's Avoiding (Until Now)

The Uncomfortable Math: Mark Cuban put numbers to what everyone's thinking: If you're spending $300/day on AI tokens to run an agent ($2,400/month), but that agent only does work equivalent to a $1,200/month employee, you're paying 2x for the same output. Factor in developer costs ($200/day maintenance), and you're at $2,600/month vs $1,200/month human. Does the agent need to be 2.16x more productive just to break even?

Matt's Reality Check: "Very short-term problem."

The Counter-Evidence:

Claude Sonnet 4.6 just dropped costs by 66% (from $5 to $3 per million input tokens)
It delivers near-Opus quality while being 1/3 the price
Models are getting cheaper WEEKLY, not yearly
6 months from now, today's $300/day agent becomes a $100/day agent with better performance

The Productivity Multiplier Cuban Didn't Mention:

Agents work 168 hours/week (humans work 40)
No vacation, no sick days, no morale management
Increasingly self-correct before reporting (catching their own mistakes)
Can run multiple parallel workflows simultaneously

Real Calculation: One $2,400/month agent working 24/7 with 90% accuracy might genuinely replace three $1,200/month employees working 9-5. The math flips when you account for uptime and self-management.

The Strategic Question: It's not "Can I afford agents?" It's "Can I afford NOT to have agents when my competitors are running 24/7 operations?"

What This Means For You: Run your own numbers. Don't blindly trust VC hype OR skeptical takes. Test one agent on one workflow for 30 days, track actual token costs and time saved, then decide. The answer varies wildly by use case - research agents might save you 20 hours/week, while customer support agents might cost more than humans (for now).

Video

Listen To The Full Episode

Hear Matt Wolfe and Joe Fier break down all the drama, technical details, and strategic implications. From Anthropic's fumble losing OpenClaw to the uncomfortable economics of AI agents costing more than humans (and why that won't last), this episode cuts through the hype with actual numbers and real-world testing.

AI Agents Are Now Hiring Humans. We Need to Talk

Mindstream

The AI landscape changes faster than any human can track. Subscribe to Mindstream's newsletter to get AI news delivered straight to your inbox every single day.

This Week in AI: Your Weekly Model, Tool & Strategy Stack

Explore HubSpot's AI-Powered CRM, designed to turn creative AI ideas into automated workflows across marketing, sales, and service.

This Week's Quick Reference

Best Model For:

The Highlights This Week:

Quote of the Week:

Matt's Agent Revelation:

Hot Topics This Week

Anthropic Fumbled the Bag (And OpenAI Swooped In)

Prompts For Setting Up OpenClaw

This Is the Year Agents Feel Like Employees (Not Browser Puppets)

Agent ROI Calculator Prompts

The Model Blitz: 5+ Releases in One Week (Here's What Actually Matters)

Model Selection Prompts

Featured Tools

OpenClaw | The AI Agent Platform (Finally Feels Useful)

Prompts to Try

Claude Sonnet 4.6 | The Agent-Optimized Model (Opus Performance, 1/3 the Cost)

Prompts to Try

ElevenLabs Agents for Support | Voice AI That Gets Your Customers Are Mad

Prompts to Try

Strategic AI Shifts

Platform Openness Wins (Anthropic's $100M Lesson in How to Lose Developers)

The Agent Economics Question Everyone's Avoiding (Until Now)

Listen To The Full Episode

Mindstream

Blogs

Blogs

Marketing

Sales

Service

Website

Instagram Marketing

Customer Retention

Email Marketing

SEO

Sales Prospecting

Newsletters

Newsletters

The Hustle

Marketing Against the Grain

The Science of Scaling

Mindstream

Videos

Videos

The Hustle

Marketing with HubSpot

My First Million

Marketing Against the Grain

HubSpot

The Next Wave

The Science of Scaling

Resources

Resources

Academy

Templates

Ebooks

Kits

Tools

Podcasts

HubSpot Products

The HubSpot Customer Platform

Overview of all products

Marketing Hub

Sales Hub

Service Hub

Content Hub

Data Hub

Commerce Hub

About HubSpot

Contact Us

Customer Support

Log in

日本語

Deutsch

English

Español

Português

Français

This Week in AI: Your Weekly Model, Tool & Strategy Stack

Explore HubSpot's AI-Powered CRM, designed to turn creative AI ideas into automated workflows across marketing, sales, and service.

This Week's Quick Reference

Best Model For:

The Highlights This Week:

Quote of the Week:

Matt's Agent Revelation:

Hot Topics This Week

Anthropic Fumbled the Bag (And OpenAI Swooped In)

Prompts For Setting Up OpenClaw

This Is the Year Agents Feel Like Employees (Not Browser Puppets)

Agent ROI Calculator Prompts

The Model Blitz: 5+ Releases in One Week (Here's What Actually Matters)

Model Selection Prompts

Featured Tools

OpenClaw | The AI Agent Platform (Finally Feels Useful)

Prompts to Try

Claude Sonnet 4.6 | The Agent-Optimized Model (Opus Performance, 1/3 the Cost)

Prompts to Try

ElevenLabs Agents for Support | Voice AI That Gets Your Customers Are Mad

Prompts to Try

Strategic AI Shifts

Platform Openness Wins (Anthropic's $100M Lesson in How to Lose Developers)

The Agent Economics Question Everyone's Avoiding (Until Now)

Listen To The Full Episode

Mindstream