It's Monday, May 11th: Anthropic cuts agentic blackmail rates from 96% to zero by teaching Claude why an action aligns with its values. Plus: China's national chip fund just bet $7B on DeepSeek.

The top AI stories from last week, filtered for what will help you stay in the know.

1️⃣ TRAINING METHODS: Anthropic's "Teach Why" Method Drops Claude's Blackmail Rate from 96% to Zero

Read the full post by Anthropic.

Anthropic's alignment team dropped Claude Haiku 4.5's blackmail rate from 96% to zero by training the model to reason about its values, not imitate correct behavior.

  • Anthropic put Claude in scenarios where a user faced an ethical dilemma and trained the model to talk through the values at stake.

  • The May 8 research hit the same alignment gains as Anthropic's old approach with 28× fewer training tokens.

  • The new technique also worked on situations the team hadn't trained for, which the old approach struggled with.

Anthropic's streak of transparency in misalignment research raises questions that the entire frontier model industry will need to answer. The next round of safety reports from OpenAI and Google will show who's ready to answer them.

Our Perspective

2️⃣ CAPITAL RAISED: DeepSeek Targets $7.35B First-Ever Raise at $50B Valuation, Led by State Chip Fund

Image Credit: Getty Images

DeepSeek is raising up to 50 billion yuan ($7.35 billion) in its first-ever external raise. The round would value the Chinese AI lab at roughly ~$50 billion USD and would make it the largest raise by any Chinese AI company to date.

Beijing's chip fund leading a model lab round is the real signal here. China is no longer treating frontier AI and semiconductor independence as two separate projects, and the open-weight performance ceiling now has $7B of state-backed runway behind it.

Our Perspective

✍️ REGISTER NOW: Ship LinkedIn Videos in Under 1 Hour

In our recent flagship session with Eileen Wu and Descript CEO Laura Burkhauser, there was strong interest in a more hands-on workshop for tech leaders to create video content quickly.

This next virtual session on Thursday, May 21 is designed as that next step. Join Eileen Wu (AI Collective, Mavryx) and Trevor Howell (Descript) at 12 PM PDT / 3 PM EDT.

You’ll leave with:

  • A repeatable setup that takes the busywork out of every video

  • Cleaner audio and tighter cuts without switching between tools

  • A version of you on camera that still sounds like you

  • A finished video ready to publish

If you missed our flagship conversation, you can find more takeaways here.

Join us to learn the workflow leaders are using to ship a LinkedIn video in under an hour.

📰 Other Headlines

Your breakdown of what’s happening in AI this week, from Noah Frank ⚡️

🪙 Spotlight On: Layoffs at Coinbase

Coinbase laid off 14% of its workforce last week. Its CEO told staff the cuts were possible because engineers can now ship in days what used to take a team weeks. The same week, Challenger, Gray & Christmas reported over 83,000 US job cuts in April, with roughly ¼ of them attributed by employers to AI.

Yet despite all of the concern (and soaring stock prices) the data doesn’t necessarily justify the narrative that AI is imminently coming for your job.

For example, MIT's David Autor has spent decades measuring what happens when automation moves through a workforce, and his finding is consistent across periods and industries: the count of tasks automated matters less than what happens to the tasks that remain. For instance, between 1977 and 2018, automation removed 64.5% of routine tasks from job descriptions while adding 75.6% new ones. In this way, roles get recomposed around what the worker can now do that they couldn't before.

Microsoft’s Transformation Paradox concept.

The Microsoft Work Trend Index has a name for what's happening, what they call the Transformation Paradox. Of the 20,000 AI users surveyed across ten countries, 58% said they're producing work they couldn't have done a year ago, rising to 80% among the most advanced users. But only one in four said their leadership is aligned on what AI is for, and only 13% said they're rewarded for redesigning their work with AI even when results don't immediately follow. The paradox is that the same forces accelerating individual AI adoption are holding back its institutional payoff. That's the gap that explains why most enterprise AI deployments haven't shown ROI even as individual workers report transformation. Boris Cherny at Anthropic told CNN that by the end of the year, the term "software engineer" probably won't survive. The job becomes "builder," because writing code is the smaller part of what building software now means.

So what about the 21,490 cuts in April? We don't know yet how many are real AI displacement and how many are convenient attribution during a macro environment that was always going to produce layoffs. Stock markets reward AI-driven cost stories. A CEO citing "AI efficiency" gets credit for vision, but at the same time, a CEO citing "we over-hired in 2022" gets credit for nothing.

What do you think: are these signs that AI layoffs are real, or inflated expectations?

🚀 Humans in AI Week is coming.

This June, AIC is hosting 100+ events in one week, all built around a single question: what does it mean to be human in the AI era? It's the largest human-centered AI gathering we've ever run, across every chapter, on six continents.

Read the announcement, and pledge your voice below.

Say user_id. Get user_id.

Wispr Flow recognizes variable names, file references, and framework syntax mid-dictation. Speak your prompt, get developer-ready text for GitHub, Jira, or your editor. No mangled syntax. Ever.

🫵 Do You Belong on Our Newsletter?

Share your message with the world’s largest AI community. To inquire about partnership availability, reach out to our team below.

The AI Collective is a community of volunteers, made for volunteers. All proceeds directly fund future initiatives that benefit this community.

Before You Go…

Connect With Us on Socials

Get Involved in Your Community

Thank you to the thousands of volunteers around the world who make this work possible. We truly could not do this without you.

About the Authors

Noah is a researcher, innovation strategist, and ex-founder thinking and writing about the future of AI. His work and body of research explores the economics of emerging technology and organizational strategy.

About Joy Dong

Joy is a news editor, writer, and entrepreneur at the intersection of AI and blockchain. Whether she is demystifying complex systems in her newsletter, TEA, or building streamlined solutions through her automation agency, Ownly, Joy’s mission is to make emerging tech accessible and actionable for everyone.

Add Your Thoughts

Avatar

or to participate

Keep Reading