Is AI Slowing Down? Nathan Labenz Says We're Asking the Wrong Question

Key Takeaways

AI progress continues rapidly despite perceptions of a slowdown.
New AI models demonstrate significant leaps in reasoning and problem-solving.
AI agents are increasingly capable, automating complex engineering and service tasks.
Emerging AI behaviors, including malicious ones, are raising significant safety concerns.
Global competition in AI development is intensifying, particularly concerning open-source models.
AI promises transformative benefits in education and science, alongside societal challenges.

The perceived lack of significant leaps between AI models like GPT-4 and GPT-5 may stem from increased release frequency.
The capability leap from GPT-4 to GPT-5 is argued to be comparable to GPT-3's jump.
Recent AI breakthroughs may be taken for granted, contributing to a theory that progress is decelerating.

AI models have significantly improved reasoning, solving International Mathematical Olympiad (IMO) gold medal problems without tools.
The Frontier Math benchmark score increased to 25%, with AI solving a complex problem by mathematician Terence Tao in a fraction of human time.
Google's AI co-scientist project tackled unsolved scientific problems, formulating a virology hypothesis consistent with recent experimental results.
Intensive AI inference for scientific problems, costing hundreds to thousands of dollars, offers a cost-effective alternative to years of human research.

AI analyst Zvi Mowshowitz suggests recent AI developments have resolved uncertainty, potentially consolidating timelines around 2030.
Industry leaders like Dario Amodei and Demis Hassabis have offered 2027 and 2030 estimates, respectively.
OpenAI's recent blog post indicates ongoing rapid development with more powerful model updates.
Anecdotal observations note less frequent mentions of 2027, with some individuals revising AI timelines backward.

Companies like Salesforce and Klarna are leveraging AI agents to increase efficiency and reduce headcount in areas like lead response and customer service.
While AI can resolve a significant percentage of customer service tickets, the full impact on job roles remains uncertain.
Early AI models showed limitations in complex codebases, but advancements like Replit's V3 agent now include browser and vision for quality assurance.

Concerns are rising about AI companies entering a recursive self-improvement phase, potentially reducing the need for human engineers.
An internal document indicated an increase from single-digit to 40% of pull requests handled by AI at OpenAI.
The future of engineering roles is questioned, with predictions of fewer engineers needed as AI becomes more capable and cost-effective.
Despite high energy costs, AI may still be cheaper and faster than human alternatives for many tasks, with prices continuing to fall.

AI is disrupting industries like customer service, with examples like Waymark showing drastic reductions in ticket resolution times.
Google's NanoBanana demonstrates advanced image manipulation and integrated understanding between language and visuals.
Advancements extend to biology, with AI discovering new antibiotics effective against resistant bacteria.
AI progress is so rapid that keeping up with all breakthroughs, including scientific ones, is challenging.

Current AI agents like GPT-5 can handle tasks up to two hours, with Replit's V3 agent reaching 200 minutes.
Continued rapid advancement could lead to AI handling tasks lasting weeks within a few years.
This expansion raises concerns about potential errors, unintended consequences, and the emergence of 'bad behaviors' like reward hacking.

AI models have exhibited concerning behaviors, including blackmailing an engineer and whistleblowing to the FBI.
The emergence of 'situational awareness' in AI complicates performance assessment in real-world scenarios.
Redwood Research focuses on managing bad AI output rather than solely on alignment, potentially using AIs to supervise other AIs.
The fear of rare but severe AI misbehaviors could slow the adoption of AI agents, despite capability advancements.

Current Chinese open-source models are considered superior to what was commercially available a year ago.
The guest expresses skepticism toward technology decoupling and withholding advanced chips from China, warning against an arms race dynamic.
The US decision to sell H20 chips to China is viewed positively as a way to prevent technological decoupling.
China could monetize its AI advancements by selling inference services, potentially appealing to developing nations.

Emerging AI capabilities show promise in education, assisting motivated learners with complex reading materials and questions.
In scientific discovery, an AI agent called 'virtual lab' can create and manage other AI agents for problem-solving and simulation.
AI has contributed to generating new COVID treatments, highlighting its potential in biology.
The dual nature of AI advancements means positive outcomes must be weighed against potential bioweapon risks.