a16z Podcast

What Is an AI Agent?

Overview

Content

Defining AI Agents

* There's significant disagreement about what constitutes an "agent" in AI, with definitions ranging from simple to highly complex: - Simplest definition: A clever prompt or chat interface with a knowledge base - Most extensive definition: Near-AGI with ability to persist over time, learn independently, and work on complex problems

* The term "agent" is described as "nebulous" and "overloaded," potentially being more of a marketing concept than a precise technical category

* Key definitional elements of agents potentially involve: - An LLM running in a loop - Tool use capabilities - Ability to make decisions - Potential for planning - Dynamic decision tree capabilities

* Agentic behavior exists on a spectrum rather than as a binary classification: - Differs from simple API calls or single-prompt interactions - Involves more complex reasoning and decision-making - Simple translation or API calls likely not considered "agents" - More complex routing or multi-step reasoning approaches feel more "agentlike"

Current State and Limitations

* Most current AI "agents" likely represent early, rudimentary implementations, described as more like "weekend demos" than mature technologies

* A variety of agent types exist, including artistic agents, coding agents, and LLM wrappers

* Two emerging interface specializations: - Tight feedback loop interfaces (like Cursor) emphasizing immediate interaction - Backend systems focused on independent agent operation

* Technical challenges remain significant: - Non-determinism in LLM outputs is a major unsolved problem - Incorporating LLM outputs into program control flow is complex - Data access and integration remain challenging - Current multimodal limitations (visual and web-based interactions are clunky)

AI Agents and Human Replacement

* There's a marketing narrative positioning AI agents as cost-effective replacements for human workers, but the actual impact is more nuanced

* AI tends to augment rather than completely replace human labor, with potential outcomes including: - Two humans potentially replaced by one more productive human with AI - Keeping existing employees but increasing productivity - Slowing net new human hiring rather than direct replacement

* Most jobs involve fundamental creative work that AI cannot fully replicate: - AI systems still require human initiation and prompting - AI lacks true human decision-making and creative intent - Someone must still "push the button" and provide direction

* Two types of agents exist: - Agents that work with/replace human tasks - Low-level system agents that interact with each other

Agents vs. Functions

* From an external perspective, an AI agent and a traditional software function can appear indistinguishable

* AI models introduce new characteristics to functions: - Pre-trained and can be fine-tuned, making them more easily sharable - Models take up much of the functionality within a function - Development infrastructure will likely evolve around these new characteristics

* Philosophical perspective: Humans can be conceptualized as "functions" that respond to inputs - Examples like Mechanical Turk illustrate how humans currently serve function-like roles - Creative work and human input are not easily replaceable, even in seemingly simple tasks

Pricing and Economic Considerations

* Current AI agent pricing models are still emerging and undefined: - Most use value-based pricing or cost-plus pricing models - Companies charging based on GPU running costs with a modest premium - Many AI companies are still uncertain about the exact value they're generating

* Traditional infrastructure pricing models that could apply: - Per-seat pricing for human-used services - Usage-based pricing for machine-to-machine services

* Specific pricing examples emerging: - AI companion services experimenting with per-response charging - Some services charge by tokens or interactions - Potential for flat monthly fee models

* Over time, technology costs tend to converge toward marginal production costs - Current AI services (like translation) cost dramatically less than human equivalents

Architecture and Development Trends

* AI agent architecture shares similarities with SaaS software: - LLM infrastructure likely to remain specialized and external - State management typically handled externally (like databases) - Core agent logic can be relatively lightweight and run on minimal compute

* Predicted winners will be specialists building on or fine-tuning foundational models: - Importance of pushing model distributions through novel data, workflows, and aesthetics - Example: Image generation models have limited style ranges, creating opportunities for specialized creators

* OpenAI and other companies are starting to verticalize and create specific products for specific use cases - Code-related AI tools showing clearer ROI, making pricing more straightforward - Goal is to move from selling a product to selling a solution

Data Access and Future Challenges

* Companies often create data silos and walled gardens to restrict automated access: - Consumer companies have incentives to keep data private and maintain user engagement - Concerns about how AI agents will interact with existing data ecosystems

* Websites are developing more complex anti-agent CAPTCHA systems - Data providers may actively resist AI agent access - Historical precedent exists (e.g., Gmail ads controversy) of companies adapting to protect data

* Advanced AI models might eventually enable agents to: - Browse websites - Log in as humans - Access previously restricted systems - Potentially bypass authentication mechanisms

Future Outlook

* Multimodal training (incorporating clicks, web navigation, drawing, vector art) could unlock new agent capabilities

* Within two years, agents could potentially: - Use most tools a human can access - Reason across fragmented data sources - Perform tasks more efficiently by accessing personal data repositories

* The term "agent" might become obsolete in 2-5 years, which would indicate successful integration

* Philosophical perspective on AI's societal impact: - Rejects binary narrative of AI as either utopian or dystopian - Advocates viewing AI as a "normal technology" like water, electricity, or internet - Emphasizes AI as a tool to help and enhance human capabilities

* Developing true digital agents that mimic human capabilities is likely a 10-year challenge

More from a16z Podcast

Explore all episode briefs from this podcast

View All Episodes →

Listen smarter with PodBrief

Get AI-powered briefs for all your favorite podcasts, plus a daily feed that keeps you informed.

Download on the App Store