Latent Space: The AI Engineer Podcast

92 episode briefs available

Jailbreaking AGI: Pliny the Liberator & John V on AI Red Teaming, BT6, and the Future of AI Security

[State of Code Evals] After SWE-bench, Code Clash & SOTA Coding Benchmarks recap — John Yang

[Latent Space LIVE @ NeurIPS] State of AI Startups 2025 — with Sarah Catanzaro, Amplify Partners

DevDay 2025: Apps SDK, Agent Kit, MCP, Codex and why Prompting is More Important than Ever

The PhD Student & Professor Reinventing AI: Fei-Fei Li & Justin Johnson on Spatial Intelligence

[State of Post-Training] From GPT-4.1 to 5.1: RLVR, Agent & Token Efficiency — Josh McGrath, OpenAI

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

[State of Evals] LMArena's $1.7B Vision — Anastasios Angelopoulos, LMArena

After LLMs: Spatial Intelligence and World Models — Fei-Fei Li & Justin Johnson, World Labs

Taste is your Moat (Dylan Field of Figma)

One Year of MCP — with David Soria Parra and AAIF leads from OpenAI, Goose, Linux Foundation

⚡️ Agents, Workflows, and Python: Malte Ubl Unpacks Vercel's AI Strategy at Ship AI

⚡ Inside GitHub’s AI Revolution: Jared Palmer Reveals Agent HQ & The Future of Coding Agents

⚡️ Ship AI recap: Agents, Workflows, and Python — w/ Vercel CTO Malte Ubl

[State of Code RL] Cursor Composer, OpenAI o3/GPT-5, and Reasoning — Ashvin Nair, Cursor

Anthropic, Glean & OpenRouter: How AI Moats Are Built with Deedy Das of Menlo Ventures

SAM 3: The Eyes for AI — Nikhila & Pengchuan (Meta Superintelligence), ft. Joseph Nelson (Roboflow)

⚡ Inside Google Labs: The AI Coding Agent You Haven't Heard About — Jed Borovik, Google

How Frontier AI + Virtual Biology Can Help Us Cure All Diseases

The VC Who Built a Podcast to Hire CROs and Now Fix Pricing for Enterprises — Joubin Mirzadegan

[NeurIPS Best Paper] 1000 Layer Networks for Self-Supervised RL — Kevin Wang et al, Princeton

The Agents Economy Backbone - with Emily Glassberg Sands, Head of Data & AI at Stripe

⚡️ 10x AI Engineers with 10x Salaries — Alex Lieberman & Arman Hezarkhani, Tenex

Steve Yegge's Vibe Coding Manifesto: Why Claude Code Isn't It & What Comes After the IDE

ChatGPT Codex: The Missing Manual

Unsupervised Learning x Latent Space Crossover Special

⚡️The Rise and Fall of the Vector DB Category

Claude Code: Anthropic's CLI Agent

The Creators of Model Context Protocol

SF Compute: Commoditizing Compute

⚡️GPT 4.1: The New OpenAI Workhorse

Why Every Agent needs Open Source Cloud Sandboxes

[AIEWF Preview] Containing Agent Chaos — Solomon Hykes

⚡️CloudChef: Your Robot Chef - Michelin-Star food at $12/hr (w/ Kitchen tour!)

The Utility of Interpretability — Emmanuel Ameisen

WebSim, WorldSim, and The Summer of Simulative AI — with Joscha Bach of Liquid AI, Karan Malhotra of Nous Research, Rob Haisfield of WebSim.ai

Emulating Humans with NSFW Chatbots - with Jesse Silver

ICLR 2024 — Best Papers & Talks (ImageGen, Vision, Transformers, State Space Models) ft. Durk Kingma, Christian Szegedy, Ilya Sutskever

ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt

How to train a Million Context LLM — with Mark Huang of Gradient.ai

How AI is eating Finance — with Mike Conover of Brightwave

The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka

How To Hire AI Engineers — with James Brady & Adam Wiggins of Elicit

[High Agency] AI Engineer World's Fair Preview

State of the Art: Training >70B LLMs on 10,000 H100 clusters

Benchmarks 201: Why Leaderboards > Arenas >> LLM-as-Judge

Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI

The Winds of AI Winter (Q2 Four Wars Recap) + ChatGPT Voice Mode Preview

Segment Anything 2: Demo-first Model Development

AI Magic: Shipping 1000s of successful products with no managers and a team of 12 — Jeremy Howard of Answer.ai

Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie)

Why you should write your own LLM benchmarks — with Nicholas Carlini, Google DeepMind

Efficiency is Coming: 3000x Faster, Cheaper, Better AI Inference from Hardware Improvements, Quantization, and Synthetic Data Distillation

Language Agents: From Reasoning to Acting

From API to AGI: Structured Outputs, OpenAI API platform and O1 Q&A — with Michelle Pokrass & OpenAI Devrel + Strawberry team

The Ultimate Guide to Prompting

Building AGI in Real Time (OpenAI Dev Day 2024)

Production AI Engineering starts with Evals — with Ankur Goyal of Braintrust

Building the AI Engineer Nation — with Josephine Teo, Minister of Digital Development and Information, Singapore

Building the Silicon Brain - with Drew Houston of Dropbox

How NotebookLM Was Made

In the Arena: How LMSys changed LLM Benchmarking Forever

Agents @ Work: Dust.tt

Agents @ Work: Lindy.ai

The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic

Why Compound AI + Open Source will beat Closed AI

Bolt.new, Flow Engineering for Code Agents, and >$8m ARR in 2 months as a Claude Wrapper

Generative Video WorldSim, Diffusion, Vision, Reinforcement Learning and Robotics — ICML 2024 Part 1

Windsurf: The Enterprise AI IDE - with Varun and Anshul of Codeium AI

2024 in AI Startups [LS Live @ NeurIPS]

2024 in Vision [LS Live @ NeurIPS]

2024 in Open Models [LS Live @ NeurIPS]

2024 in Post-Transformers Architectures (State Space Models, RWKV) [LS Live @ NeurIPS]

2024 in Synthetic Data and Smol Models [LS Live @ NeurIPS]

2024 in Agents [LS Live! @ NeurIPS 2024]

Latent.Space 2024 Year in Review

AI Engineering for Art — with comfyanonymous, of ComfyUI

Beating Google at Search with Neural PageRank and $5M of H200s — with Will Bryk of Exa.ai

[Ride Home] Simon Willison: Things we learned about LLMs in 2024

Everything you need to run Mission Critical Inference (ft. DeepSeek v3 + SGLang)

The Agent Reasoning Interface: o1/o3, Claude 3, ChatGPT Canvas, Tasks, and Operator — with Karina Nguyen of OpenAI

Outlasting Noam Shazeer, crowdsourcing Chat + AI with >1.4m DAU, and becoming the "Western DeepSeek" — with William Beauchamp, Chai Research

Agent Engineering with Pydantic + Graphs — with Samuel Colvin

The AI Architect — Bret Taylor

Bee AI: The Wearable Ambient Agent

The Inventors of Deep Research

Open Operator, Serverless Browsers and the Future of Computer-Using Agents

Building Snipd: The AI Podcast App for Learning

⚡️How Claude 3.7 Plays Pokémon

⚡️The new OpenAI Agents Platform

The Agent Network — Dharmesh Shah