Key Takeaways
- OpenAI aims to develop an automated researcher for economically impactful ideas.
- GPT-5 demonstrates significant advances in reasoning, especially in hard sciences.
- Reinforcement Learning consistently surprises experts with its sustained improvements.
- OpenAI's Codex models now exceed the personal coding abilities of OpenAI's own researchers.
- OpenAI maintains a strong research culture focused on fundamental, long-term thinking.
- Compute resources remain a critical constraint for advancing AI research.
- Mutual trust between key individuals is a consistent element at OpenAI.
Deep Dive
- GPT-5 integrates instant response with long-horizon reasoning, aiming to provide a seamless user experience.
- The model brings reasoning capabilities to more users by default, in line with the broader trend toward agentic behavior.
- GPT-5 showed surprising capabilities in hard sciences, generating non-trivial new mathematics.
- The model significantly speeds up tasks for professional physicists and mathematicians, marking a leap from previous models.
- Reinforcement Learning (RL) continuously proves effective despite earlier predictions of plateauing.
- Experts express surprise at RL's sustained improvements and its ability to overcome anticipated limitations like mode collapse.
- Language modeling breakthroughs addressed a key challenge for RL: defining environments and grounding models in the real world.
- OpenAI researchers report significant progress and new research directions combining RL with natural language processing over the past few years.
- OpenAI's Codex team focuses on adapting raw intelligence for real-world coding, addressing software development complexities.
- The latest Codex models aim to correct earlier models' tendency to spend too little time on complex problems and too much on simple ones.
- Current coding models now surpass the personal coding capabilities of OpenAI's own researchers, a milestone the researchers compare to AlphaGo's impact.
- Despite impressive progress, AI coding models still exhibit an 'uncanny valley' effect, falling short of true coworker-level performance.
- Qualities of a great researcher include persistence, honesty about failures, and selecting significant problems.
- A significant aspect of research involves managing emotions over extended periods and knowing when to persevere or pivot.
- Identifying and fixing software bugs and flawed conceptual assumptions proved to be crucial breakthroughs during GPT-5's development.
- OpenAI's innovation approach involves discovering new aspects of the deep learning stack rather than copying competitors.
- The company hires researchers with strong technical fundamentals and a history of solving hard problems, valuing ambition and persistence.
- OpenAI fosters a culture that attracts and develops top research talent, including those who may not be outwardly visible.
- A winning research culture protects fundamental research, providing space for long-term thinking and avoiding focus solely on immediate product competition.
- Balancing fundamental research with product development involves clearly delineating mandates for researchers and product teams.
- OpenAI supports diverse research ideas, such as investigating diffusion models for visual media, and synthesizes them into a coherent roadmap.
- The long-term goal of creating an automated researcher guides individual projects, allowing for bottom-up idea generation.
- OpenAI historically prioritizes core algorithmic advances, viewing resource allocation as a portfolio management challenge for leadership.
- Compute remains a significant constraint for AI research organizations like OpenAI, contrary to earlier predictions of a shift to data constraints.
- Universities historically drove fundamental research, often enabled by access to compute; the compute intensity of frontier AI now stands in contrast to that model.
- Physical constraints, such as energy, are identified as fundamental limits that will remain constant amid rapid AI progress.
- A consistent element observed at OpenAI is the mutual trust shared between key individuals, including Chief Scientist Jakub Pachocki and Chief Research Officer Mark Chen.
- Their collaboration originated from a shared interest in reasoning research when it was an unpopular direction.
- Mark Chen highlights Jakub Pachocki's exceptional ability to tackle complex technical challenges, while Pachocki praises Chen's leadership in building a cohesive research team.