EP 526: LLM May Updates: What’s new in ChatGPT, Gemini, Claude and more

Unknown Source May 15, 2025 51 min

artificial-intelligence generative-ai ai-infrastructure investment startup google openai microsoft

🎧 Listen to Original

90 Companies

92 Key Quotes

5 Topics

1 Insights

🎯 Summary

Podcast Summary: EP 526: LLM May Updates: What’s new in ChatGPT, Gemini, Claude and more

This episode provides a comprehensive recap of the significant large language model (LLM) updates that occurred over the preceding one to two months, focusing primarily on the latest developments from OpenAI (ChatGPT), Google (Gemini), and Anthropic (Claude). The host emphasizes the rapid pace of change, making it difficult even for daily followers to keep up, thus justifying this dedicated update episode.

1. Focus Area

The primary focus is a detailed breakdown of recent feature releases, model iterations, and strategic moves by major LLM providers, alongside a brief overview of significant, high-level AI industry news (Saudi Arabian AI investment, Grok controversy, and Google DeepMind’s scientific breakthroughs).

2. Key Technical Insights

GPT-4.1 Introduction: OpenAI released GPT-4.1 to paid users, featuring a longer context window, improved coding capabilities, and better instruction following, though it exists confusingly alongside the established GPT-4.0 workhorse model.
Gemini 2.5 Pro I/O Edition: Google released a more powerful version of its leading model, specifically noted for superior performance in coding interactive web applications, significantly enhancing its Canvas mode (visual/programmatic output generation).
AlphaEvolve Breakthrough: Google DeepMind’s AlphaEvolve AI invented a new algorithm that improved data center efficiency by 0.7% and sped up AI training by 23%, demonstrating AI’s capability to create novel scientific solutions beyond simple code refinement.

3. Business/Investment Angle

Saudi Arabia’s AI Ambition: Saudi Arabia is making a massive $600 billion AI push, securing major hardware deals (e.g., Nvidia chips via Humane, AMD infrastructure) and planning significant data center investments, signaling a global race to become an AI hub.
ChatGPT as a Shopping Platform: OpenAI is aggressively integrating shopping features (product cards, pricing, reviews) directly into ChatGPT, posing a direct challenge to incumbents like Google Shopping and Amazon (whose own AI shopping assistant, Rufus, was deemed poor).
Workspace Integration Value: The inclusion of Microsoft SharePoint and OneDrive connectors in ChatGPT Deep Research highlights the growing importance of integrating LLMs directly with enterprise data ecosystems for proprietary research.

4. Notable Companies/People

OpenAI: Released GPT-4.1, GPT-4o Mini, and integrated SharePoint/OneDrive connectors. The host noted the removal of the older GPT-4.0 model and the potential groundwork for a future social network based on image generation history.
Google (Gemini/DeepMind): Released Gemini 2.5 Pro I/O Edition, Gemini 2.5 Flash (for API efficiency), VEO (video generation), and showcased AlphaEvolve’s scientific achievements.
Anthropic (Claude): Mentioned as having previously offered deep research/data source integration (Artifacts) that OpenAI and Gemini are now competing with or surpassing.
Elon Musk (X AI/Grok): Mentioned due to Grok’s controversial, unprompted comments regarding violence in South Africa, highlighting moderation challenges.

5. Future Implications

The industry is moving toward deeper enterprise integration (SharePoint/OneDrive access), sophisticated output generation (Gemini Canvas surpassing Claude Artifacts), and the blurring of lines between search, commerce, and AI assistance (ChatGPT shopping). Furthermore, AI is proving capable of genuine scientific discovery (AlphaEvolve), suggesting future exponential leaps in R&D across various hard sciences.

6. Target Audience

This episode is highly valuable for AI Professionals, Product Managers, Technology Strategists, and Power Users who rely on daily LLM performance for their work and need to stay current on competitive feature parity and model capabilities across the major platforms.

Comprehensive Narrative Summary

The podcast opens by acknowledging the overwhelming volume of LLM updates, setting the stage for a necessary catch-up session covering ChatGPT, Gemini, and Claude. Before diving into the core LLM comparisons, the host quickly covers three major news items: Saudi Arabia’s massive $600 billion commitment to AI infrastructure and partnerships; the controversy surrounding Elon Musk’s Grok bot exhibiting racially charged, unprompted outputs; and the significant scientific achievement of Google DeepMind’s AlphaEvolve creating new, highly efficient algorithms.

The main segment focuses on ChatGPT updates. OpenAI rolled out GPT-4.1 to paid users, noting its longer context and better coding skills, though its relationship with the default GPT-4o remains confusing. A major practical update is the integration of Microsoft SharePoint and OneDrive connectors into ChatGPT’s Deep Research feature, allowing users to query internal company data directly—a feature previously seen in Claude. Furthermore, ChatGPT is evolving into a full shopping platform, complete with product cards and pricing, directly challenging Amazon and Google Shopping. Other April updates included the removal of the older GPT-4 model, the introduction of memory features (which the host dislikes for power users due to mixing contexts), and the subtle groundwork for a potential social network via centralized image saving.

The focus then shifts to Google Gemini, which the host claims “woke up and chose violence” by releasing the Gemini 2.5 Pro I/O Edition just before its I/O conference. This new version is benchmarked as the world’s most powerful model, particularly excelling in coding and dramatically improving the Canvas feature (Google’s equivalent to Claude Artifacts), which allows users to generate interactive web apps or data visualizations from raw data dumps.

🏢 Companies Mentioned

Google's new deep research with 2.5 Pro ✅ ai_application

IBM Think conference ✅ organization

NotebookLM ✅ ai_application

Meta ✅ big_tech

Anthropic (implied via Claude) ✅ ai_application

So May ✅ unknown

Anthropic Claude ✅ unknown

Google Gemini API ✅ unknown

Google One ✅ unknown

Google Workspace ✅ unknown

Gemini Advanced ✅ unknown

LLM Arena ✅ unknown

Sometimes I ✅ unknown

Pro I ✅ unknown

Claude Artifacts ✅ unknown

💬 Key Insights

"Yeah, I think 10 million token context window, which is nutty, and the Mixture of Experts architecture."

Impact Score: 10

"They announced their new models: Llama 3 for the Scout, Llama 3 for Scout, and Llama 3 for Maverick, which are already released and the first open-weight natively multi-modal models with unprecedented context support. Yeah, I think 10 million token context window, which is nutty, and the Mixture of Experts architecture."

Impact Score: 10

"Google Gemini did also update the deep research to 2.5, 2.5 Pro as well. That was in April. That one's big, y'all, that one's big because I have been saying for a while that OpenAI's deep research was in a league of its own, but now Google Gemini with their 2.5 Pro is in that league."

Impact Score: 10

"ChatGPT has added Microsoft SharePoint and OneDrive connectors for deep research for Plus, Pro, and Team users globally."

Impact Score: 10

"All the people that say AI is nothing but advanced autocomplete. It just literally created a new algorithm that's shattering the scientific community."

Impact Score: 10

"Google DeepMind's AlphaEvolve AI has broken records by inventing new algorithms for real-world impact."

Impact Score: 10

📊 Topics

#artificialintelligence 164 #generativeai 127 #investment 2 #aiinfrastructure 2 #startup 1

🧠 Key Takeaways

💡 be seeing Llama's behemoth, which is the large version, and then we'll separately get a reasoning model