Models & Releases
Quoting Romain Huet
OpenAI has merged Codex into its main model line starting with GPT-5.4, eliminating the separate coding variant. GPT-5.5 extends this unified approach with improved agentic coding and computer-use capabilities, signaling a shift toward single-model versatility over specialized branches.
Simon Willison · Apr 25 · 89

Research · Policy & Regulation
Anthropic says stronger AI models cut better deals, and the losers don't even notice
Anthropic ran a week-long marketplace experiment where 69 AI agents negotiated deals on behalf of employees, revealing that stronger models systematically outperformed weaker ones while users remained unaware of the disparity. The finding raises concerns about economic inequality if AI agents begin handling real-world transactions without human oversight.
The Decoder · Apr 25 · 73

Models & Releases · Products & Apps
Ace the Ping-Pong Robot Can Whup Your Ass
Ace, a ping-pong robot, demonstrates real-time ball trajectory prediction and adaptive racket control to sustain volleys against human opponents. The system combines computer vision and motor control to compete in a sport requiring split-second decision-making.
WIRED — AI · Apr 25 · 47

Policy & Regulation · Business & Funding
The UAE wants half its government run by autonomous AI agents within two years
The UAE announced plans to automate half its government operations using autonomous AI agents by 2028, marking one of the most ambitious public-sector AI deployments at scale. The initiative signals how nation-states are treating AI infrastructure as core to governance modernization.
The Decoder · Apr 25 · 85

Business & Funding
Google pours up to $40 billion into ChatGPT rival Anthropic
Google committed up to $40 billion to Anthropic, following Amazon's $25 billion pledge weeks earlier, bringing recent capital commitments to the Claude maker to as much as $65 billion. The dual mega-rounds signal intensifying competition for frontier AI capabilities as tech giants vie for leadership in the post-ChatGPT era.
The Decoder · Apr 25 · 97

Tools & Code · Products & Apps
GPT-5.5 prompting guide
OpenAI published best-practice guidance for GPT-5.5, now available via the API, including a technique for multi-step tasks in which the model sends user-visible status updates before tool calls to improve perceived responsiveness.
Simon Willison · Apr 25 · 64

Tools & Code · Models & Releases
llm 0.31
Simon Willison's llm CLI tool now supports GPT-5.5 and adds a verbosity parameter for controlling output detail on OpenAI's latest models. The update brings developer-facing control over response formatting for newer model tiers.
Simon Willison · Apr 24 · 64

Opinion & Analysis
The people do not yearn for automation
Nilay Patel's essay examines why AI adoption remains unpopular despite ChatGPT's soaring usage, arguing that technologists' obsession with automation and data modeling has created a cultural rift with the general public.
Simon Willison · Apr 24 · 77

Models & Releases
Three reasons why DeepSeek’s new model V4 matters
DeepSeek unveiled V4, its next-generation flagship model with substantially improved context window handling through architectural redesign. The open-source release marks a competitive escalation in long-context capabilities among frontier labs.
MIT Technology Review — AI · Apr 24 · 89

Models & Releases · Products & Apps
Introducing GPT-5.5 with Perplexity
OpenAI released GPT-5.5, which cuts token consumption by 56% while maintaining or improving speed on agentic workflows. Early adopters like Perplexity report significant productivity gains, with one engineer completing an internal tool in under an hour versus days previously.
OpenAI (YouTube) · Apr 24 · 52

Products & Apps
Workspace agents in ChatGPT: Weekly metrics reporting agent
OpenAI demonstrated a workspace agent within ChatGPT that automates end-to-end business reporting: extracting weekly metrics, generating visualizations, writing narrative copy, and packaging a shareable report. The walkthrough illustrates practical agentic workflow capabilities moving beyond single-task chatbots.
OpenAI (YouTube) · Apr 24 · 52

Products & Apps
Workspace agents in ChatGPT: Software review agent
OpenAI rolled out workspace agents in ChatGPT, autonomous systems that handle enterprise workflows like software request review, policy checking, and IT ticket routing. The feature targets team productivity by automating multi-step approval processes with clear handoff logic.
OpenAI (YouTube) · Apr 24 · 52

Products & Apps · Tools & Code
Workspace agents in ChatGPT: Third-party risk management agent
OpenAI rolled out workspace agents in ChatGPT, autonomous tools built on Codex that handle vendor risk screening across sanctions, financials, and reputation. The feature transforms compliance workflows into structured reports for enterprise teams.
OpenAI (YouTube) · Apr 24 · 52

Models & Releases · Products & Apps
Introducing GPT-5.5 with NVIDIA's AI Researcher
OpenAI unveiled GPT-5.5, calling it the company's most capable model yet, with NVIDIA researchers reporting 10x faster experiment execution. The model demonstrated autonomous code refactoring and abstract problem-solving in early demonstrations.
OpenAI (YouTube) · Apr 24 · 52

Products & Apps · Business & Funding
ComfyUI hits $500M valuation as creators seek more control over AI-generated media
ComfyUI, a node-based interface for fine-grained control over generative AI workflows, raised $30 million at a $500 million valuation. The funding reflects growing creator demand for alternatives to black-box AI tools, positioning the open-source platform as a bridge between professional studios and individual artists seeking transparency.
TechCrunch — AI · Apr 24 · 69

Models & Releases · Opinion & Analysis
GPT 5.5 Arrives, DeepSeek V4 Drops, and the Compute War Intensifies
OpenAI released GPT-5.5 alongside DeepSeek's V4 launch, intensifying competition in frontier model capabilities. The analysis covers performance comparisons across multiple models and explores implications of compute scarcity in the current AI landscape.
AI Explained · Apr 24 · 67

Models & Releases · Opinion & Analysis
OpenAI's chief scientist says AI progress has been "surprisingly slow" and promises big leaps ahead
OpenAI released GPT-5.5 and chief scientist Jakub Pachocki signaled that major capability breakthroughs remain ahead, characterizing the current pace of progress as slower than expected. The framing suggests the lab is recalibrating expectations while positioning future releases as transformative.
The Decoder · Apr 24 · 73

Business & Funding · Hardware & Infra
Google to invest up to $40B in Anthropic in cash and compute
Google is committing up to $40 billion to Anthropic in cash and compute resources, intensifying the race among AI labs to secure infrastructure capacity. The deal follows Anthropic's release of Mythos, a cybersecurity-focused model, signaling Google's strategic bet on the startup as competition for frontier AI capabilities heats up.
TechCrunch — AI · Apr 24 · 92

Research · Tools & Code
Spend Less, Fit Better: Budget-Efficient Scaling Law Fitting via Active Experiment Selection
Researchers propose an active learning method to cut the cost of fitting scaling laws, a process whose pilot experiments currently consume millions in compute. The technique selects which training runs to execute from a heterogeneous pool to maximize extrapolation accuracy for high-cost target regions, outperforming classical design approaches across benchmarks.
arXiv cs.LG · Apr 24 · 62

Research
How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks
Researchers analyzed token consumption across eight frontier LLMs running agentic coding tasks on SWE-bench Verified, finding agentic workflows burn 1000x more tokens than traditional code reasoning. The study also evaluates whether models can predict their own token costs before execution, offering practical insights for teams deploying cost-sensitive AI agents.
arXiv cs.CL · Apr 24 · 62

Research
Representational Harms in LLM-Generated Narratives Against Global Majority Nationalities
Researchers found that major LLMs generate narratives containing persistent stereotypes, erasure, and one-dimensional portrayals of people from Global Majority nationalities. The study evaluates representational harms in open-ended text generation, with implications for high-stakes applications like asylum interviews.
arXiv cs.CL · Apr 24 · 58

Research
Relaxation-Informed Training of Neural Network Surrogate Models
Researchers propose training regularizers that optimize neural network surrogates for embedding in mixed-integer linear programs, directly controlling MILP tractability properties like binary variable count and relaxation tightness rather than relying on standard prediction loss alone.
arXiv cs.LG · Apr 24 · 52

Business & Funding
Canadian, German AI Startups Join Forces to Challenge US Dominance
A Canadian AI startup and a German AI startup are partnering to build a regional AI stack designed to reduce dependence on US vendors while meeting European and Canadian regulatory requirements. The collaboration signals growing appetite among non-US players to develop sovereign AI infrastructure.
AI Business · Apr 24 · 55

Research · Models & Releases
Neural Recovery of Historical Lexical Structure in Bantu Languages from Modern Data
Researchers trained a transformer model on modern Bantu morphology and recovered historical Proto-Bantu lexical structure, validating 91% of top noun cognate predictions against established reconstructions. The work demonstrates neural models can infer deep linguistic history from contemporary data alone, with practical applications to language documentation.
arXiv cs.CL · Apr 24 · 54

Research
Zero-Shot Morphological Discovery in Low-Resource Bantu Languages via Cross-Lingual Transfer and Unsupervised Clustering
Researchers used cross-lingual transfer learning and unsupervised clustering to automatically discover morphological patterns in Giriama, a low-resource Bantu language with minimal labeled data. The method identified two previously unknown prefix variants and achieved 86.7% lemmatization accuracy across 19,624 words, demonstrating practical gains for linguistic analysis in data-scarce settings.
arXiv cs.CL · Apr 24 · 52

Research · Tools & Code
Aligning Dense Retrievers with LLM Utility via Distillation
Researchers propose Utility-Aligned Embeddings, a technique that trains retrieval models to match LLM utility signals without requiring expensive test-time inference. The method embeds graded relevance directly into dense vectors, potentially making RAG systems faster and more accurate than current similarity-based or LLM re-ranking approaches.
arXiv cs.LG · Apr 24 · 58

Business & Funding · Products & Apps
AI-Designed Drugs by a DeepMind Spinoff Are Headed to Human Trials
Isomorphic Labs, a DeepMind spinoff, is advancing AI-designed drug candidates into human clinical trials, marking a concrete validation of machine learning in drug discovery beyond the research phase.
WIRED — AI · Apr 24 · 81

Policy & Regulation
How Project Maven taught the military to love AI
The US military's 1,000+ target strike on Iran in 24 hours relied heavily on AI systems like Maven Smart to accelerate targeting workflows, demonstrating how defense AI has matured from experimental to operationally decisive.
The Verge — AI · Apr 24 · 81

Models & Releases
GPT-5.5 Boasts Coding Advancements, But Falls Short of Opus 4.7
OpenAI's GPT-5.5 shows gains in coding and tool use but trails Anthropic's Claude Opus 4.7 in key benchmarks. The release underscores intensifying competition between frontier labs on specialized capabilities rather than raw scale.
AI Business · Apr 24 · 61

Business & Funding · Hardware & Infra
Samsung may be bracing for first-ever annual loss in smartphone business
Samsung faces a potential first annual loss in smartphones as AI-driven demand for memory chips strains production capacity and margins. The memory shortage, fueled by the AI infrastructure buildout, is reshaping the competitive dynamics of consumer device makers dependent on chip supply.
Ars Technica — AI · Apr 24 · 69