Modelwire

A curated feed of what matters in AI. Independent, ad-supported, built in Denver, Colorado.

© 2026 Modelwire. All article links go to the original publishers. Summaries generated by Modelwire. We don’t republish full articles.

Earlier stories

The full Modelwire feed, ordered by publish time.

Models & Releases

Quoting Romain Huet

OpenAI has merged Codex into its main model line starting with GPT-5.4, eliminating the separate coding variant. GPT-5.5 extends this unified approach with improved agentic coding and computer-use capabilities, signaling a shift toward single-model versatility over specialized branches.

Simon Willison·Apr 25

89

Research Policy & Regulation

Anthropic says stronger AI models cut better deals, and the losers don't even notice

Anthropic ran a week-long marketplace experiment where 69 AI agents negotiated deals on behalf of employees, revealing that stronger models systematically outperformed weaker ones while users remained unaware of the disparity. The finding raises concerns about economic inequality if AI agents begin handling real-world transactions without human oversight.

The Decoder·Apr 25

73

Models & Releases Products & Apps

Ace the Ping-Pong Robot Can Whup Your Ass

Ace, a ping-pong robot, demonstrates real-time ball trajectory prediction and adaptive racket control to sustain volleys against human opponents. The system combines computer vision and motor control to compete in a sport requiring split-second decision-making.

WIRED — AI·Apr 25

47

Policy & Regulation Business & Funding

The UAE wants half its government run by autonomous AI agents within two years

The UAE announced plans to automate half its government operations using autonomous AI agents by 2028, marking one of the most ambitious public-sector AI deployments at scale. The initiative signals how nation-states are treating AI infrastructure as core to governance modernization.

The Decoder·Apr 25

85

Business & Funding

Google pours up to $40 billion into ChatGPT rival Anthropic

Google committed up to $40 billion to Anthropic, following Amazon's $25 billion pledge weeks earlier, bringing the total recently pledged to the Claude maker to as much as $65 billion. The dual mega-rounds signal intensifying competition for frontier AI capabilities as tech giants vie for leadership in the post-ChatGPT era.

The Decoder·Apr 25

97

Tools & Code Products & Apps

GPT-5.5 prompting guide

OpenAI published best-practice guidance for GPT-5.5, now available via the API, including a technique for multi-step tasks in which the model sends user-visible status updates before tool calls to improve perceived responsiveness.

Simon Willison·Apr 25

64

Tools & Code Models & Releases

llm 0.31

Simon Willison's llm CLI tool now supports GPT-5.5 and adds a verbosity parameter for controlling output detail on OpenAI's latest models. The update brings developer-facing control over response formatting for newer model tiers.

Simon Willison·Apr 24

64

Opinion & Analysis

The people do not yearn for automation

Nilay Patel's essay examines why AI adoption remains unpopular despite ChatGPT's soaring usage, arguing that technologists' obsession with automation and data modeling has created a cultural rift with the general public.

Simon Willison·Apr 24

77

Models & Releases

Three reasons why DeepSeek’s new model V4 matters

DeepSeek unveiled V4, its next-generation flagship model with substantially improved context window handling through architectural redesign. The open-source release marks a competitive escalation in long-context capabilities among frontier labs.

MIT Technology Review — AI·Apr 24

89

Models & Releases Products & Apps

Introducing GPT-5.5 with Perplexity

OpenAI released GPT-5.5, which cuts token consumption by 56% while maintaining or improving speed on agentic workflows. Early adopters like Perplexity report significant productivity gains, with one engineer completing an internal tool in under an hour versus days previously.

OpenAI (YouTube)·Apr 24

52

Products & Apps

Workspace agents in ChatGPT: Weekly metrics reporting agent

OpenAI demonstrated a workspace agent within ChatGPT that automates end-to-end business reporting: extracting weekly metrics, generating visualizations, writing narrative copy, and packaging a shareable report. The walkthrough illustrates practical agentic workflow capabilities moving beyond single-task chatbots.

OpenAI (YouTube)·Apr 24

52

Products & Apps

Workspace agents in ChatGPT: Software review agent

OpenAI rolled out workspace agents in ChatGPT, autonomous systems that handle enterprise workflows like software request review, policy checking, and IT ticket routing. The feature targets team productivity by automating multi-step approval processes with clear handoff logic.

OpenAI (YouTube)·Apr 24

52

Products & Apps Tools & Code

Workspace agents in ChatGPT: Third-party risk management agent

OpenAI rolled out workspace agents in ChatGPT, autonomous tools built on Codex that handle vendor risk screening across sanctions, financials, and reputation. The feature transforms compliance workflows into structured reports for enterprise teams.

OpenAI (YouTube)·Apr 24

52

Models & Releases Products & Apps

Introducing GPT-5.5 with NVIDIA's AI Researcher

OpenAI unveiled GPT-5.5, calling it the company's most capable model yet, with NVIDIA researchers reporting 10x faster experiment execution. In early demonstrations, the model refactored code autonomously and solved abstract problems.

OpenAI (YouTube)·Apr 24

52

Products & Apps Business & Funding

ComfyUI hits $500M valuation as creators seek more control over AI-generated media

ComfyUI, a node-based interface for fine-grained control over generative AI workflows, raised $30 million at a $500 million valuation. The funding reflects growing creator demand for alternatives to black-box AI tools, positioning the open-source platform as a bridge between professional studios and individual artists seeking transparency.

TechCrunch — AI·Apr 24

69

Models & Releases Opinion & Analysis

GPT 5.5 Arrives, DeepSeek V4 Drops, and the Compute War Intensifies

OpenAI released GPT-5.5 as DeepSeek launched V4, intensifying competition in frontier model capabilities. The analysis compares performance across multiple models and explores the implications of compute scarcity in the current AI landscape.

AI Explained·Apr 24

67

Models & Releases Opinion & Analysis

OpenAI's chief scientist says AI progress has been "surprisingly slow" and promises big leaps ahead

OpenAI released GPT-5.5 and chief scientist Jakub Pachocki signaled that major capability breakthroughs remain ahead, characterizing the current pace of progress as slower than expected. The framing suggests the lab is recalibrating expectations while positioning future releases as transformative.

The Decoder·Apr 24

73

Business & Funding Hardware & Infra

Google to invest up to $40B in Anthropic in cash and compute

Google is committing up to $40 billion to Anthropic in cash and compute resources, intensifying the race among AI labs to secure infrastructure capacity. The deal follows Anthropic's release of Mythos, a cybersecurity-focused model, signaling Google's strategic bet on the startup as competition for frontier AI capabilities heats up.

TechCrunch — AI·Apr 24

92

Research Tools & Code

Spend Less, Fit Better: Budget-Efficient Scaling Law Fitting via Active Experiment Selection

Researchers propose an active learning method to cut the cost of fitting scaling laws, which currently consume millions in compute during pilot experiments. The technique selects which training runs to execute from a heterogeneous pool to maximize extrapolation accuracy for high-cost target regions, outperforming classical design approaches across benchmarks.

arXiv cs.LG·Apr 24

62

How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks

Researchers analyzed token consumption across eight frontier LLMs running agentic coding tasks on SWE-bench Verified, finding agentic workflows burn 1000x more tokens than traditional code reasoning. The study also evaluates whether models can predict their own token costs before execution, offering practical insights for teams deploying cost-sensitive AI agents.

arXiv cs.CL·Apr 24

62

Representational Harms in LLM-Generated Narratives Against Global Majority Nationalities

Researchers found that major LLMs generate narratives containing persistent stereotypes, erasure, and one-dimensional portrayals of people from Global Majority nationalities. The study evaluates representational harms in open-ended text generation, with implications for high-stakes applications like asylum interviews.

arXiv cs.CL·Apr 24

58

Relaxation-Informed Training of Neural Network Surrogate Models

Researchers propose training regularizers that optimize neural network surrogates for embedding in mixed-integer linear programs, directly controlling MILP tractability properties like binary variable count and relaxation tightness rather than relying on standard prediction loss alone.

arXiv cs.LG·Apr 24

52

Business & Funding

Canadian, German AI Startups Join Forces to Challenge US Dominance

A Canadian AI startup and a German AI startup are partnering to build a regional AI stack designed to reduce dependence on US vendors while meeting European and Canadian regulatory requirements. The collaboration signals growing appetite among non-US players to develop sovereign AI infrastructure.

AI Business·Apr 24

55

Research Models & Releases

Neural Recovery of Historical Lexical Structure in Bantu Languages from Modern Data

Researchers trained a transformer model on modern Bantu morphology and recovered historical Proto-Bantu lexical structure, validating 91% of top noun cognate predictions against established reconstructions. The work demonstrates neural models can infer deep linguistic history from contemporary data alone, with practical applications to language documentation.

arXiv cs.CL·Apr 24

54

Zero-Shot Morphological Discovery in Low-Resource Bantu Languages via Cross-Lingual Transfer and Unsupervised Clustering

Researchers used cross-lingual transfer learning and unsupervised clustering to automatically discover morphological patterns in Giriama, a low-resource Bantu language with minimal labeled data. The method identified two previously unknown prefix variants and achieved 86.7% lemmatization accuracy across 19,624 words, demonstrating practical gains for linguistic analysis in data-scarce settings.

arXiv cs.CL·Apr 24

52

Research Tools & Code

Aligning Dense Retrievers with LLM Utility via Distillation

Researchers propose Utility-Aligned Embeddings, a technique that trains retrieval models to match LLM utility signals without requiring expensive test-time inference. The method embeds graded relevance directly into dense vectors, potentially making RAG systems faster and more accurate than current similarity-based or LLM re-ranking approaches.

arXiv cs.LG·Apr 24

58

Business & Funding Products & Apps

AI-Designed Drugs by a DeepMind Spinoff Are Headed to Human Trials

Isomorphic Labs, a DeepMind spinoff, is advancing AI-designed drug candidates into human clinical trials, marking a concrete validation of machine learning in drug discovery beyond the research phase.

WIRED — AI·Apr 24

81

Policy & Regulation

How Project Maven taught the military to love AI

The US military's strike on more than 1,000 targets in Iran within 24 hours relied heavily on AI systems like Maven Smart to accelerate targeting workflows, demonstrating how defense AI has matured from experimental to operationally decisive.

The Verge — AI·Apr 24

81

Models & Releases

GPT-5.5 Boasts Coding Advancements, But Falls Short of Opus 4.7

OpenAI's GPT-5.5 shows gains in coding and tool use but trails Anthropic's Claude Opus 4.7 in key benchmarks. The release underscores intensifying competition between frontier labs on specialized capabilities rather than raw scale.

AI Business·Apr 24

61

Business & Funding Hardware & Infra

Samsung may be bracing for first-ever annual loss in smartphone business

Samsung faces a potential first-ever annual loss in its smartphone business as AI-driven demand for memory chips strains production capacity and margins. The memory shortage, fueled by the AI infrastructure buildout, is reshaping the competitive dynamics of consumer device makers dependent on chip supply.

Ars Technica — AI·Apr 24

69
