Modelwire

A curated feed of what matters in AI. Independent, ad-supported, built in Denver, Colorado.

© 2026 Modelwire. All article links go to the original publishers. Summaries generated by Modelwire. We don’t republish full articles.

Earlier stories

The full Modelwire feed, ordered by publish time.

Hardware & Infra

Google unveils two new TPUs designed for the "agentic era"

Google split its next-generation Tensor Processing Unit (TPU) line into two specialized processors: one optimized for inference, the other for training. The move signals the company's bet on agentic AI workloads as a distinct infrastructure category.

Ars Technica — AI·Apr 22

81

Efficient Multi-Cohort Inference for Long-Term Effects and Lifetime Value in A/B Testing with User Learning

Researchers propose a method to measure long-term treatment effects and lifetime value changes in A/B tests for streaming platforms, addressing the gap between short-term metrics and actual user retention. The approach uses inverse-variance weighting across multiple cohorts to detect interventions that appear beneficial initially but erode value through churn.

arXiv cs.LG·Apr 22

52

Relative Entropy Estimation in Function Space: Theory and Applications to Trajectory Inference

Researchers developed a scalable method to estimate KL divergence between probability distributions in function space, addressing a key evaluation bottleneck in trajectory inference from snapshot data. The technique enables better assessment of models reconstructing latent dynamics in fields like single-cell genomics where destructive measurements prevent direct path observation.

arXiv cs.LG·Apr 22

52

Products & Apps Business & Funding

Google makes an interesting choice with its new agent building tool for enterprises

Google launched Gemini Enterprise Agent Platform, positioning it specifically for technical and IT teams rather than business users. The move signals a shift in how major AI vendors are segmenting the enterprise agent market.

TechCrunch — AI·Apr 22

58

Policy & Regulation Products & Apps

Anthropic’s Mythos rollout has missed America’s cybersecurity agency

CISA, the US government's central cybersecurity agency, lacks access to Anthropic's Mythos Preview despite other federal agencies adopting the vulnerability-detection model. The exclusion raises questions about coordination gaps in federal AI procurement for critical infrastructure defense.

The Verge — AI·Apr 22

65

Research Models & Releases

Personalized electric vehicle energy consumption estimation framework that integrates driver behavior with map data

Researchers developed a personalized EV energy consumption model combining LSTM-based driver behavior prediction with physics-based battery simulation and map data. The framework estimates real-time state-of-charge across varied terrain by learning individual driving patterns rather than assuming generic driver profiles.

arXiv cs.LG·Apr 22

52

Coverage, Not Averages: Semantic Stratification for Trustworthy Retrieval Evaluation

Researchers formalize retrieval evaluation as a statistical problem and propose semantic stratification, a method that organizes documents into entity-based clusters to systematically test RAG systems across missing query categories. The approach provides formal coverage guarantees and interpretable failure-mode visibility, addressing a core bottleneck in retrieval-augmented generation accuracy.

arXiv cs.LG·Apr 22

58

Products & Apps

AI Overviews are coming to your Gmail at work

Google is rolling out AI Overviews to Gmail's enterprise tier, enabling users to generate instant email summaries across multiple messages. The feature brings Google's LLM-powered summarization directly into workplace productivity workflows.

TechCrunch — AI·Apr 22

58

Models & Releases

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

Alibaba's Qwen3.6-27B achieves coding performance matching its 397B predecessor while shrinking model size from 807GB to 55.6GB, demonstrating major efficiency gains in open-weight model design.

Simon Willison·Apr 22

89

Research Models & Releases

V-tableR1: Process-Supervised Multimodal Table Reasoning with Critic-Guided Policy Optimization

Researchers introduce V-tableR1, a reinforcement learning framework that trains multimodal LLMs to reason step-by-step through visual table tasks using critic feedback. The approach addresses a core weakness in current vision-language models: treating visual reasoning as pattern matching rather than rigorous multi-step inference.

arXiv cs.LG·Apr 22

58

Products & Apps

Google Meet will take AI notes for in-person meetings too

Google's Gemini now generates meeting notes and transcripts across in-person gatherings, Zoom, and Microsoft Teams, expanding beyond its original Google Meet-only scope. The feature graduates from Android-only alpha testing to broader availability, positioning Gemini as a cross-platform meeting intelligence layer.

The Verge — AI·Apr 22

65

Lifecycle-Aware Federated Continual Learning in Mobile Autonomous Systems

Researchers propose a federated continual learning framework that lets distributed autonomous fleets learn collaboratively while mitigating catastrophic forgetting across mission lifecycles. The approach addresses layer-specific forgetting sensitivity and long-term drift accumulation, moving beyond simulation-only validation toward real-world fleet heterogeneity.

arXiv cs.LG·Apr 22

52

Research Tools & Code

AAC: Admissible-by-Architecture Differentiable Landmark Compression for ALT

Researchers introduce AAC, a differentiable neural module that learns to compress landmark sets for shortest-path heuristics while mathematically guaranteeing admissibility without post-hoc calibration. The technique bridges classical algorithmic search with end-to-end neural training, enabling learned graph compression that preserves formal guarantees.

arXiv cs.LG·Apr 22

52

Research Models & Releases

RespondeoQA: a Benchmark for Bilingual Latin-English Question Answering

Researchers released RespondeoQA, the first question-answering benchmark for Latin-English bilingual tasks with 7,800 QA pairs sourced from historical pedagogical materials. Testing Llama 3, Qwen QwQ, and OpenAI's o3-mini revealed all models struggle with skill-oriented questions, suggesting reasoning capabilities remain limited on specialized language tasks.

arXiv cs.CL·Apr 22

42

Research Tools & Code

F²LP-AP: Fast & Flexible Label Propagation with Adaptive Propagation Kernel

Researchers propose F²LP-AP, a training-free graph neural network method that classifies nodes without expensive iterative training by adapting propagation parameters to local graph structure. The approach uses geometric medians and clustering coefficients to handle both homophilous and heterophilous graphs, addressing a key GNN limitation.

arXiv cs.LG·Apr 22

52

Research Tools & Code

Fast Bayesian equipment condition monitoring via simulation based inference: applications to heat exchanger health

Researchers propose a neural-network-based alternative to MCMC for real-time industrial equipment diagnostics, using simulation-based inference to map sensor data directly to degradation parameters without expensive likelihood computations. The method targets heat exchanger monitoring but generalizes to any complex failure-mode diagnosis under uncertainty.

arXiv cs.LG·Apr 22

52

Near-Future Policy Optimization

Researchers propose Near-Future Policy Optimization (NPO), a reinforcement learning technique that balances high-quality external trajectories with accessible training data by optimizing the ratio of value gain to absorption cost, addressing a key bottleneck in post-training RL systems.

arXiv cs.LG·Apr 22

58

Anchor-and-Resume Concession Under Dynamic Pricing for LLM-Augmented Freight Negotiation

Researchers propose a two-index framework for LLM-powered freight negotiation that adapts concession strategies to dynamic pricing without violating offer monotonicity, addressing vulnerabilities in current AI broker systems.

arXiv cs.CL·Apr 22

42

Research Tools & Code

Supplement Generation Training for Enhancing Agentic Task Performance

Researchers propose Supplement Generation Training, a method where smaller LLMs generate task-specific prompts that boost larger foundation models' performance without retraining them. The approach decouples optimization from massive models, reducing computational overhead and enabling faster adaptation to new domains.

arXiv cs.LG·Apr 22

58

Exploiting LLM-as-a-Judge Disposition on Free Text Legal QA via Prompt Optimization

Researchers tested automatic prompt optimization on legal QA evaluation, finding that AI judges trained with lenient feedback criteria outperform strict baselines and generalize better across different judge models. The ProTeGi method consistently beat human-designed prompts on the LEXam benchmark using Qwen3 and DeepSeek judges.

arXiv cs.CL·Apr 22

52

Tokenised Flow Matching for Hierarchical Simulation Based Inference

Researchers propose Tokenised Flow Matching for Posterior Estimation (TFMPE), a technique that cuts simulator costs in hierarchical inference by training neural surrogates on single-site data rather than multi-site batches, then assembling synthetic observations to amortise full posterior inference.

arXiv cs.LG·Apr 22

52

Research Tools & Code

COMPASS: COntinual Multilingual PEFT with Adaptive Semantic Sampling

Researchers propose COMPASS, a parameter-efficient fine-tuning framework that uses semantic clustering to selectively sample multilingual training data, reducing negative cross-lingual interference when adapting LLMs to new languages.

arXiv cs.CL·Apr 22

58

Policy & Regulation

AI Tools Are Helping Mediocre North Korean Hackers Steal Millions

North Korean threat actors leveraged AI to automate malware development and social engineering, stealing up to $12 million in a three-month campaign. The incident underscores how AI commoditizes attack sophistication for lower-skilled adversaries, expanding the threat surface beyond well-resourced nation-states.

WIRED — AI·Apr 22

65

Generative Flow Networks for Model Adaptation in Digital Twins of Natural Systems

Researchers propose using Generative Flow Networks to calibrate digital twin simulators of natural systems when observations are sparse and indirect. The approach frames model adaptation as a generative problem, allowing multiple plausible parameter configurations to be sampled by likelihood rather than forcing a single optimal fit.

arXiv cs.LG·Apr 22

52

Research Tools & Code

Auto-ART: Structured Literature Synthesis and Automated Adversarial Robustness Testing

Researchers synthesized nine years of adversarial robustness literature and released Auto-ART, an open-source framework with 50+ attacks and gradient-masking detection that maps to NIST, OWASP, and EU AI Act standards. The work addresses fragmented evaluation protocols that have hindered trustworthy ML deployment claims.

arXiv cs.LG·Apr 22

62

Models & Releases Hardware & Infra

Gemma 4 VLA Demo on Jetson Orin Nano Super

Google's Gemma 4 VLA (vision-language-action model) now runs on Nvidia's Jetson Orin Nano Super, bringing multimodal inference to edge devices. This expands accessible on-device AI capabilities for robotics and embedded applications.

Hugging Face·Apr 22

72

Research Models & Releases

Storm Surge Modeling, Bias Correction, Graph Neural Networks, Graph Convolution Networks

Researchers introduced StormNet, a graph neural network combining convolutional and attention mechanisms with LSTMs to correct bias in storm surge forecasts from traditional models like ADCIRC. The spatio-temporal approach captures dependencies across water-level monitoring stations to improve tropical cyclone impact predictions.

arXiv cs.LG·Apr 22

52

Hardware & Infra Products & Apps

Google unveils 8th-gen TPUs, agent platform, and Workspace AI layer at Cloud Next '26

Google rolled out eighth-generation TPUs alongside a new agent platform and Workspace AI layer at Cloud Next '26, consolidating its infrastructure and enterprise software under an 'Agentic Enterprise' strategy. The moves signal Google's push to compete in both AI compute and agent-driven productivity tools.

The Decoder·Apr 22

85

MGDA-Decoupled: Geometry-Aware Multi-Objective Optimisation for DPO-based LLM Alignment

Researchers propose MGDA-Decoupled, a geometry-based multi-objective optimization method that balances competing alignment goals in LLM training without relying on reinforcement learning or explicit reward models. The technique addresses fairness issues in existing DPO pipelines by preventing systematic under-weighting of harder-to-optimize objectives like truthfulness or harmlessness.

arXiv cs.LG·Apr 22

58

Variance Is Not Importance: Structural Analysis of Transformer Compressibility Across Model Scales

Researchers systematically tested compression techniques across GPT-2 and Mistral 7B, discovering that high-variance activations don't correlate with model importance and that transformer blocks behave linearly only under specific input distributions. The findings challenge conventional assumptions about which components matter for efficient inference.

arXiv cs.LG·Apr 22

62
