
D$^3$-Subsidy: Online and Sequential Driver Subsidy Decision-Making for Large-Scale Ride-Hailing Market
Researchers have developed D3-Subsidy, a diffusion-based controller for real-time driver incentive optimization in ride-hailing networks. The system addresses a core operations-research problem in marketplace AI: balancing dynamic supply and demand under strict budget caps and latency constraints at city scale. Rather than optimizing individual transactions, the approach uses forward-looking sequential decision-making to coordinate subsidy allocation across entire regions. This work bridges reinforcement learning and practical production constraints, offering a template for how ML systems handle multi-objective optimization in high-stakes, real-time consumer marketplaces where traditional per-order methods become computationally prohibitive.58




















