Convergence analysis of machine learning algorithms for the numerical solution of mean field control and games: II—the finite horizon case

René Carmona; Mathieu Laurière

doi:10.1214/21-AAP1715

Abstract

We propose two numerical methods for the optimal control of McKean–Vlasov dynamics in finite time horizon. Both methods are based on the introduction of a suitable loss function defined over the parameters of a neural network. This allows the use of machine learning tools, and efficient implementations of stochastic gradient descent in order to perform the optimization. In the first method, the loss function stems directly from the optimal control problem. The second method tackles a generic forward-backward stochastic differential equation system (FBSDE) of McKean–Vlasov type, and relies on suitable reformulation as a mean field control problem. To provide a guarantee on how our numerical schemes approximate the solution of the original mean field control problem, we introduce a new optimization problem, directly amenable to numerical computation, and for which we rigorously provide an error rate. Several numerical examples are provided. Both methods can easily be applied to certain problems with common noise, which is not the case with the existing technology. Furthermore, although the first approach is designed for mean field control problems, the second is more general and can also be applied to the FBSDEs arising in the theory of mean field games.

Funding Statement

Both authors were partially supported by ARO Grant AWD1005491 and NSF award AWD1005433.

Acknowledgments

Also, we would like to thank two anonymous referees for a rigorous review of the original version of the paper. Their insightful comments helped us improve significantly the quality of the paper.

Citation

Download Citation

René Carmona. Mathieu Laurière. "Convergence analysis of machine learning algorithms for the numerical solution of mean field control and games: II—the finite horizon case." Ann. Appl. Probab. 32 (6) 4065 - 4105, December 2022. https://doi.org/10.1214/21-AAP1715

Information

Received: 1 September 2019; Revised: 1 October 2020; Published: December 2022

First available in Project Euclid: 6 December 2022

MathSciNet: MR4522347

zbMATH: 1505.65243

Digital Object Identifier: 10.1214/21-AAP1715

Subjects:

Primary: 60H10 , 65C30 , 91A13

Keywords: forward-backward SDE , machine learning , McKean–Vlasov , mean field control , Mean field games , numerical approximation

Abstract

Funding Statement

Acknowledgments

Citation

Information

KEYWORDS/PHRASES

PUBLICATION TITLE:

PUBLICATION YEARS