June 2024
tl;dr: A principled decision making framework to account for interaction with other agents.
MPDM starts with a regorous formulation and makes assumptions with domain knowledge in autonomous driving to simplify the problem to be tractable for online deployment.
MPDM forward simulates multiple scenarios initiated by different policy (high level intention or behavior pattern) of ego and how other vehicles would react. This brings two advantages of MPDM.
MPDM can enable personalized driving experience we can evaluate multipe outcome using user defined cost function to accomodate different driving preferences. –> This is extended to include risk tolerance in MARC.
MPDM can enable intelligent and human-like behavior of active cut-in into dense traffic flow even when there is not a sufficient gap present(华山挤出一条路). This is NOT possible with a predict-then-plan schema without considering the interaction explicitly.
Despite simple design, MPDM is a pioneering work in decision making, and improved by subsequent works. MPDM has the assumption that the ego intention does not change within the planning horizon (10s, at 0.25s). This is improved by EUDM which allows change of ego policy within planning horizon once, and MARC which introduces risk aware contigency planning.