Off-Policy Evaluation and Learning for Matching Markets
Matching users based on mutual preferences is a fundamental aspect of services driven by reciprocal recommendations, such as job search and dating applications. Although A/B tests remain the gold standard for evaluating new policies in recommender systems
https://arxiv.org/abs/2507.13608