{"title":"A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search","authors":"Shuhei Goda, Yudai Hayashi, Yuta Saito","doi":"arxiv-2409.10992","DOIUrl":null,"url":null,"abstract":"Matching users with mutual preferences is a critical aspect of services\ndriven by reciprocal recommendations, such as job search. To produce\nrecommendations in such scenarios, one can predict match probabilities and\nconstruct rankings based on these predictions. However, this direct match\nprediction approach often underperforms due to the extreme sparsity of match\nlabels. Therefore, most existing methods predict preferences separately for\neach direction (e.g., job seeker to employer and employer to job seeker) and\nthen aggregate the predictions to generate overall matching scores and produce\nrecommendations. However, this typical approach often leads to practical\nissues, such as biased error propagation between the two models. This paper\nintroduces and demonstrates a novel and practical solution to improve\nreciprocal recommendations in production by leveraging \\textit{pseudo-match\nscores}. Specifically, our approach generates dense and more directly relevant\npseudo-match scores by combining the true match labels, which are accurate but\nsparse, with relatively inaccurate but dense match predictions. We then train a\nmeta-model to output the final match predictions by minimizing the prediction\nloss against the pseudo-match scores. Our method can be seen as a\n\\textbf{best-of-both (BoB) approach}, as it combines the high-level ideas of\nboth direct match prediction and the two separate models approach. It also\nallows for user-specific weights to construct \\textit{personalized}\npseudo-match scores, achieving even better matching performance through\nappropriate tuning of the weights. Offline experiments on real-world job search\ndata demonstrate the superior performance of our BoB method, particularly with\npersonalized pseudo-match scores, compared to existing approaches in terms of\nfinding potential matches.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"67 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.10992","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Matching users with mutual preferences is a critical aspect of services
driven by reciprocal recommendations, such as job search. To produce
recommendations in such scenarios, one can predict match probabilities and
construct rankings based on these predictions. However, this direct match
prediction approach often underperforms due to the extreme sparsity of match
labels. Therefore, most existing methods predict preferences separately for
each direction (e.g., job seeker to employer and employer to job seeker) and
then aggregate the predictions to generate overall matching scores and produce
recommendations. However, this typical approach often leads to practical
issues, such as biased error propagation between the two models. This paper
introduces and demonstrates a novel and practical solution to improve
reciprocal recommendations in production by leveraging \textit{pseudo-match
scores}. Specifically, our approach generates dense and more directly relevant
pseudo-match scores by combining the true match labels, which are accurate but
sparse, with relatively inaccurate but dense match predictions. We then train a
meta-model to output the final match predictions by minimizing the prediction
loss against the pseudo-match scores. Our method can be seen as a
\textbf{best-of-both (BoB) approach}, as it combines the high-level ideas of
both direct match prediction and the two separate models approach. It also
allows for user-specific weights to construct \textit{personalized}
pseudo-match scores, achieving even better matching performance through
appropriate tuning of the weights. Offline experiments on real-world job search
data demonstrate the superior performance of our BoB method, particularly with
personalized pseudo-match scores, compared to existing approaches in terms of
finding potential matches.