3年，2篇论文，1门课程:最优非货币奖励政策

Microeconomics: Asymmetric & Private Information eJournal Pub Date : 2020-11-24 DOI:10.2139/ssrn.3647569

Wei Chen, Shivam Gupta, Milind Dawande, G. Janakiraman

{"title":"3年，2篇论文，1门课程:最优非货币奖励政策","authors":"Wei Chen, Shivam Gupta, Milind Dawande, G. Janakiraman","doi":"10.2139/ssrn.3647569","DOIUrl":null,"url":null,"abstract":"We consider a principal who periodically offers a fixed, binary, and costly non-monetary reward to agents endowed with private information, to incentivize the agents to invest effort over the long run. An agent's output, as a function of his effort, is a priori uncertain and is worth a fixed per-unit value to the principal. The principal's goal is to design an attractive reward policy that specifies how the rewards are to be given to an agent over time, based on that agent's past performance. This problem, which we denote by P, is motivated by practical examples from both academia (a reduced teaching load for achieving a certain research-productivity threshold) and industry (\"Supplier of the Year\" awards in recognition of excellent past performance). The following \"limited-term'' reward policy structure has been quite popular in practice: The principal evaluates each agent periodically; if an agent's performance over a certain (limited) number of periods in the immediate past exceeds a pre-defined threshold, then the principal rewards him for a certain (limited) number of periods in the immediate future. For the deterministic special case of problem P, where there is no uncertainty in any agent's output given his effort, we show that there always exists an optimal policy that is a limited-term policy and also obtain such a policy. When agents' outputs are stochastic, we show that the class of limited-term policies may not contain any optimal policy of problem P but is guaranteed to contain policies that are arbitrarily near-optimal: Given any epsilon>0, we show how to obtain a limited-term policy whose performance is within epsilon of that of an optimal policy. This guarantee depends crucially on the use of sufficiently long histories of the agents' outputs for the determination of the rewards. In situations where access to this historical information is limited, we derive structural insights on the role played by (i) the length of the available history and (ii) the variability in the random variable governing an agent's output, on the performance of this class of policies. Finally, we introduce and analyze the class of \"score-based'' reward policies - we show that this class is guaranteed to contain an optimal policy and also obtain such a policy.","PeriodicalId":119201,"journal":{"name":"Microeconomics: Asymmetric & Private Information eJournal","volume":"281 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"3 Years, 2 Papers, 1 Course Off: Optimal Non-Monetary Reward Policies\",\"authors\":\"Wei Chen, Shivam Gupta, Milind Dawande, G. Janakiraman\",\"doi\":\"10.2139/ssrn.3647569\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider a principal who periodically offers a fixed, binary, and costly non-monetary reward to agents endowed with private information, to incentivize the agents to invest effort over the long run. An agent's output, as a function of his effort, is a priori uncertain and is worth a fixed per-unit value to the principal. The principal's goal is to design an attractive reward policy that specifies how the rewards are to be given to an agent over time, based on that agent's past performance. This problem, which we denote by P, is motivated by practical examples from both academia (a reduced teaching load for achieving a certain research-productivity threshold) and industry (\\\"Supplier of the Year\\\" awards in recognition of excellent past performance). The following \\\"limited-term'' reward policy structure has been quite popular in practice: The principal evaluates each agent periodically; if an agent's performance over a certain (limited) number of periods in the immediate past exceeds a pre-defined threshold, then the principal rewards him for a certain (limited) number of periods in the immediate future. For the deterministic special case of problem P, where there is no uncertainty in any agent's output given his effort, we show that there always exists an optimal policy that is a limited-term policy and also obtain such a policy. When agents' outputs are stochastic, we show that the class of limited-term policies may not contain any optimal policy of problem P but is guaranteed to contain policies that are arbitrarily near-optimal: Given any epsilon>0, we show how to obtain a limited-term policy whose performance is within epsilon of that of an optimal policy. This guarantee depends crucially on the use of sufficiently long histories of the agents' outputs for the determination of the rewards. In situations where access to this historical information is limited, we derive structural insights on the role played by (i) the length of the available history and (ii) the variability in the random variable governing an agent's output, on the performance of this class of policies. Finally, we introduce and analyze the class of \\\"score-based'' reward policies - we show that this class is guaranteed to contain an optimal policy and also obtain such a policy.\",\"PeriodicalId\":119201,\"journal\":{\"name\":\"Microeconomics: Asymmetric & Private Information eJournal\",\"volume\":\"281 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Microeconomics: Asymmetric & Private Information eJournal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3647569\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microeconomics: Asymmetric & Private Information eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3647569","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

我们考虑一个委托人，他定期向拥有私人信息的代理人提供固定的、二元的、昂贵的非货币性奖励，以激励代理人在长期内投入努力。作为其努力的函数，代理的输出是先验的不确定的，并且对委托人的单位价值是固定的。委托人的目标是设计一个有吸引力的奖励政策，根据代理人过去的表现，指定随着时间的推移如何给予奖励。这个问题，我们用P表示，是由来自学术界(减少教学负担以达到一定的研究生产力门槛)和工业界(“年度供应商”奖，以表彰过去的出色表现)的实际例子所激发的。实践中普遍采用的“有限期限”奖励政策结构是:委托人定期对代理人进行评估;如果代理人在过去一定(有限)时间内的表现超过了预先定义的阈值，那么委托人就会在不久的将来一定(有限)时间内奖励他。对于问题P的确定性特例，在给定智能体努力的情况下，任何智能体的输出都不存在不确定性，我们证明了总是存在一个最优策略，该策略是有限期限策略，并且也得到了这样一个策略。当智能体的输出是随机的时，我们证明了有限期策略类可能不包含问题P的任何最优策略，但保证包含任意接近最优的策略:给定任意epsilon>0，我们展示了如何获得性能在最优策略的epsilon内的有限期策略。这种保证关键依赖于使用足够长的代理输出历史来确定奖励。在对历史信息的访问受到限制的情况下，我们对(i)可用历史的长度和(ii)控制代理输出的随机变量的可变性对这类策略的性能所起的作用得出了结构性的见解。最后，我们引入并分析了一类“基于分数”的奖励策略，我们证明了该类保证包含一个最优策略，并且也获得了这样一个最优策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

3 Years, 2 Papers, 1 Course Off: Optimal Non-Monetary Reward Policies

We consider a principal who periodically offers a fixed, binary, and costly non-monetary reward to agents endowed with private information, to incentivize the agents to invest effort over the long run. An agent's output, as a function of his effort, is a priori uncertain and is worth a fixed per-unit value to the principal. The principal's goal is to design an attractive reward policy that specifies how the rewards are to be given to an agent over time, based on that agent's past performance. This problem, which we denote by P, is motivated by practical examples from both academia (a reduced teaching load for achieving a certain research-productivity threshold) and industry ("Supplier of the Year" awards in recognition of excellent past performance). The following "limited-term'' reward policy structure has been quite popular in practice: The principal evaluates each agent periodically; if an agent's performance over a certain (limited) number of periods in the immediate past exceeds a pre-defined threshold, then the principal rewards him for a certain (limited) number of periods in the immediate future. For the deterministic special case of problem P, where there is no uncertainty in any agent's output given his effort, we show that there always exists an optimal policy that is a limited-term policy and also obtain such a policy. When agents' outputs are stochastic, we show that the class of limited-term policies may not contain any optimal policy of problem P but is guaranteed to contain policies that are arbitrarily near-optimal: Given any epsilon>0, we show how to obtain a limited-term policy whose performance is within epsilon of that of an optimal policy. This guarantee depends crucially on the use of sufficiently long histories of the agents' outputs for the determination of the rewards. In situations where access to this historical information is limited, we derive structural insights on the role played by (i) the length of the available history and (ii) the variability in the random variable governing an agent's output, on the performance of this class of policies. Finally, we introduce and analyze the class of "score-based'' reward policies - we show that this class is guaranteed to contain an optimal policy and also obtain such a policy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Microeconomics: Asymmetric & Private Information eJournal

自引率

0.00%

发文量

期刊最新文献

Quality and Pricing Decisions for Reward-based Crowdfunding: Effects of Moral Hazard Punish Underperformance with Resting Optimal Dynamic Contracts in the Presence of Switching Cost A reconsideration of the Rothschild-Stiglitz insurance market model by information theory Learning from Law Enforcement Pulp Friction: The Value of Quantity Contracts in Decentralized Markets