Evolving Mathematical Formulas using LINQ Expression Trees and Direct Applications to Credit Scoring
Authors: Alexandru-Ion Marinescu, A. Andreica
Published in: 2018 20th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2018-09-01
DOI: 10.1109/SYNASC.2018.00069 (https://doi.org/10.1109/SYNASC.2018.00069)
Credit scoring is a well-established and closely scrutinized area of artificial intelligence research with direct implications for how financial institutions operate: it evaluates the risk of approving loans for clients who may or may not repay them on time. We are interested in predicting the clients who fail to repay their debt, which is the harder task because they form only a small minority of all clients. From an input-output perspective, the problem can be stated as follows: given a set of client properties, such as age, marital status, and loan duration, one must produce a binary response variable, where 0 denotes a "good" client and 1 a "bad" client. Many high-accuracy techniques exist, such as artificial neural networks, but they behave as black boxes. We add the constraint that the output must be a concrete, tractable mathematical formula, which is of significant added value to a financial analyst. To this end, we present a means of evolving mathematical formulas using genetic programming coupled with Language Integrated Query (LINQ) expression trees, a feature of the C# programming language.
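To make the idea of a formula held as an expression tree more concrete, the following is a minimal sketch in Python, not the authors' C# implementation. Python's standard `ast` module stands in for LINQ expression trees, which play the analogous role in C#. The formula, its coefficients, the threshold, and all function names (`build_candidate`, `mutate`, `compile_formula`, `classify`) are hypothetical and chosen only for illustration. The subtraction-to-addition mutation is one possible genetic programming operator, not necessarily one used in the paper.

```python
import ast
import copy


def build_candidate() -> ast.Expression:
    """Build a hypothetical formula tree: 0.05 * loan_duration - 0.01 * age."""
    body = ast.BinOp(
        left=ast.BinOp(ast.Constant(0.05), ast.Mult(),
                       ast.Name("loan_duration", ast.Load())),
        op=ast.Sub(),
        right=ast.BinOp(ast.Constant(0.01), ast.Mult(),
                        ast.Name("age", ast.Load())),
    )
    # compile() requires line/column info on every node.
    return ast.fix_missing_locations(ast.Expression(body=body))


class SubToAdd(ast.NodeTransformer):
    """Illustrative mutation (an assumption): turn each subtraction into an addition."""

    def visit_BinOp(self, node: ast.BinOp) -> ast.BinOp:
        self.generic_visit(node)  # mutate child nodes first
        if isinstance(node.op, ast.Sub):
            node.op = ast.Add()
        return node


def mutate(tree: ast.Expression) -> ast.Expression:
    """Return a mutated copy, leaving the parent formula untouched."""
    mutated = SubToAdd().visit(copy.deepcopy(tree))
    return ast.fix_missing_locations(mutated)


def compile_formula(tree: ast.Expression):
    """Compile the tree into a callable score(age, loan_duration)."""
    code = compile(tree, filename="<formula>", mode="eval")

    def score(age: float, loan_duration: float) -> float:
        variables = {"age": age, "loan_duration": loan_duration}
        return eval(code, {"__builtins__": {}}, variables)

    return score


def classify(score, age: float, loan_duration: float, threshold: float) -> int:
    """Threshold the score into the 0 ("good") / 1 ("bad") response."""
    return 1 if score(age, loan_duration) > threshold else 0


if __name__ == "__main__":
    parent = build_candidate()
    child = mutate(parent)
    for name, tree in (("parent", parent), ("child", child)):
        label = classify(compile_formula(tree), age=30, loan_duration=24, threshold=1.0)
        print(name, ast.unparse(tree), "->", label)
```

Because each candidate remains an ordinary tree, it can be printed back as a readable formula (here via `ast.unparse`, available from Python 3.9), which is the property that sets this approach apart from black-box models.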