{"title":"基于大数据的房地产估值","authors":"M. Mamedli, A. V. Umnov","doi":"10.32609/0042-8736-2022-12-118-136","DOIUrl":null,"url":null,"abstract":"The paper considers the application of the web scrapping and machine learning algorithms for the assessment of the real estate price on the secondary housing market in Moscow. For this, we collect and process the data from the CIAN website and the data from “Reforma GKH”. To evaluate real estate objects, we consider such machine learning algorithms as Elastic Net, Random Forest and Gradient Boosting. We also apply Shapley vector-based approach to interpret the results of the black-box algorithms. The results suggest that the use of black-box algorithms in assessing the price of apartments on the Moscow secondary housing market allows to obtain more accurate price estimates both for different price segments and for the sample as a whole. At the same time, Gradient Boosting has demonstrated the best accuracy among other algorithms. Interpretation based on the Shapley vector shows that the total area, year of construction, ceiling height, renovation, as well as monolithic construction technology had a positive effect on the price. The price is negatively affected by the number of floors in the house, the possibility of mortgage and lack of repairs. Developed methodology can be applied in real estate insurance, mortgage, determination of cadastral value of real estate and others.","PeriodicalId":45534,"journal":{"name":"Voprosy Ekonomiki","volume":" ","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Real estate valuation based on big data\",\"authors\":\"M. Mamedli, A. V. Umnov\",\"doi\":\"10.32609/0042-8736-2022-12-118-136\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper considers the application of the web scrapping and machine learning algorithms for the assessment of the real estate price on the secondary housing market in Moscow. For this, we collect and process the data from the CIAN website and the data from “Reforma GKH”. To evaluate real estate objects, we consider such machine learning algorithms as Elastic Net, Random Forest and Gradient Boosting. We also apply Shapley vector-based approach to interpret the results of the black-box algorithms. The results suggest that the use of black-box algorithms in assessing the price of apartments on the Moscow secondary housing market allows to obtain more accurate price estimates both for different price segments and for the sample as a whole. At the same time, Gradient Boosting has demonstrated the best accuracy among other algorithms. Interpretation based on the Shapley vector shows that the total area, year of construction, ceiling height, renovation, as well as monolithic construction technology had a positive effect on the price. The price is negatively affected by the number of floors in the house, the possibility of mortgage and lack of repairs. Developed methodology can be applied in real estate insurance, mortgage, determination of cadastral value of real estate and others.\",\"PeriodicalId\":45534,\"journal\":{\"name\":\"Voprosy Ekonomiki\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2022-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Voprosy Ekonomiki\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.32609/0042-8736-2022-12-118-136\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Voprosy Ekonomiki","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32609/0042-8736-2022-12-118-136","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ECONOMICS","Score":null,"Total":0}
The paper considers the application of the web scrapping and machine learning algorithms for the assessment of the real estate price on the secondary housing market in Moscow. For this, we collect and process the data from the CIAN website and the data from “Reforma GKH”. To evaluate real estate objects, we consider such machine learning algorithms as Elastic Net, Random Forest and Gradient Boosting. We also apply Shapley vector-based approach to interpret the results of the black-box algorithms. The results suggest that the use of black-box algorithms in assessing the price of apartments on the Moscow secondary housing market allows to obtain more accurate price estimates both for different price segments and for the sample as a whole. At the same time, Gradient Boosting has demonstrated the best accuracy among other algorithms. Interpretation based on the Shapley vector shows that the total area, year of construction, ceiling height, renovation, as well as monolithic construction technology had a positive effect on the price. The price is negatively affected by the number of floors in the house, the possibility of mortgage and lack of repairs. Developed methodology can be applied in real estate insurance, mortgage, determination of cadastral value of real estate and others.