Xiang Cai, Jia-jun Wan, Ying-Ying Jiang, Nan Zhou, Lei Wang, Chen-Meng Wu, Ye Tian
{"title":"通过机器学习识别环境信息披露操纵行为","authors":"Xiang Cai, Jia-jun Wan, Ying-Ying Jiang, Nan Zhou, Lei Wang, Chen-Meng Wu, Ye Tian","doi":"10.1007/s10668-024-05340-8","DOIUrl":null,"url":null,"abstract":"<p>Corporate environmental information disclosure manipulation (EIDM) has a high level of concealment, which brings great challenges to the identification and judgment of manipulation behavior. Compared to traditional methods, machine learning techniques excel in handling large and complex datasets while achieving higher accuracy. This research applies machine learning techniques to construct the identification model of EIDM behavior and carry out the identification research of EIDM behavior. Based on the “public pressure” theory, the detection indicators will be improved from three aspects: public pressure, corporate governance, and financial indicators. By combining the collected environmental pollution penalty cases of Chinese listed companies from 2011 to 2020 with a pressure pool indicator system, we establish a training set and a test set to compare the identification ability of the logistic regression (LR), decision tree (DT), Support Vector Machine (SVM), Backpropagation (BP) Neural Network, and random forest (RF) models. Additionally, during the initial phase of model training, hyperparameter tuning is conducted across these models to ensure the maximization of their performance. For imbalanced data, after comparing the two oversampling techniques of the Borderline synthetic minority oversampling technique (Borderline SMOTE) and adaptive synthetic sampling (ADASYN), our study indicates that the Borderline SMOTE model has a better recognition effect than ADASYN and that the Borderline SMOTE-RF model is superior to the LR, DT, BP, and SVM models. We hope that our research can provide a reference for regulatory authorities, accelerate the improvement of the mandatory environmental information disclosure (EID) system of listed companies, improve the identification and early warning capabilities of EIDM, and promote the improvement of EID quality.</p>","PeriodicalId":540,"journal":{"name":"Environment, Development and Sustainability","volume":null,"pages":null},"PeriodicalIF":4.7000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying environmental information disclosure manipulation behavior via machine learning\",\"authors\":\"Xiang Cai, Jia-jun Wan, Ying-Ying Jiang, Nan Zhou, Lei Wang, Chen-Meng Wu, Ye Tian\",\"doi\":\"10.1007/s10668-024-05340-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Corporate environmental information disclosure manipulation (EIDM) has a high level of concealment, which brings great challenges to the identification and judgment of manipulation behavior. Compared to traditional methods, machine learning techniques excel in handling large and complex datasets while achieving higher accuracy. This research applies machine learning techniques to construct the identification model of EIDM behavior and carry out the identification research of EIDM behavior. Based on the “public pressure” theory, the detection indicators will be improved from three aspects: public pressure, corporate governance, and financial indicators. By combining the collected environmental pollution penalty cases of Chinese listed companies from 2011 to 2020 with a pressure pool indicator system, we establish a training set and a test set to compare the identification ability of the logistic regression (LR), decision tree (DT), Support Vector Machine (SVM), Backpropagation (BP) Neural Network, and random forest (RF) models. Additionally, during the initial phase of model training, hyperparameter tuning is conducted across these models to ensure the maximization of their performance. For imbalanced data, after comparing the two oversampling techniques of the Borderline synthetic minority oversampling technique (Borderline SMOTE) and adaptive synthetic sampling (ADASYN), our study indicates that the Borderline SMOTE model has a better recognition effect than ADASYN and that the Borderline SMOTE-RF model is superior to the LR, DT, BP, and SVM models. We hope that our research can provide a reference for regulatory authorities, accelerate the improvement of the mandatory environmental information disclosure (EID) system of listed companies, improve the identification and early warning capabilities of EIDM, and promote the improvement of EID quality.</p>\",\"PeriodicalId\":540,\"journal\":{\"name\":\"Environment, Development and Sustainability\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environment, Development and Sustainability\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s10668-024-05340-8\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment, Development and Sustainability","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10668-024-05340-8","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Identifying environmental information disclosure manipulation behavior via machine learning
Corporate environmental information disclosure manipulation (EIDM) has a high level of concealment, which brings great challenges to the identification and judgment of manipulation behavior. Compared to traditional methods, machine learning techniques excel in handling large and complex datasets while achieving higher accuracy. This research applies machine learning techniques to construct the identification model of EIDM behavior and carry out the identification research of EIDM behavior. Based on the “public pressure” theory, the detection indicators will be improved from three aspects: public pressure, corporate governance, and financial indicators. By combining the collected environmental pollution penalty cases of Chinese listed companies from 2011 to 2020 with a pressure pool indicator system, we establish a training set and a test set to compare the identification ability of the logistic regression (LR), decision tree (DT), Support Vector Machine (SVM), Backpropagation (BP) Neural Network, and random forest (RF) models. Additionally, during the initial phase of model training, hyperparameter tuning is conducted across these models to ensure the maximization of their performance. For imbalanced data, after comparing the two oversampling techniques of the Borderline synthetic minority oversampling technique (Borderline SMOTE) and adaptive synthetic sampling (ADASYN), our study indicates that the Borderline SMOTE model has a better recognition effect than ADASYN and that the Borderline SMOTE-RF model is superior to the LR, DT, BP, and SVM models. We hope that our research can provide a reference for regulatory authorities, accelerate the improvement of the mandatory environmental information disclosure (EID) system of listed companies, improve the identification and early warning capabilities of EIDM, and promote the improvement of EID quality.
期刊介绍:
Environment, Development and Sustainability is an international and multidisciplinary journal covering all aspects of the environmental impacts of socio-economic development. It is also concerned with the complex interactions which occur between development and environment, and its purpose is to seek ways and means for achieving sustainability in all human activities aimed at such development. The subject matter of the journal includes the following and related issues:
-mutual interactions among society, development and environment, and their implications for sustainable development
-technical, economic, ethical and philosophical aspects of sustainable development
-global sustainability - the obstacles and ways in which they could be overcome
-local and regional sustainability initiatives, their practical implementation, and relevance for use in a wider context
-development and application of indicators of sustainability
-development, verification, implementation and monitoring of policies for sustainable development
-sustainable use of land, water, energy and biological resources in development
-impacts of agriculture and forestry activities on soil and aquatic ecosystems and biodiversity
-effects of energy use and global climate change on development and sustainability
-impacts of population growth and human activities on food and other essential resources for development
-role of national and international agencies, and of international aid and trade arrangements in sustainable development
-social and cultural contexts of sustainable development
-role of education and public awareness in sustainable development
-role of political and economic instruments in sustainable development
-shortcomings of sustainable development and its alternatives.