首页 > 最新文献

AI, Machine Learning and Applications最新文献

英文 中文
Data Management for Trading, Risk and Regulatory Compliance in Investment Banking 投资银行交易、风险和监管合规数据管理
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140213
Hemendra Vyas
Data is growing enormously across all industries, banking and financial institutions are no exception. Financial organizations are increasingly interested in effectively managing and using day to day data to make business decisions and complying with new and existing regulations. There are general regulatory requirements for data retention of up to 7 years which makes the overall data management process challenging. To overcome this challenge banks and financial institutes rely on regular data backups of individual applications. With new regulations such as Fundamental Review of Trading Books being implemented in 2023-24, which impact multiple areas of bank, there is an immediate need for a centralized database to handle big data. In this paper author proposes a big data platform for a typical investment bank which can unify the data needs of Trading, Market Risk, Credit Risk, Counterparty Risk, Enterprise Risk Management and Model Risk Management and help with regulatory compliance.
各行各业的数据都在大幅增长,银行和金融机构也不例外。金融组织越来越重视有效管理和使用日常数据,以制定业务决策并遵守新的和现有的法规。一般法规要求数据保留长达 7 年,这使得整个数据管理过程充满挑战。为了克服这一挑战,银行和金融机构需要定期对个别应用程序进行数据备份。随着《交易账簿基本审查》等新法规将于 2023-24 年实施,这些法规将对银行的多个领域产生影响,因此迫切需要一个集中式数据库来处理大数据。在本文中,作者为一家典型的投资银行提出了一个大数据平台,该平台可以统一交易、市场风险、信用风险、交易对手风险、企业风险管理和模型风险管理的数据需求,并有助于监管合规。
{"title":"Data Management for Trading, Risk and Regulatory Compliance in Investment Banking","authors":"Hemendra Vyas","doi":"10.5121/csit.2024.140213","DOIUrl":"https://doi.org/10.5121/csit.2024.140213","url":null,"abstract":"Data is growing enormously across all industries, banking and financial institutions are no exception. Financial organizations are increasingly interested in effectively managing and using day to day data to make business decisions and complying with new and existing regulations. There are general regulatory requirements for data retention of up to 7 years which makes the overall data management process challenging. To overcome this challenge banks and financial institutes rely on regular data backups of individual applications. With new regulations such as Fundamental Review of Trading Books being implemented in 2023-24, which impact multiple areas of bank, there is an immediate need for a centralized database to handle big data. In this paper author proposes a big data platform for a typical investment bank which can unify the data needs of Trading, Market Risk, Credit Risk, Counterparty Risk, Enterprise Risk Management and Model Risk Management and help with regulatory compliance.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"153 1-3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140492462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Machine Learning-Based Heart Murmur Detection and Classification using Sound Feature Analysis 利用声音特征分析进行基于机器学习的心脏杂音检测和分类新方法
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140206
Ram Sivaraman, Joe Xiao
An electrocardiogram (ECG) is a common method used for diagnosis of heart diseases. ECG is not sufficient to detect heart abnormalities early. Heart sound monitoring or phonocardiogram (PCG) is a non-invasive assessment that can be performed during routine exams. PCG can provide valuable details for both heart disorder diagnosis as well as any perioperative cardiac monitoring. Further, heart murmurs are abnormal signals generated by turbulent blood flow in the heart and are closely associated with specific heart diseases. This paper presents a new machine learning-based heart sounds evaluation for murmurs with high accuracy. A random forest classifier is built using the statistical moments of the coefficients extracted from the heart sounds. The classifier can predict the location of the heart sounds with over 90% accuracy. The random forest classifier has a murmur detection accuracy of over 70% for test dataset and detects with over 98% accuracy for the full dataset.
心电图(ECG)是诊断心脏疾病的常用方法。心电图不足以早期发现心脏异常。心音监测或心音图(PCG)是一种非侵入性评估,可在常规检查中进行。PCG 可为心脏疾病诊断和任何围手术期心脏监测提供有价值的细节。此外,心脏杂音是由心脏内紊乱的血流产生的异常信号,与特定的心脏疾病密切相关。本文提出了一种新的基于机器学习的心脏杂音评估方法,具有很高的准确性。本文利用从心音中提取的系数的统计矩建立了一个随机森林分类器。该分类器预测心音位置的准确率超过 90%。随机森林分类器对测试数据集的杂音检测准确率超过 70%,对全部数据集的检测准确率超过 98%。
{"title":"A Novel Machine Learning-Based Heart Murmur Detection and Classification using Sound Feature Analysis","authors":"Ram Sivaraman, Joe Xiao","doi":"10.5121/csit.2024.140206","DOIUrl":"https://doi.org/10.5121/csit.2024.140206","url":null,"abstract":"An electrocardiogram (ECG) is a common method used for diagnosis of heart diseases. ECG is not sufficient to detect heart abnormalities early. Heart sound monitoring or phonocardiogram (PCG) is a non-invasive assessment that can be performed during routine exams. PCG can provide valuable details for both heart disorder diagnosis as well as any perioperative cardiac monitoring. Further, heart murmurs are abnormal signals generated by turbulent blood flow in the heart and are closely associated with specific heart diseases. This paper presents a new machine learning-based heart sounds evaluation for murmurs with high accuracy. A random forest classifier is built using the statistical moments of the coefficients extracted from the heart sounds. The classifier can predict the location of the heart sounds with over 90% accuracy. The random forest classifier has a murmur detection accuracy of over 70% for test dataset and detects with over 98% accuracy for the full dataset.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"55 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140492904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Latent Dirichlet Allocation to Explore the Dimensionality of the U.S. Practice of Law 利用潜在德里希勒分配探索美国法律实践的维度
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140204
Patrick H. Gaughan, En Cheng, Taylor C. Burgess, Aine C. Bolton
Over the centuries, the U.S. practice of law has evolved into a complex and amorphous profession. To facilitate improved analysis and understanding, this exploratory study seeks to partition law practice areas into meaningful subgroups. The study applies Latent Dirichlet Allocation (“LDA”) as a soft clustering method to 437,210 individual U.S. lawyer profiles in private practice in 2000. The profiles came from a nationally recognized directory. The resulting subgroupings contain terms consistent with the hypothesized relationships. The results also suggest the possibility of systematically binning individual practice areas into discrete practice area distributions. As such, this study makes contributions to the existing literature in at least three areas: 1) it provides support for the existence of the hypothesized law practice relationships; 2) it provides an empirical basis for developing an improved measurement of the U.S. practice of law; and 3) this study also suggests additional research to advance the field.
几个世纪以来,美国的法律实践已发展成为一个复杂而无定形的行业。为了便于更好地分析和理解,本探索性研究试图将法律实践领域划分为有意义的子群。本研究对 2000 年美国私人执业律师的 437,210 份个人档案采用了 Latent Dirichlet Allocation("LDA")作为软聚类方法。这些资料来自一个全国公认的目录。结果显示,分组包含的术语与假设的关系一致。研究结果还表明,有可能将个别执业领域系统地划分为离散的执业领域分布。因此,本研究至少在三个方面对现有文献做出了贡献:1)它为假设的法律实务关系的存在提供了支持;2)它为制定改进的美国法律实务测量方法提供了实证基础;3)本研究还为推进该领域的研究提出了更多建议。
{"title":"Using Latent Dirichlet Allocation to Explore the Dimensionality of the U.S. Practice of Law","authors":"Patrick H. Gaughan, En Cheng, Taylor C. Burgess, Aine C. Bolton","doi":"10.5121/csit.2024.140204","DOIUrl":"https://doi.org/10.5121/csit.2024.140204","url":null,"abstract":"Over the centuries, the U.S. practice of law has evolved into a complex and amorphous profession. To facilitate improved analysis and understanding, this exploratory study seeks to partition law practice areas into meaningful subgroups. The study applies Latent Dirichlet Allocation (“LDA”) as a soft clustering method to 437,210 individual U.S. lawyer profiles in private practice in 2000. The profiles came from a nationally recognized directory. The resulting subgroupings contain terms consistent with the hypothesized relationships. The results also suggest the possibility of systematically binning individual practice areas into discrete practice area distributions. As such, this study makes contributions to the existing literature in at least three areas: 1) it provides support for the existence of the hypothesized law practice relationships; 2) it provides an empirical basis for developing an improved measurement of the U.S. practice of law; and 3) this study also suggests additional research to advance the field.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"59 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140492758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Salience-Based Multi-Document Summarization Performance using a Hybrid Sentence Similarity Measure 使用混合句子相似度量提高基于显著性的多文档摘要性能
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140202
Kamal Sarkar, S. Chowdhury
The process of creating a single summary from a group of related text documents obtained from many sources is known as multi-document summarization. The efficacy of a multidocument summarization system is heavily reliant upon the sentence similarity metric employed to eliminate redundant sentences from the summary, given that the documents may contain redundant information. The sentence similarity measure is also crucial for a graph-based multi-document summarization, where the presence of an edge between two phrases is decided by how similar the two sentences are to one another. To enhance multi-document summarization performance, this study provides a new method for defining a hybrid sentence similarity measure combining a lexical similarity measure and a BERT-based semantic similarity measure. Tests conducted on the benchmark datasets demonstrate how well the proposed hybrid sentence similarity metric is effective for enhancing multi-document summarization performance.
从多个来源获得的一组相关文本文档中创建单一摘要的过程被称为多文档摘要。由于文档可能包含冗余信息,因此多文档摘要系统的功效在很大程度上取决于为消除摘要中的冗余句子而采用的句子相似度量。句子相似度度量对于基于图的多文档摘要也至关重要,因为两个短语之间是否存在边是由这两个句子的相似程度决定的。为了提高多文档摘要的性能,本研究提供了一种定义混合句子相似性度量的新方法,该方法结合了词汇相似性度量和基于 BERT 的语义相似性度量。在基准数据集上进行的测试表明了所提出的混合句子相似度量在提高多文档摘要性能方面的有效性。
{"title":"Improving Salience-Based Multi-Document Summarization Performance using a Hybrid Sentence Similarity Measure","authors":"Kamal Sarkar, S. Chowdhury","doi":"10.5121/csit.2024.140202","DOIUrl":"https://doi.org/10.5121/csit.2024.140202","url":null,"abstract":"The process of creating a single summary from a group of related text documents obtained from many sources is known as multi-document summarization. The efficacy of a multidocument summarization system is heavily reliant upon the sentence similarity metric employed to eliminate redundant sentences from the summary, given that the documents may contain redundant information. The sentence similarity measure is also crucial for a graph-based multi-document summarization, where the presence of an edge between two phrases is decided by how similar the two sentences are to one another. To enhance multi-document summarization performance, this study provides a new method for defining a hybrid sentence similarity measure combining a lexical similarity measure and a BERT-based semantic similarity measure. Tests conducted on the benchmark datasets demonstrate how well the proposed hybrid sentence similarity metric is effective for enhancing multi-document summarization performance.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"11 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140492233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrative Sentiment Analysis: Leveraging Audio, Visual, and Textual Data 综合情感分析:利用音频、视觉和文本数据
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140211
Jason S. Chu, Sindhu Ghanta
Exploring the area of multimodal sentiment analysis, this paper addresses the growing significance of this field, driven by the exponential rise in multimodal data across platforms like YouTube. Traditional sentiment analysis, primarily focused on textual data, often overlooks the complexities and nuances of human emotions conveyed through audio and visual cues. Addressing this gap, our study explores a comprehensive approach that integrates data from text, audio, and images, applying state-of-the-art machine learning and deep learning techniques tailored to each modality. Our methodology is tested on the CMU-MOSEI dataset, a multimodal collection from YouTube, offering a diverse range of human sentiments. Our research highlights the limitations of conventional text-based sentiment analysis, especially in the context of the intricate expressions of sentiment that multimodal data encapsulates. By fusing audio and visual information with textual analysis, we aim to capture a more complete spectrum of human emotions. Our experimental results demonstrate notable improvements in precision, recall and accuracy for emotion prediction, validating the efficacy of our multimodal approach over single-modality methods. This study not only contributes to the ongoing advancements in sentiment analysis but also underscores the potential of multimodal approaches in providing more accurate and nuanced interpretations of human emotions.
随着 YouTube 等平台上的多模态数据呈指数级增长,多模态情感分析领域的重要性日益凸显。传统的情感分析主要侧重于文本数据,往往忽略了通过音频和视觉线索传达的人类情感的复杂性和细微差别。为了弥补这一不足,我们的研究探索了一种综合方法,该方法整合了文本、音频和图像数据,并针对每种模式应用了最先进的机器学习和深度学习技术。我们的方法在 CMU-MOSEI 数据集上进行了测试,该数据集是来自 YouTube 的多模态集合,提供了各种人类情感。我们的研究凸显了传统基于文本的情感分析的局限性,尤其是在多模态数据所包含的错综复杂的情感表达背景下。通过将音频和视频信息与文本分析相结合,我们旨在捕捉更全面的人类情感。我们的实验结果表明,情感预测的精确度、召回率和准确率都有显著提高,这验证了我们的多模态方法比单模态方法更有效。这项研究不仅推动了情感分析领域的不断进步,还凸显了多模态方法在提供更准确、更细致的人类情感解读方面的潜力。
{"title":"Integrative Sentiment Analysis: Leveraging Audio, Visual, and Textual Data","authors":"Jason S. Chu, Sindhu Ghanta","doi":"10.5121/csit.2024.140211","DOIUrl":"https://doi.org/10.5121/csit.2024.140211","url":null,"abstract":"Exploring the area of multimodal sentiment analysis, this paper addresses the growing significance of this field, driven by the exponential rise in multimodal data across platforms like YouTube. Traditional sentiment analysis, primarily focused on textual data, often overlooks the complexities and nuances of human emotions conveyed through audio and visual cues. Addressing this gap, our study explores a comprehensive approach that integrates data from text, audio, and images, applying state-of-the-art machine learning and deep learning techniques tailored to each modality. Our methodology is tested on the CMU-MOSEI dataset, a multimodal collection from YouTube, offering a diverse range of human sentiments. Our research highlights the limitations of conventional text-based sentiment analysis, especially in the context of the intricate expressions of sentiment that multimodal data encapsulates. By fusing audio and visual information with textual analysis, we aim to capture a more complete spectrum of human emotions. Our experimental results demonstrate notable improvements in precision, recall and accuracy for emotion prediction, validating the efficacy of our multimodal approach over single-modality methods. This study not only contributes to the ongoing advancements in sentiment analysis but also underscores the potential of multimodal approaches in providing more accurate and nuanced interpretations of human emotions.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"97 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140492634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
COVBERT: Enhancing Sentiment Analysis Accuracy in COVID-19 X Data through Customized BERT COVERT:通过定制 BERT 提高 COVID-19 X 数据的情感分析准确性
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140212
Vanshaj Gupta, Jaydeep Patel, Safa Shubbar, Kambiz Ghazinour
In a time when social media information is a valuable resource for gaining insights, the COVID-19 pandemic has released a flood of public sentiment, abundant with unstructured text data. This paper introduces CovBERT, a novel adaptation of the BERT model, specifically honed for the nuanced analysis of COVID-19-related discourse on X (formerly Twitter). CovBERT stands out by incorporating a bespoke vocabulary, meticulously curated from pandemic-centric tweets, resulting in a remarkable leap in sentiment analysis accuracy—from the baseline 72% to an impressive 78.64%. This paper not only presents a detailed comparison of CovBERT with the standard BERT model but also juxtaposes it against traditional machine learning approaches, showcasing its superior proficiency in decoding complex emotional undercurrents in social media data. Furthermore, the integration of geolocation analysis pipeline adds another layer of depth, offering a panoramic view of global sentiment trends.
在社交媒体信息成为获取洞察力的宝贵资源的时代,COVID-19 大流行释放出大量非结构化文本数据的公众情绪。本文介绍了 CovBERT,它是 BERT 模型的一种新颖改良,专门用于对 X(原 Twitter)上与 COVID-19 相关的言论进行细致入微的分析。CovBERT 通过从以流行病为中心的推文中精心筛选出的定制词汇而脱颖而出,从而在情感分析准确率方面实现了显著飞跃--从基线的 72% 提高到了令人印象深刻的 78.64%。本文不仅对 CovBERT 与标准 BERT 模型进行了详细比较,还将其与传统机器学习方法进行了对比,展示了 CovBERT 在解码社交媒体数据中复杂情绪暗流方面的卓越能力。此外,地理位置分析管道的整合增加了另一层深度,提供了全球情感趋势的全景视图。
{"title":"COVBERT: Enhancing Sentiment Analysis Accuracy in COVID-19 X Data through Customized BERT","authors":"Vanshaj Gupta, Jaydeep Patel, Safa Shubbar, Kambiz Ghazinour","doi":"10.5121/csit.2024.140212","DOIUrl":"https://doi.org/10.5121/csit.2024.140212","url":null,"abstract":"In a time when social media information is a valuable resource for gaining insights, the COVID-19 pandemic has released a flood of public sentiment, abundant with unstructured text data. This paper introduces CovBERT, a novel adaptation of the BERT model, specifically honed for the nuanced analysis of COVID-19-related discourse on X (formerly Twitter). CovBERT stands out by incorporating a bespoke vocabulary, meticulously curated from pandemic-centric tweets, resulting in a remarkable leap in sentiment analysis accuracy—from the baseline 72% to an impressive 78.64%. This paper not only presents a detailed comparison of CovBERT with the standard BERT model but also juxtaposes it against traditional machine learning approaches, showcasing its superior proficiency in decoding complex emotional undercurrents in social media data. Furthermore, the integration of geolocation analysis pipeline adds another layer of depth, offering a panoramic view of global sentiment trends.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"3 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140492310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building a Robust Federated Learning based Intrusion Detection System in Internet of Things 在物联网中构建基于联盟学习的鲁棒入侵检测系统
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140201
Afrooz Rahmati, Afra Mashhadi, Geethapriya Thamilarasu
The Internet of Things (IoT) has emerged as the next big technological revolution in recent years with the potential to transform every sphere of human life. As devices, applications, and communication networks become increasingly connected and integrated, security and privacy concerns in IoT are growing at an alarming rate as well. While existing research has largely focused on centralized systems to detect security attacks, these systems do not scale well with the rapid growth of IoT devices and pose a single-point of failure risk. Furthermore, since data is extensively dispersed across huge networks of connected devices, decentralized computing is critical. Federated learning (FL) systems in the recent times has gained popularity as the distributed machine learning model that enables IoT edge devices to collaboratively train models in a decentralized manner while ensuring that data on a user’s device stays private without the contents or details of that data ever leaving that device. In this paper, we propose a federated learning based intrusion detection system using LSTM Autoencoder. The proposed technique allows IoT devices to train a global model without revealing their private data, enabling the training model to grow in size while protecting each participants local data. We conduct extensive experiments using the BoT-IoT data set and demonstrate that our solution can not only effectively improve IoT security against unknown attacks but also ensure users data privacy.
近年来,物联网(IoT)已成为下一场重大技术革命,有可能改变人类生活的各个领域。随着设备、应用和通信网络的连接和集成度越来越高,物联网的安全和隐私问题也在以惊人的速度增长。虽然现有的研究主要集中在检测安全攻击的集中式系统上,但这些系统并不能很好地应对物联网设备的快速增长,而且会带来单点故障风险。此外,由于数据广泛分散在巨大的联网设备网络中,因此分散计算至关重要。近来,联邦学习(FL)系统作为分布式机器学习模型广受欢迎,它能让物联网边缘设备以分散的方式协作训练模型,同时确保用户设备上的数据保持私密,数据内容或细节不会离开该设备。在本文中,我们利用 LSTM Autoencoder 提出了一种基于联合学习的入侵检测系统。所提出的技术允许物联网设备在不泄露其私人数据的情况下训练一个全局模型,从而使训练模型的规模不断扩大,同时保护每个参与者的本地数据。我们使用 BoT-IoT 数据集进行了大量实验,证明我们的解决方案不仅能有效提高物联网安全性,抵御未知攻击,还能确保用户的数据隐私。
{"title":"Building a Robust Federated Learning based Intrusion Detection System in Internet of Things","authors":"Afrooz Rahmati, Afra Mashhadi, Geethapriya Thamilarasu","doi":"10.5121/csit.2024.140201","DOIUrl":"https://doi.org/10.5121/csit.2024.140201","url":null,"abstract":"The Internet of Things (IoT) has emerged as the next big technological revolution in recent years with the potential to transform every sphere of human life. As devices, applications, and communication networks become increasingly connected and integrated, security and privacy concerns in IoT are growing at an alarming rate as well. While existing research has largely focused on centralized systems to detect security attacks, these systems do not scale well with the rapid growth of IoT devices and pose a single-point of failure risk. Furthermore, since data is extensively dispersed across huge networks of connected devices, decentralized computing is critical. Federated learning (FL) systems in the recent times has gained popularity as the distributed machine learning model that enables IoT edge devices to collaboratively train models in a decentralized manner while ensuring that data on a user’s device stays private without the contents or details of that data ever leaving that device. In this paper, we propose a federated learning based intrusion detection system using LSTM Autoencoder. The proposed technique allows IoT devices to train a global model without revealing their private data, enabling the training model to grow in size while protecting each participants local data. We conduct extensive experiments using the BoT-IoT data set and demonstrate that our solution can not only effectively improve IoT security against unknown attacks but also ensure users data privacy.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"48 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140492168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prior-Information Enhanced Reinforcement Learning for Energy Management Systems 能源管理系统的先验信息强化学习
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140207
Th´eo Zangato, A. Osmani, Pegah Alizadeh
Amidst increasing energy demands and growing environmental concerns, the promotion of sustainable and energy-efficient practices has become imperative. This paper introduces a reinforcement learning-based technique for optimizing energy consumption and its associated costs, with a focus on energy management systems. A three-step approach for the efficient management of charging cycles in energy storage units within buildings is presented combining RL with prior knowledge. A unique strategy is adopted: clustering building load curves to discern typical energy consumption patterns, embedding domain knowledge into the learning algorithm to refine the agent’s action space and predicting of future observations to make real-time decisions. We showcase the effectiveness of our method using real-world data. It enables controlled exploration and efficient training of Energy Management System (EMS) agents. When compared to the benchmark, our model reduces energy costs by up to 15%, cutting down consumption during peak periods, and demonstrating adaptability across various building consumption profiles.
面对日益增长的能源需求和日益严重的环境问题,推广可持续发展和节能做法已势在必行。本文介绍了一种基于强化学习的技术,用于优化能源消耗及其相关成本,重点关注能源管理系统。本文提出了一种三步法,将强化学习与先验知识相结合,对建筑物内储能装置的充电周期进行有效管理。我们采用了一种独特的策略:对建筑物负载曲线进行聚类,以识别典型的能源消耗模式;将领域知识嵌入学习算法,以完善代理的行动空间;预测未来观察结果,以做出实时决策。我们利用真实世界的数据展示了我们方法的有效性。它能够对能源管理系统(EMS)代理进行可控的探索和高效的训练。与基准相比,我们的模型降低了高达 15% 的能源成本,减少了高峰期的消耗,并展示了对各种建筑消耗情况的适应性。
{"title":"Prior-Information Enhanced Reinforcement Learning for Energy Management Systems","authors":"Th´eo Zangato, A. Osmani, Pegah Alizadeh","doi":"10.5121/csit.2024.140207","DOIUrl":"https://doi.org/10.5121/csit.2024.140207","url":null,"abstract":"Amidst increasing energy demands and growing environmental concerns, the promotion of sustainable and energy-efficient practices has become imperative. This paper introduces a reinforcement learning-based technique for optimizing energy consumption and its associated costs, with a focus on energy management systems. A three-step approach for the efficient management of charging cycles in energy storage units within buildings is presented combining RL with prior knowledge. A unique strategy is adopted: clustering building load curves to discern typical energy consumption patterns, embedding domain knowledge into the learning algorithm to refine the agent’s action space and predicting of future observations to make real-time decisions. We showcase the effectiveness of our method using real-world data. It enables controlled exploration and efficient training of Energy Management System (EMS) agents. When compared to the benchmark, our model reduces energy costs by up to 15%, cutting down consumption during peak periods, and demonstrating adaptability across various building consumption profiles.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"83 1-2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140491938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised Anomaly Detection 无监督异常检测
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140210
Suliman Alnutefy, Ali Alsuwayh
This research focuses on Unsupervised Anomaly Detection using the "ambient_temperature_system_failure.csv" dataset from Numenta Anomaly Benchmark (NAB). The dataset contains time-series temperature readings from an industrial machine's sensor. The aim is to detect anomalies indicating system failures or aberrant behavior without labeled data. Various algorithms, such as K-means, Gaussian/Elliptic Envelopes, Markov Chain, Isolation Forest, One-Class SVM, and RNNs, are applied to analyze the temperature data. These algorithms are chosen for their ability to identify significant deviations in unlabeled datasets. The study explores how these techniques enhance anomaly understanding in time series data, relevant in manufacturing, healthcare, and finance. This research's novelty lies in employing unsupervised learning techniques on a real-world dataset and understanding theiradaptability in anomaly detection. The results are expected to contribute valuable insights to the field, showcasing the practicality and effectiveness of these algorithms across various scenarios.
本研究的重点是使用 Numenta 异常基准(NAB)中的 "ambient_temperature_system_failure.csv "数据集进行无监督异常检测。该数据集包含来自工业机器传感器的时间序列温度读数。其目的是在没有标记数据的情况下,检测表明系统故障或异常行为的异常数据。各种算法,如 K-means、高斯/椭圆包络、马尔可夫链、隔离林、单类 SVM 和 RNNs,都被用于分析温度数据。之所以选择这些算法,是因为它们能够识别未标记数据集中的重大偏差。研究探讨了这些技术如何增强对时间序列数据异常的理解,这些数据与制造业、医疗保健和金融业息息相关。这项研究的新颖之处在于在真实世界数据集上采用无监督学习技术,并了解其在异常检测中的适应性。研究结果有望为该领域贡献有价值的见解,展示这些算法在各种场景中的实用性和有效性。
{"title":"Unsupervised Anomaly Detection","authors":"Suliman Alnutefy, Ali Alsuwayh","doi":"10.5121/csit.2024.140210","DOIUrl":"https://doi.org/10.5121/csit.2024.140210","url":null,"abstract":"This research focuses on Unsupervised Anomaly Detection using the \"ambient_temperature_system_failure.csv\" dataset from Numenta Anomaly Benchmark (NAB). The dataset contains time-series temperature readings from an industrial machine's sensor. The aim is to detect anomalies indicating system failures or aberrant behavior without labeled data. Various algorithms, such as K-means, Gaussian/Elliptic Envelopes, Markov Chain, Isolation Forest, One-Class SVM, and RNNs, are applied to analyze the temperature data. These algorithms are chosen for their ability to identify significant deviations in unlabeled datasets. The study explores how these techniques enhance anomaly understanding in time series data, relevant in manufacturing, healthcare, and finance. This research's novelty lies in employing unsupervised learning techniques on a real-world dataset and understanding theiradaptability in anomaly detection. The results are expected to contribute valuable insights to the field, showcasing the practicality and effectiveness of these algorithms across various scenarios.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"40 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140492305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Improved mT5 Model for Chinese Text Summary Generation 用于生成中文文本摘要的改进型 mT5 模型
Pub Date : 2024-01-27 DOI: 10.5121/csit.2024.140214
Fuping Ren, Jian Chen, Defu Zhang
Understanding complex policy documents can be challenging, highlighting the need for intelligent interpretation of Chinese policies. To enhance Chinese text summarization, this study utilized the mT5 model as the core framework and initial weights. Additionally, it reduced model size through parameter clipping, employed the Gap Sentence Generation (GSG) method as an unsupervised technique, and enhanced the Chinese tokenizer. After training on a meticulously processed 30GB Chinese training corpus, the study developed the enhanced mT5- GSG model. When fine-tuning on Chinese policy texts, it adopted the "Dropout Twice" approach and ingeniously merged the probability distribution of the two dropouts using the Wasserstein distance. Experimental results indicate that the proposed model achieved Rouge-1, Rouge-2, and Rouge-L scores of 56.13%, 45.76%, and 56.41% respectively on the Chinese policy text summarization dataset.
理解复杂的政策文件可能具有挑战性,因此需要对中文政策进行智能解读。为加强中文文本摘要,本研究利用 mT5 模型作为核心框架和初始权重。此外,它还通过参数裁剪缩小了模型大小,采用了间隙句生成(GSG)方法作为无监督技术,并增强了中文标记符。在对经过精心处理的 30GB 中文训练语料进行训练后,该研究开发出了增强型 mT5- GSG 模型。在对中文政策文本进行微调时,研究采用了 "Dropout Twice "方法,并巧妙地利用 Wasserstein 距离合并了两次 dropout 的概率分布。实验结果表明,在中文政策文本摘要数据集上,所提出的模型分别获得了 56.13%、45.76% 和 56.41% 的 Rouge-1、Rouge-2 和 Rouge-L 分数。
{"title":"An Improved mT5 Model for Chinese Text Summary Generation","authors":"Fuping Ren, Jian Chen, Defu Zhang","doi":"10.5121/csit.2024.140214","DOIUrl":"https://doi.org/10.5121/csit.2024.140214","url":null,"abstract":"Understanding complex policy documents can be challenging, highlighting the need for intelligent interpretation of Chinese policies. To enhance Chinese text summarization, this study utilized the mT5 model as the core framework and initial weights. Additionally, it reduced model size through parameter clipping, employed the Gap Sentence Generation (GSG) method as an unsupervised technique, and enhanced the Chinese tokenizer. After training on a meticulously processed 30GB Chinese training corpus, the study developed the enhanced mT5- GSG model. When fine-tuning on Chinese policy texts, it adopted the \"Dropout Twice\" approach and ingeniously merged the probability distribution of the two dropouts using the Wasserstein distance. Experimental results indicate that the proposed model achieved Rouge-1, Rouge-2, and Rouge-L scores of 56.13%, 45.76%, and 56.41% respectively on the Chinese policy text summarization dataset.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"86 9-10","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140491906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AI, Machine Learning and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1