An Enhanced Machine Learning Approach to Identify Noise and Detect Relevant Structures for Predictive Modeling

M. Uddin
{"title":"An Enhanced Machine Learning Approach to Identify Noise and Detect Relevant Structures for Predictive Modeling","authors":"M. Uddin","doi":"10.1109/ITT59889.2023.10184237","DOIUrl":null,"url":null,"abstract":"The era of big data and social networking platforms have provided great repositories of the data for mining useful information for the real-world industry. However, along with this benefit comes the noise in the data. Generally, noise is the data-set that are redundant, false, bad, and/or outliers. Data cleaning, outlier identification, feature engineering, data slicing, etc. are few of many techniques used traditionally. End goal remains ensuring good data (signal) is not lost in bad data (noise) and less processing cost are incurred to extract useful knowledge out of given big data. This paper presents a follow up progress on existing work of the author in relevance of machine learning algorithms, academic and career data predictions and personality computing. All of that have been initially inspired by potential of useful relationships and data points in unstructured data and thus Noise becomes very relevant and may appear Signal in other contexts and predictors in goal. This proposed model is collectively titled as ‘Noise Removal and Structured Data Detection’ based on inherited parallel processing and unique n-Dimensional training approach. Personality features can be quantified into talent traits, matrix indicating the max/min for relevance factors in the academics/career of nD. The engine internals examine and train the algorithm that it minimizes the x,y co-ordinates and maximizes the z co-ordinate. It records and compares the engine internal metrics and reports it back to engine to further optimize the machine learning process until the optimum results are obtained or do not improve any further.","PeriodicalId":223578,"journal":{"name":"2023 9th International Conference on Information Technology Trends (ITT)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 9th International Conference on Information Technology Trends (ITT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITT59889.2023.10184237","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The era of big data and social networking platforms have provided great repositories of the data for mining useful information for the real-world industry. However, along with this benefit comes the noise in the data. Generally, noise is the data-set that are redundant, false, bad, and/or outliers. Data cleaning, outlier identification, feature engineering, data slicing, etc. are few of many techniques used traditionally. End goal remains ensuring good data (signal) is not lost in bad data (noise) and less processing cost are incurred to extract useful knowledge out of given big data. This paper presents a follow up progress on existing work of the author in relevance of machine learning algorithms, academic and career data predictions and personality computing. All of that have been initially inspired by potential of useful relationships and data points in unstructured data and thus Noise becomes very relevant and may appear Signal in other contexts and predictors in goal. This proposed model is collectively titled as ‘Noise Removal and Structured Data Detection’ based on inherited parallel processing and unique n-Dimensional training approach. Personality features can be quantified into talent traits, matrix indicating the max/min for relevance factors in the academics/career of nD. The engine internals examine and train the algorithm that it minimizes the x,y co-ordinates and maximizes the z co-ordinate. It records and compares the engine internal metrics and reports it back to engine to further optimize the machine learning process until the optimum results are obtained or do not improve any further.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种用于预测建模的增强机器学习方法识别噪声和检测相关结构
大数据和社交网络平台的时代为现实世界的行业挖掘有用的信息提供了巨大的数据存储库。然而,伴随着这些好处而来的是数据中的噪音。通常,噪声是冗余的、错误的、坏的和/或异常值的数据集。数据清洗、离群点识别、特征工程、数据切片等是传统技术中的一小部分。最终目标仍然是确保好的数据(信号)不会在坏数据(噪声)中丢失,并减少从给定的大数据中提取有用知识的处理成本。本文介绍了作者在机器学习算法、学术和职业数据预测以及人格计算相关方面的现有工作的后续进展。所有这些最初都是受非结构化数据中有用关系和数据点的潜力的启发,因此噪声变得非常相关,并且可能在其他上下文和目标预测中出现信号。该模型基于继承的并行处理和独特的n维训练方法,被统称为“噪声去除和结构化数据检测”。人格特征可以量化为人才特征,矩阵表示nD的学业/职业相关因素的最大/最小值。引擎内部检查和训练算法,它最小化x,y坐标和最大化z坐标。它记录并比较发动机内部指标,并将其反馈给发动机,以进一步优化机器学习过程,直到获得最佳结果或不再进一步改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
ITT 2023 Cover Page Smart Irrigation with Recycled Water: A Promising Solution for Sustainable Farming Decoding the Black Box: A Comprehensive Review of Explainable Artificial Intelligence Vaxina: Decentralized Vaccination Tracking System Lightweight Convolutional Network For Automated Photovoltaic Defect Detection
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1