International Conference on Advanced Data Mining and Applications最新文献

英文中文

Being Automated or Not? Risk Identification of Occupations with Graph Neural Networks 自动化与否?基于图神经网络的职业风险识别

International Conference on Advanced Data Mining and Applications

Pub Date : 2022-09-06 DOI: 10.48550/arXiv.2209.02182

Dawei Xu, Haoran Yang, Marian-Andrei Rizoiu, Guandong Xu

. The rapid advances in automation technologies, such as artiﬁcial intelligence (AI) and robotics, pose an increasing risk of automation for occupations, with a likely signiﬁcant impact on the labour market. Recent social-economic studies suggest that nearly 50% of occupations are at high risk of being automated in the next decade. However, the lack of granular data and empirically informed models have limited the accuracy of these studies and made it challenging to predict which jobs will be automated. In this paper, we study the automation risk of occupations by performing a classiﬁcation task between automated and non-automated occupations. The available information is 910 occupations’ task statements, skills and interactions categorised by Standard Occupational Classiﬁcation (SOC). To fully utilize this information, we propose a graph-based semi-supervised classiﬁcation method named A utomated O ccupation C lassiﬁcation based on G raph C onvolutional N etworks ( AOC-GCN ) to identify the automated risk for occupations. This model integrates a heterogeneous graph to capture occupations’ lo-cal and global contexts. The results show that our proposed method outperforms the baseline models by considering the information of both internal features of occupations and their external interactions. This study could help policymakers identify potential automated occupations and support individuals’ decision-making before entering the job market.

．人工智能(AI)和机器人等自动化技术的快速发展，给职业带来了越来越大的自动化风险，可能对劳动力市场产生重大影响。最近的社会经济研究表明，近50%的职业在未来十年被自动化取代的风险很高。然而，缺乏细粒度数据和经验模型限制了这些研究的准确性，并使预测哪些工作将被自动化变得具有挑战性。本文通过在自动化和非自动化职业之间执行分类任务来研究职业的自动化风险。现有信息是910个职业的任务陈述、技能和互动，按标准职业分类(SOC)分类。为了充分利用这些信息，我们提出了一种基于图的半监督分类方法，即基于图C卷积N网络的自动化O职业C分类(AOC-GCN)来识别职业的自动化风险。该模型集成了一个异构图来捕捉职业的本地和全球背景。结果表明，通过考虑职业的内部特征及其外部相互作用的信息，我们提出的方法优于基线模型。这项研究可以帮助决策者识别潜在的自动化职业，并支持个人在进入就业市场之前的决策。

{"title":"Being Automated or Not? Risk Identification of Occupations with Graph Neural Networks","authors":"Dawei Xu, Haoran Yang, Marian-Andrei Rizoiu, Guandong Xu","doi":"10.48550/arXiv.2209.02182","DOIUrl":"https://doi.org/10.48550/arXiv.2209.02182","url":null,"abstract":". The rapid advances in automation technologies, such as artiﬁcial intelligence (AI) and robotics, pose an increasing risk of automation for occupations, with a likely signiﬁcant impact on the labour market. Recent social-economic studies suggest that nearly 50% of occupations are at high risk of being automated in the next decade. However, the lack of granular data and empirically informed models have limited the accuracy of these studies and made it challenging to predict which jobs will be automated. In this paper, we study the automation risk of occupations by performing a classiﬁcation task between automated and non-automated occupations. The available information is 910 occupations’ task statements, skills and interactions categorised by Standard Occupational Classiﬁcation (SOC). To fully utilize this information, we propose a graph-based semi-supervised classiﬁcation method named A utomated O ccupation C lassiﬁcation based on G raph C onvolutional N etworks ( AOC-GCN ) to identify the automated risk for occupations. This model integrates a heterogeneous graph to capture occupations’ lo-cal and global contexts. The results show that our proposed method outperforms the baseline models by considering the information of both internal features of occupations and their external interactions. This study could help policymakers identify potential automated occupations and support individuals’ decision-making before entering the job market.","PeriodicalId":244337,"journal":{"name":"International Conference on Advanced Data Mining and Applications","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133540153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deterministic Graph-Walking Program Mining 确定性图行走程序挖掘

International Conference on Advanced Data Mining and Applications

Pub Date : 2022-08-22 DOI: 10.48550/arXiv.2208.10290

Peter Belcák, Roger Wattenhofer

. Owing to their versatility, graph structures admit representations of intricate relationships between the separate entities compris-ing the data. We formalise the notion of connection between two vertex sets in terms of edge and vertex features by introducing graph-walking programs. We give two algorithms for mining of deterministic graph-walking programs that yield programs in the order of increasing length. These programs characterise linear long-distance relationships between the given two vertex sets in the context of the whole graph.

．由于其通用性，图结构允许表示组成数据的独立实体之间的复杂关系。我们通过引入图行走程序，形式化了两个顶点集之间的连接概念，即边和顶点特征。本文给出了两种确定性图行走程序的挖掘算法，这两种算法产生的程序按长度递增的顺序排列。这些程序描述了在整个图的背景下给定的两个顶点集之间的线性远距离关系。

引用次数: 0

Profit Maximization using Social Networks in Two-Phase Setting 两阶段环境下社会网络的利润最大化

International Conference on Advanced Data Mining and Applications

Pub Date : 2022-07-16 DOI: 10.48550/arXiv.2207.07830

Poonam Sharma, Suman Banerjee

Now-a-days, emph{Online Social Networks} have been predominantly used by commercial houses for viral marketing where the goal is to maximize profit. In this paper, we study the problem of Profit Maximization in the twombox{-}phase setting. The input to the problem is a emph{social network} where the users are associated with a cost and benefit value, and a fixed amount of budget splitted into two parts. Here, the cost and the benefit associated with a node signify its incentive demand and the amount of benefit that can be earned by influencing that user, respectively. The goal of this problem is to find out the optimal seed sets for both phases such that the aggregated profit at the end of the diffusion process is maximized. First, we develop a mathematical model based on the emph{Independent Cascade Model} of diffusion that captures the aggregated profit in an emph{expected} sense. Subsequently, we show that selecting an optimal seed set for the first phase even considering the optimal seed set for the second phase can be selected efficiently, is an $textsf{NP}$-Hard Problem. Next, we propose two solution methodologies, namely the emph{single greedy} and the emph{double greedy} approach for our problem that works based on marginal gain computation. A detailed analysis of both methodologies has been done to understand their time and space requirements. We perform an extensive set of experiments to demonstrate the effectiveness and efficiency of the proposed approaches with real-world datasets. From the experiments, we observe that the proposed solution approaches lead to more profit compared to the baseline methods and in particular, the double greedy approach leads to up to $5 %$ improvement compared to its singlembox{-}phase counterpart.

如今，emph{在线社交网络}主要被商业住宅用于病毒式营销，其目标是实现利润最大化。本文研究了两种mbox{-}阶段设置下的利润最大化问题。问题的输入是emph{社交网络}，其中用户与成本和收益值相关联，并且将固定的预算分成两部分。在这里，与节点相关的成本和收益分别表示其激励需求和通过影响该用户可以获得的收益。该问题的目标是找出两个阶段的最优种子集，使扩散过程结束时的总利润最大化。首先，我们建立了一个基于emph{独立级联模型}扩散的数学模型，该模型在emph{预料之中}意义上捕获了总利润。随后，我们证明了即使考虑了第二阶段的最优种子集，也可以有效地选择第一阶段的最优种子集，这是一个$textsf{NP}$ -难问题。接下来，我们提出两种解决方法，即emph{单身贪婪}和emph{双重贪婪}方法，用于基于边际增益计算的问题。对这两种方法进行了详细的分析，以了解它们的时间和空间要求。我们进行了一组广泛的实验来证明所提出的方法与现实世界数据集的有效性和效率。从实验中，我们观察到，与基线方法相比，所提出的解决方案方法带来了更多的利润，特别是，与单一mbox{-}阶段相比，双贪婪方法带来了$5 %$的改进。

{"title":"Profit Maximization using Social Networks in Two-Phase Setting","authors":"Poonam Sharma, Suman Banerjee","doi":"10.48550/arXiv.2207.07830","DOIUrl":"https://doi.org/10.48550/arXiv.2207.07830","url":null,"abstract":"Now-a-days, emph{Online Social Networks} have been predominantly used by commercial houses for viral marketing where the goal is to maximize profit. In this paper, we study the problem of Profit Maximization in the twombox{-}phase setting. The input to the problem is a emph{social network} where the users are associated with a cost and benefit value, and a fixed amount of budget splitted into two parts. Here, the cost and the benefit associated with a node signify its incentive demand and the amount of benefit that can be earned by influencing that user, respectively. The goal of this problem is to find out the optimal seed sets for both phases such that the aggregated profit at the end of the diffusion process is maximized. First, we develop a mathematical model based on the emph{Independent Cascade Model} of diffusion that captures the aggregated profit in an emph{expected} sense. Subsequently, we show that selecting an optimal seed set for the first phase even considering the optimal seed set for the second phase can be selected efficiently, is an $textsf{NP}$-Hard Problem. Next, we propose two solution methodologies, namely the emph{single greedy} and the emph{double greedy} approach for our problem that works based on marginal gain computation. A detailed analysis of both methodologies has been done to understand their time and space requirements. We perform an extensive set of experiments to demonstrate the effectiveness and efficiency of the proposed approaches with real-world datasets. From the experiments, we observe that the proposed solution approaches lead to more profit compared to the baseline methods and in particular, the double greedy approach leads to up to $5 %$ improvement compared to its singlembox{-}phase counterpart.","PeriodicalId":244337,"journal":{"name":"International Conference on Advanced Data Mining and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129418416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identification of Stock Market Manipulation with Deep Learning 用深度学习识别股票市场操纵

International Conference on Advanced Data Mining and Applications

Pub Date : 2022-02-07 DOI: 10.1007/978-3-030-95405-5_29

Jillian Tallboys, Ye Zhu, S. Rajasegarar

引用次数: 1

Clique percolation method: memory efficient almost exact communities 团渗透法:内存效率高，几乎精确的社团

International Conference on Advanced Data Mining and Applications

Pub Date : 2021-10-04 DOI: 10.1007/978-3-030-95408-6_9

Alexis Baudin, Maximilien Danisch, Sergey Kirgizov, Clémence Magnien, M. Ghanem

引用次数: 2

Signal Classification using Smooth Coefficients of Multiple wavelets 基于多小波平滑系数的信号分类

International Conference on Advanced Data Mining and Applications

Pub Date : 2021-09-21 DOI: 10.1007/978-3-031-22137-8_13

Paul Grant, M. Islam

引用次数: 1

HisRec: Bridging Heterogeneous Information Spaces for Recommendation via Attentive Embedding hrecc:通过关注嵌入桥接异构信息空间的推荐

International Conference on Advanced Data Mining and Applications

Pub Date : 2020-11-12 DOI: 10.1007/978-3-030-65390-3_33

Jingwei Ma, Lei Zhu, Jiahui Wen, Mingyang Zhong

引用次数: 0

Low-Light Image Enhancement with Color Transfer Based on Local Statistical Feature 基于局部统计特征的色彩转移微光图像增强

International Conference on Advanced Data Mining and Applications

Pub Date : 2020-11-12 DOI: 10.1007/978-3-030-65390-3_48

Zhigao Zhang, Bin Wang

引用次数: 0

MSPP: A Highly Efficient and Scalable Algorithm for Mining Similar Pairs of Points MSPP:一种高效、可扩展的相似点对挖掘算法

International Conference on Advanced Data Mining and Applications

Pub Date : 2020-07-31 DOI: 10.1007/978-3-030-65390-3_3

S. Saha, A. Soliman, S. Rajasekaran

引用次数: 0

Single Image Dehazing Algorithm Based on Sky Region Segmentation 基于天空区域分割的单幅图像去雾算法

International Conference on Advanced Data Mining and Applications

Pub Date : 2020-07-10 DOI: 10.1007/978-3-030-35231-8_35

Wei-xiang Li, Wei Jie, S. M. Zadeh

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Conference on Advanced Data Mining and Applications

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀