首页 > 最新文献

Data Science and Engineering最新文献

英文 中文
Efficient Top-k Frequent Itemset Mining on Massive Data 在海量数据上高效挖掘顶-k 频项集
IF 4.2 2区 计算机科学 Q1 Computer Science Pub Date : 2024-02-06 DOI: 10.1007/s41019-024-00241-2
Xiaolong Wan, Xixian Han
{"title":"Efficient Top-k Frequent Itemset Mining on Massive Data","authors":"Xiaolong Wan, Xixian Han","doi":"10.1007/s41019-024-00241-2","DOIUrl":"https://doi.org/10.1007/s41019-024-00241-2","url":null,"abstract":"","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139799346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Where To Go at the Next Timestamp 下一个时间戳的去向
IF 4.2 2区 计算机科学 Q1 Computer Science Pub Date : 2024-01-28 DOI: 10.1007/s41019-023-00240-9
Jiaqi Duan, Xiangfu Meng, Guihong Liu
{"title":"Where To Go at the Next Timestamp","authors":"Jiaqi Duan, Xiangfu Meng, Guihong Liu","doi":"10.1007/s41019-023-00240-9","DOIUrl":"https://doi.org/10.1007/s41019-023-00240-9","url":null,"abstract":"","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139592182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construct and Query A Fine-Grained Geospatial Knowledge Graph 构建和查询细粒度地理空间知识图谱
IF 4.2 2区 计算机科学 Q1 Computer Science Pub Date : 2024-01-22 DOI: 10.1007/s41019-023-00237-4
Bo Wei, Xi Guo, Xiaodi Li, Ziyan Wu, Jing Zhao, Qiping Zou
{"title":"Construct and Query A Fine-Grained Geospatial Knowledge Graph","authors":"Bo Wei, Xi Guo, Xiaodi Li, Ziyan Wu, Jing Zhao, Qiping Zou","doi":"10.1007/s41019-023-00237-4","DOIUrl":"https://doi.org/10.1007/s41019-023-00237-4","url":null,"abstract":"","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139609320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DB-GPT: Large Language Model Meets Database DB-GPT:大型语言模型与数据库的结合
IF 4.2 2区 计算机科学 Q1 Computer Science Pub Date : 2024-01-19 DOI: 10.1007/s41019-023-00235-6
Xuanhe Zhou, Zhaoyan Sun, Guoliang Li
{"title":"DB-GPT: Large Language Model Meets Database","authors":"Xuanhe Zhou, Zhaoyan Sun, Guoliang Li","doi":"10.1007/s41019-023-00235-6","DOIUrl":"https://doi.org/10.1007/s41019-023-00235-6","url":null,"abstract":"","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139612922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explicit Behavior Interaction with Heterogeneous Graph for Multi-behavior Recommendation 利用异构图明确行为交互,实现多行为推荐
IF 4.2 2区 计算机科学 Q1 Computer Science Pub Date : 2024-01-19 DOI: 10.1007/s41019-023-00238-3
Zhongping Zhang, Yin Jia, Yuehan Hou, Xinlu Yu
{"title":"Explicit Behavior Interaction with Heterogeneous Graph for Multi-behavior Recommendation","authors":"Zhongping Zhang, Yin Jia, Yuehan Hou, Xinlu Yu","doi":"10.1007/s41019-023-00238-3","DOIUrl":"https://doi.org/10.1007/s41019-023-00238-3","url":null,"abstract":"","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139613741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncovering Flat and Hierarchical Topics by Community Discovery on Word Co-occurrence Network. 通过词语共现网络上的社群发现揭示扁平和分层主题
IF 4.2 2区 计算机科学 Q1 Computer Science Pub Date : 2024-01-01 Epub Date: 2024-03-13 DOI: 10.1007/s41019-023-00239-2
Eric Austin, Shraddha Makwana, Amine Trabelsi, Christine Largeron, Osmar R Zaïane

Topic modeling aims to discover latent themes in collections of text documents. It has various applications across fields such as sociology, opinion analysis, and media studies. In such areas, it is essential to have easily interpretable, diverse, and coherent topics. An efficient topic modeling technique should accurately identify flat and hierarchical topics, especially useful in disciplines where topics can be logically arranged into a tree format. In this paper, we propose Community Topic, a novel algorithm that exploits word co-occurrence networks to mine communities and produces topics. We also evaluate the proposed approach using several metrics and compare it with usual baselines, confirming its good performances. Community Topic enables quick identification of flat topics and topic hierarchy, facilitating the on-demand exploration of sub- and super-topics. It also obtains good results on datasets in different languages.

主题建模旨在发现文本文档集合中的潜在主题。它在社会学、舆论分析和媒体研究等领域有着广泛的应用。在这些领域,拥有易于解释、多样且连贯的主题至关重要。高效的主题建模技术应能准确识别扁平和分层主题,尤其是在主题可按逻辑排列成树形格式的学科中。在本文中,我们提出了 "社区话题"(Community Topic)这一新型算法,该算法利用词语共现网络挖掘社区并生成话题。我们还使用多个指标对所提出的方法进行了评估,并将其与通常的基线进行了比较,证实了其良好的性能。Community Topic 可以快速识别平面主题和主题层次,便于按需探索子主题和超级主题。它在不同语言的数据集上也取得了良好的效果。
{"title":"Uncovering Flat and Hierarchical Topics by Community Discovery on Word Co-occurrence Network.","authors":"Eric Austin, Shraddha Makwana, Amine Trabelsi, Christine Largeron, Osmar R Zaïane","doi":"10.1007/s41019-023-00239-2","DOIUrl":"10.1007/s41019-023-00239-2","url":null,"abstract":"<p><p>Topic modeling aims to discover latent themes in collections of text documents. It has various applications across fields such as sociology, opinion analysis, and media studies. In such areas, it is essential to have easily interpretable, diverse, and coherent topics. An efficient topic modeling technique should accurately identify flat and hierarchical topics, especially useful in disciplines where topics can be logically arranged into a tree format. In this paper, we propose Community Topic, a novel algorithm that exploits word co-occurrence networks to mine communities and produces topics. We also evaluate the proposed approach using several metrics and compare it with usual baselines, confirming its good performances. Community Topic enables quick identification of flat topics and topic hierarchy, facilitating the on-demand exploration of sub- and super-topics. It also obtains good results on datasets in different languages.</p>","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10980674/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140337633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AIoT-CitySense: AI and IoT-Driven City-Scale Sensing for Roadside Infrastructure Maintenance AIoT-CitySense:人工智能和物联网驱动的城市规模传感技术用于路边基础设施维护
IF 4.2 2区 计算机科学 Q1 Computer Science Pub Date : 2023-12-19 DOI: 10.1007/s41019-023-00236-5
A. Forkan, Yongjin Kang, Felip Martí, Abhik Banerjee, Chris McCarthy, Hadi Ghaderi, Breno Costa, Anas Dawod, Dimitrios Georgakopolous, P. Jayaraman
{"title":"AIoT-CitySense: AI and IoT-Driven City-Scale Sensing for Roadside Infrastructure Maintenance","authors":"A. Forkan, Yongjin Kang, Felip Martí, Abhik Banerjee, Chris McCarthy, Hadi Ghaderi, Breno Costa, Anas Dawod, Dimitrios Georgakopolous, P. Jayaraman","doi":"10.1007/s41019-023-00236-5","DOIUrl":"https://doi.org/10.1007/s41019-023-00236-5","url":null,"abstract":"","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138962482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Anomaly Detection with Sub-Extreme Values: Health Provider Billing 亚极值异常检测:医疗机构账单
IF 4.2 2区 计算机科学 Q1 Computer Science Pub Date : 2023-11-29 DOI: 10.1007/s41019-023-00234-7
Rob Muspratt, Musa Mammadov
{"title":"Anomaly Detection with Sub-Extreme Values: Health Provider Billing","authors":"Rob Muspratt, Musa Mammadov","doi":"10.1007/s41019-023-00234-7","DOIUrl":"https://doi.org/10.1007/s41019-023-00234-7","url":null,"abstract":"","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139211057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph Neural Network-Based Short‑Term Load Forecasting with Temporal Convolution 基于时态卷积的图神经网络短期负荷预测
IF 4.2 2区 计算机科学 Q1 Computer Science Pub Date : 2023-11-20 DOI: 10.1007/s41019-023-00233-8
Chenchen Sun, Yan Ning, Derong Shen, Tiezheng Nie
{"title":"Graph Neural Network-Based Short‑Term Load Forecasting with Temporal Convolution","authors":"Chenchen Sun, Yan Ning, Derong Shen, Tiezheng Nie","doi":"10.1007/s41019-023-00233-8","DOIUrl":"https://doi.org/10.1007/s41019-023-00233-8","url":null,"abstract":"","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":4.2,"publicationDate":"2023-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139259513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint Representation Learning with Generative Adversarial Imputation Network for Improved Classification of Longitudinal Data 基于生成对抗输入网络的联合表示学习改进纵向数据分类
2区 计算机科学 Q1 Computer Science Pub Date : 2023-10-17 DOI: 10.1007/s41019-023-00232-9
Sharon Torao Pingi, Duoyi Zhang, Md Abul Bashar, Richi Nayak
Abstract Generative adversarial networks (GANs) have demonstrated their effectiveness in generating temporal data to fill in missing values, enhancing the classification performance of time series data. Longitudinal datasets encompass multivariate time series data with additional static features that contribute to sample variability over time. These datasets often encounter missing values due to factors such as irregular sampling. However, existing GAN-based imputation methods that address this type of data missingness often overlook the impact of static features on temporal observations and classification outcomes. This paper presents a novel method, fusion-aided imputer-classifier GAN (FaIC-GAN), tailored for longitudinal data classification. FaIC-GAN simultaneously leverages partially observed temporal data and static features to enhance imputation and classification learning. We present four multimodal fusion strategies that effectively extract correlated information from both static and temporal modalities. Our extensive experiments reveal that FaIC-GAN successfully exploits partially observed temporal data and static features, resulting in improved classification accuracy compared to unimodal models. Our post-additive and attention-based multimodal fusion approaches within the FaIC-GAN model consistently rank among the top three methods for classification.
摘要生成对抗网络(GANs)在生成时间数据来填补缺失值,提高时间序列数据的分类性能方面已经证明了其有效性。纵向数据集包含具有额外静态特征的多变量时间序列数据,这些静态特征有助于样本随时间的变化。由于不规则采样等因素,这些数据集经常会遇到缺失值。然而,解决这类数据缺失的现有基于gan的插值方法往往忽略了静态特征对时间观测和分类结果的影响。本文提出了一种专为纵向数据分类而设计的新方法——融合辅助imputer-classifier GAN (FaIC-GAN)。FaIC-GAN同时利用部分观测到的时间数据和静态特征来增强输入和分类学习。我们提出了四种多模态融合策略,有效地从静态和时间模态中提取相关信息。我们的大量实验表明,FaIC-GAN成功地利用了部分观测到的时间数据和静态特征,与单峰模型相比,提高了分类精度。在FaIC-GAN模型中,我们的后加和基于注意力的多模态融合方法一直名列前三种分类方法之列。
{"title":"Joint Representation Learning with Generative Adversarial Imputation Network for Improved Classification of Longitudinal Data","authors":"Sharon Torao Pingi, Duoyi Zhang, Md Abul Bashar, Richi Nayak","doi":"10.1007/s41019-023-00232-9","DOIUrl":"https://doi.org/10.1007/s41019-023-00232-9","url":null,"abstract":"Abstract Generative adversarial networks (GANs) have demonstrated their effectiveness in generating temporal data to fill in missing values, enhancing the classification performance of time series data. Longitudinal datasets encompass multivariate time series data with additional static features that contribute to sample variability over time. These datasets often encounter missing values due to factors such as irregular sampling. However, existing GAN-based imputation methods that address this type of data missingness often overlook the impact of static features on temporal observations and classification outcomes. This paper presents a novel method, fusion-aided imputer-classifier GAN (FaIC-GAN), tailored for longitudinal data classification. FaIC-GAN simultaneously leverages partially observed temporal data and static features to enhance imputation and classification learning. We present four multimodal fusion strategies that effectively extract correlated information from both static and temporal modalities. Our extensive experiments reveal that FaIC-GAN successfully exploits partially observed temporal data and static features, resulting in improved classification accuracy compared to unimodal models. Our post-additive and attention-based multimodal fusion approaches within the FaIC-GAN model consistently rank among the top three methods for classification.","PeriodicalId":52220,"journal":{"name":"Data Science and Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135996121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data Science and Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1