首页 > 最新文献

Data最新文献

英文 中文
Data in Astrophysics and Geophysics: Novel Research and Applications 天体物理学和地球物理学数据:新颖的研究和应用
Pub Date : 2024-02-08 DOI: 10.3390/data9020032
V. Srećković, Milan S. Dimitrijević, Z. Mijić
Rapid development of communication technologies and constant technological improvements as a result of scientific discoveries require the establishment of specific databases [...]
通信技术的迅速发展和科学发现带来的技术不断进步要求建立特定的数据库 [...]
{"title":"Data in Astrophysics and Geophysics: Novel Research and Applications","authors":"V. Srećković, Milan S. Dimitrijević, Z. Mijić","doi":"10.3390/data9020032","DOIUrl":"https://doi.org/10.3390/data9020032","url":null,"abstract":"Rapid development of communication technologies and constant technological improvements as a result of scientific discoveries require the establishment of specific databases [...]","PeriodicalId":502371,"journal":{"name":"Data","volume":" 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139791366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Yinshan Mountains Record over 10,000 Landslides 阴山山脉记录了一万多次滑坡
Pub Date : 2024-02-08 DOI: 10.3390/data9020031
Jingjing Sun, Chong Xu, Liye Feng, Lei Li, Xuewei Zhang, Wentao Yang
China boasts a vast expanse of mountainous terrain, characterized by intricate geological conditions and structural features, resulting in frequent geological disasters. Among these, landslides, as prototypical geological hazards, pose significant threats to both lives and property. Consequently, conducting a comprehensive landslide inventory in mountainous regions is imperative for current research. This study concentrates on the Yinshan Mountains, an ancient fault-block mountain range spanning east–west in the central Inner Mongolia Autonomous Region, extending from Langshan Mountains in the west to Damaqun Mountains in the east, with the narrow sense Xiao–Yin Mountains District in between. Employing multi-temporal high-resolution remote sensing images from Google Earth, this study conducted visual interpretation, identifying 10,968 landslides in the Yinshan area, encompassing a total area of 308.94 km2. The largest landslide occupies 2.95 km2, while the smallest covers 84.47 m2. Specifically, the Langshan area comprises 331 landslides with a total area of 11.96 km2, the narrow sense Xiao–Yin Mountains include 3393 landslides covering 64.13 km2, and the Manhan Mountains, Damaqun Mountains, and adjacent areas account for 7244 landslides over a total area of 232.85 km2. This research not only contributes to global landslide cataloging initiatives but also serves as a robust foundation for future geohazard prevention and management efforts.
中国山地面积广阔,地质条件和结构特点错综复杂,地质灾害频发。其中,山体滑坡作为典型的地质灾害,对生命和财产造成了重大威胁。因此,对山区进行全面的滑坡清查是当前研究的当务之急。本研究以阴山山脉为研究对象,阴山山脉是内蒙古自治区中部一条横跨东西的古老断块山脉,西起狼牙山,东至大马群山,中间是狭义的小阴山区。本研究利用谷歌地球的多时相高分辨率遥感图像进行了目视解译,在阴山地区识别出 10968 个滑坡体,总面积达 308.94 平方公里。最大的滑坡面积为 2.95 平方公里,最小的滑坡面积为 84.47 平方米。具体来说,兰山地区有 331 个滑坡体,总面积为 11.96 平方公里;狭义小阴山地区有 3393 个滑坡体,总面积为 64.13 平方公里;满汉山、大马群山及邻近地区有 7244 个滑坡体,总面积为 232.85 平方公里。这项研究不仅有助于全球滑坡编目工作,还为今后的地质灾害预防和管理工作奠定了坚实的基础。
{"title":"The Yinshan Mountains Record over 10,000 Landslides","authors":"Jingjing Sun, Chong Xu, Liye Feng, Lei Li, Xuewei Zhang, Wentao Yang","doi":"10.3390/data9020031","DOIUrl":"https://doi.org/10.3390/data9020031","url":null,"abstract":"China boasts a vast expanse of mountainous terrain, characterized by intricate geological conditions and structural features, resulting in frequent geological disasters. Among these, landslides, as prototypical geological hazards, pose significant threats to both lives and property. Consequently, conducting a comprehensive landslide inventory in mountainous regions is imperative for current research. This study concentrates on the Yinshan Mountains, an ancient fault-block mountain range spanning east–west in the central Inner Mongolia Autonomous Region, extending from Langshan Mountains in the west to Damaqun Mountains in the east, with the narrow sense Xiao–Yin Mountains District in between. Employing multi-temporal high-resolution remote sensing images from Google Earth, this study conducted visual interpretation, identifying 10,968 landslides in the Yinshan area, encompassing a total area of 308.94 km2. The largest landslide occupies 2.95 km2, while the smallest covers 84.47 m2. Specifically, the Langshan area comprises 331 landslides with a total area of 11.96 km2, the narrow sense Xiao–Yin Mountains include 3393 landslides covering 64.13 km2, and the Manhan Mountains, Damaqun Mountains, and adjacent areas account for 7244 landslides over a total area of 232.85 km2. This research not only contributes to global landslide cataloging initiatives but also serves as a robust foundation for future geohazard prevention and management efforts.","PeriodicalId":502371,"journal":{"name":"Data","volume":"104 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139794279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Yinshan Mountains Record over 10,000 Landslides 阴山山脉记录了一万多次滑坡
Pub Date : 2024-02-08 DOI: 10.3390/data9020031
Jingjing Sun, Chong Xu, Liye Feng, Lei Li, Xuewei Zhang, Wentao Yang
China boasts a vast expanse of mountainous terrain, characterized by intricate geological conditions and structural features, resulting in frequent geological disasters. Among these, landslides, as prototypical geological hazards, pose significant threats to both lives and property. Consequently, conducting a comprehensive landslide inventory in mountainous regions is imperative for current research. This study concentrates on the Yinshan Mountains, an ancient fault-block mountain range spanning east–west in the central Inner Mongolia Autonomous Region, extending from Langshan Mountains in the west to Damaqun Mountains in the east, with the narrow sense Xiao–Yin Mountains District in between. Employing multi-temporal high-resolution remote sensing images from Google Earth, this study conducted visual interpretation, identifying 10,968 landslides in the Yinshan area, encompassing a total area of 308.94 km2. The largest landslide occupies 2.95 km2, while the smallest covers 84.47 m2. Specifically, the Langshan area comprises 331 landslides with a total area of 11.96 km2, the narrow sense Xiao–Yin Mountains include 3393 landslides covering 64.13 km2, and the Manhan Mountains, Damaqun Mountains, and adjacent areas account for 7244 landslides over a total area of 232.85 km2. This research not only contributes to global landslide cataloging initiatives but also serves as a robust foundation for future geohazard prevention and management efforts.
中国山地面积广阔,地质条件和结构特点错综复杂,地质灾害频发。其中,山体滑坡作为典型的地质灾害,对生命和财产造成了重大威胁。因此,对山区进行全面的滑坡清查是当前研究的当务之急。本研究以阴山山脉为研究对象,阴山山脉是内蒙古自治区中部一条横跨东西的古老断块山脉,西起狼牙山,东至大马群山,中间是狭义的小阴山区。本研究利用谷歌地球的多时相高分辨率遥感图像进行了目视解译,在阴山地区识别出 10968 个滑坡体,总面积达 308.94 平方公里。最大的滑坡面积为 2.95 平方公里,最小的滑坡面积为 84.47 平方米。具体来说,兰山地区有 331 个滑坡体,总面积为 11.96 平方公里;狭义小阴山地区有 3393 个滑坡体,总面积为 64.13 平方公里;满汉山、大马群山及邻近地区有 7244 个滑坡体,总面积为 232.85 平方公里。这项研究不仅有助于全球滑坡编目工作,还为今后的地质灾害预防和管理工作奠定了坚实的基础。
{"title":"The Yinshan Mountains Record over 10,000 Landslides","authors":"Jingjing Sun, Chong Xu, Liye Feng, Lei Li, Xuewei Zhang, Wentao Yang","doi":"10.3390/data9020031","DOIUrl":"https://doi.org/10.3390/data9020031","url":null,"abstract":"China boasts a vast expanse of mountainous terrain, characterized by intricate geological conditions and structural features, resulting in frequent geological disasters. Among these, landslides, as prototypical geological hazards, pose significant threats to both lives and property. Consequently, conducting a comprehensive landslide inventory in mountainous regions is imperative for current research. This study concentrates on the Yinshan Mountains, an ancient fault-block mountain range spanning east–west in the central Inner Mongolia Autonomous Region, extending from Langshan Mountains in the west to Damaqun Mountains in the east, with the narrow sense Xiao–Yin Mountains District in between. Employing multi-temporal high-resolution remote sensing images from Google Earth, this study conducted visual interpretation, identifying 10,968 landslides in the Yinshan area, encompassing a total area of 308.94 km2. The largest landslide occupies 2.95 km2, while the smallest covers 84.47 m2. Specifically, the Langshan area comprises 331 landslides with a total area of 11.96 km2, the narrow sense Xiao–Yin Mountains include 3393 landslides covering 64.13 km2, and the Manhan Mountains, Damaqun Mountains, and adjacent areas account for 7244 landslides over a total area of 232.85 km2. This research not only contributes to global landslide cataloging initiatives but also serves as a robust foundation for future geohazard prevention and management efforts.","PeriodicalId":502371,"journal":{"name":"Data","volume":"78 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139854228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data in Astrophysics and Geophysics: Novel Research and Applications 天体物理学和地球物理学数据:新颖的研究和应用
Pub Date : 2024-02-08 DOI: 10.3390/data9020032
V. Srećković, Milan S. Dimitrijević, Z. Mijić
Rapid development of communication technologies and constant technological improvements as a result of scientific discoveries require the establishment of specific databases [...]
通信技术的迅速发展和科学发现带来的技术不断进步要求建立特定的数据库 [...]
{"title":"Data in Astrophysics and Geophysics: Novel Research and Applications","authors":"V. Srećković, Milan S. Dimitrijević, Z. Mijić","doi":"10.3390/data9020032","DOIUrl":"https://doi.org/10.3390/data9020032","url":null,"abstract":"Rapid development of communication technologies and constant technological improvements as a result of scientific discoveries require the establishment of specific databases [...]","PeriodicalId":502371,"journal":{"name":"Data","volume":"64 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139851299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Expanded Brain CT Dataset for the Development of AI Systems for Intracranial Hemorrhage Detection and Classification 用于开发颅内出血检测和分类人工智能系统的扩展脑 CT 数据集
Pub Date : 2024-02-06 DOI: 10.3390/data9020030
A. Khoruzhaya, T. Bobrovskaya, D. V. Kozlov, Dmitriy Kuligovskiy, Vladimir P. Novik, Kirill M. Arzamasov, E. I. Kremneva
Intracranial hemorrhage (ICH) is a dangerous life-threatening condition leading to disability. Timely and high-quality diagnosis plays a huge role in the course and outcome of this disease. The gold standard in determining ICH is computed tomography. This method requires a prompt involvement of highly qualified personnel, which is not always possible, for example, in case of a staff shortage or increased workload. In such a situation, every minute counts, and time can be lost. The solution to this problem seems to be a set of diagnostic decisions, including the use of artificial intelligence, which will help to identify patients with ICH in a timely manner and provide prompt and quality medical care. However, the main obstacle to the development of artificial intelligence is a lack of high-quality datasets for training and testing. In this paper, we present a dataset including 800 brain CT scans consisting of multiple series of DICOM images with and without signs of ICH, enriched with clinical and technical parameters, as well as the methodology of its generation utilizing natural language processing tools. The dataset is publicly available, which contributes to increased competition in the development of artificial intelligence systems and their advancement and quality improvement.
颅内出血(ICH)是一种危及生命并导致残疾的危险疾病。及时和高质量的诊断对疾病的进程和预后起着重要作用。确定 ICH 的金标准是计算机断层扫描。这种方法需要高素质人员的及时参与,但这并不总是可能的,例如在人员短缺或工作量增加的情况下。在这种情况下,必须分秒必争,否则就会耽误时间。解决这一问题的办法似乎是制定一套诊断决策,包括使用人工智能,这将有助于及时发现非物质文化遗产患者,并提供及时、优质的医疗护理。然而,人工智能发展的主要障碍是缺乏用于训练和测试的高质量数据集。在本文中,我们介绍了一个数据集,其中包括 800 张脑 CT 扫描图像,这些图像由多个系列的 DICOM 图像组成,既有 ICH 的迹象,也有非 ICH 的迹象,并添加了临床和技术参数,同时还介绍了利用自然语言处理工具生成数据集的方法。该数据集是公开的,这有助于提高人工智能系统开发的竞争性,促进其进步和质量改进。
{"title":"Expanded Brain CT Dataset for the Development of AI Systems for Intracranial Hemorrhage Detection and Classification","authors":"A. Khoruzhaya, T. Bobrovskaya, D. V. Kozlov, Dmitriy Kuligovskiy, Vladimir P. Novik, Kirill M. Arzamasov, E. I. Kremneva","doi":"10.3390/data9020030","DOIUrl":"https://doi.org/10.3390/data9020030","url":null,"abstract":"Intracranial hemorrhage (ICH) is a dangerous life-threatening condition leading to disability. Timely and high-quality diagnosis plays a huge role in the course and outcome of this disease. The gold standard in determining ICH is computed tomography. This method requires a prompt involvement of highly qualified personnel, which is not always possible, for example, in case of a staff shortage or increased workload. In such a situation, every minute counts, and time can be lost. The solution to this problem seems to be a set of diagnostic decisions, including the use of artificial intelligence, which will help to identify patients with ICH in a timely manner and provide prompt and quality medical care. However, the main obstacle to the development of artificial intelligence is a lack of high-quality datasets for training and testing. In this paper, we present a dataset including 800 brain CT scans consisting of multiple series of DICOM images with and without signs of ICH, enriched with clinical and technical parameters, as well as the methodology of its generation utilizing natural language processing tools. The dataset is publicly available, which contributes to increased competition in the development of artificial intelligence systems and their advancement and quality improvement.","PeriodicalId":502371,"journal":{"name":"Data","volume":"213 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139799614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Expanded Brain CT Dataset for the Development of AI Systems for Intracranial Hemorrhage Detection and Classification 用于开发颅内出血检测和分类人工智能系统的扩展脑 CT 数据集
Pub Date : 2024-02-06 DOI: 10.3390/data9020030
A. Khoruzhaya, T. Bobrovskaya, D. V. Kozlov, Dmitriy Kuligovskiy, Vladimir P. Novik, Kirill M. Arzamasov, E. I. Kremneva
Intracranial hemorrhage (ICH) is a dangerous life-threatening condition leading to disability. Timely and high-quality diagnosis plays a huge role in the course and outcome of this disease. The gold standard in determining ICH is computed tomography. This method requires a prompt involvement of highly qualified personnel, which is not always possible, for example, in case of a staff shortage or increased workload. In such a situation, every minute counts, and time can be lost. The solution to this problem seems to be a set of diagnostic decisions, including the use of artificial intelligence, which will help to identify patients with ICH in a timely manner and provide prompt and quality medical care. However, the main obstacle to the development of artificial intelligence is a lack of high-quality datasets for training and testing. In this paper, we present a dataset including 800 brain CT scans consisting of multiple series of DICOM images with and without signs of ICH, enriched with clinical and technical parameters, as well as the methodology of its generation utilizing natural language processing tools. The dataset is publicly available, which contributes to increased competition in the development of artificial intelligence systems and their advancement and quality improvement.
颅内出血(ICH)是一种危及生命并导致残疾的危险疾病。及时和高质量的诊断对疾病的进程和预后起着重要作用。确定 ICH 的金标准是计算机断层扫描。这种方法需要高素质人员的及时参与,但这并不总是可能的,例如在人员短缺或工作量增加的情况下。在这种情况下,必须分秒必争,否则就会耽误时间。解决这一问题的办法似乎是制定一套诊断决策,包括使用人工智能,这将有助于及时发现非物质文化遗产患者,并提供及时、优质的医疗护理。然而,人工智能发展的主要障碍是缺乏用于训练和测试的高质量数据集。在本文中,我们介绍了一个数据集,其中包括 800 张脑 CT 扫描图像,这些图像由多个系列的 DICOM 图像组成,既有 ICH 的迹象,也有非 ICH 的迹象,并添加了临床和技术参数,同时还介绍了利用自然语言处理工具生成数据集的方法。该数据集是公开的,这有助于提高人工智能系统开发的竞争性,促进其进步和质量改进。
{"title":"Expanded Brain CT Dataset for the Development of AI Systems for Intracranial Hemorrhage Detection and Classification","authors":"A. Khoruzhaya, T. Bobrovskaya, D. V. Kozlov, Dmitriy Kuligovskiy, Vladimir P. Novik, Kirill M. Arzamasov, E. I. Kremneva","doi":"10.3390/data9020030","DOIUrl":"https://doi.org/10.3390/data9020030","url":null,"abstract":"Intracranial hemorrhage (ICH) is a dangerous life-threatening condition leading to disability. Timely and high-quality diagnosis plays a huge role in the course and outcome of this disease. The gold standard in determining ICH is computed tomography. This method requires a prompt involvement of highly qualified personnel, which is not always possible, for example, in case of a staff shortage or increased workload. In such a situation, every minute counts, and time can be lost. The solution to this problem seems to be a set of diagnostic decisions, including the use of artificial intelligence, which will help to identify patients with ICH in a timely manner and provide prompt and quality medical care. However, the main obstacle to the development of artificial intelligence is a lack of high-quality datasets for training and testing. In this paper, we present a dataset including 800 brain CT scans consisting of multiple series of DICOM images with and without signs of ICH, enriched with clinical and technical parameters, as well as the methodology of its generation utilizing natural language processing tools. The dataset is publicly available, which contributes to increased competition in the development of artificial intelligence systems and their advancement and quality improvement.","PeriodicalId":502371,"journal":{"name":"Data","volume":"131 2-3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139859584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Data Pipeline for Comparing the Effects of Momentum on Sports Leagues 比较动量对体育联盟影响的综合数据管道
Pub Date : 2024-02-01 DOI: 10.3390/data9020029
Jordan Truman Paul Noel, Vinicius Prado da Fonseca, Amilcar Soares
Momentum has been a consistently studied aspect of sports science for decades. Among the established literature, there has, at times, been a discrepancy between conclusions. However, if momentum is indeed an actual phenomenon, it would affect all aspects of sports, from player evaluation to pre-game prediction and betting. Therefore, using momentum-based features that quantify a team’s linear trend of play, we develop a data pipeline that uses a small sample of recent games to assess teams’ quality of play and measure the predictive power of momentum-based features versus the predictive power of more traditional frequency-based features across several leagues using several machine learning techniques. More precisely, we use our pipeline to determine the differences in the predictive power of momentum-based features and standard statistical features for the National Hockey League (NHL), National Basketball Association (NBA), and five major first-division European football leagues. Our findings show little evidence that momentum has superior predictive power in the NBA. Still, we found some instances of the effects of momentum on the NHL that produced better pre-game predictors, whereas we view a similar trend in European football/soccer. Our results indicate that momentum-based features combined with frequency-based features could improve pre-game prediction models and that, in the future, momentum should be studied more from a feature/performance indicator point-of-view and less from the view of the dependence of sequential outcomes, thus attempting to distance momentum from the binary view of winning and losing.
几十年来,动量一直是体育科学研究的一个方面。在已有的文献中,有时会出现结论不一致的情况。然而,如果动量确实是一种实际现象,那么它将影响体育的方方面面,从球员评估到赛前预测和投注。因此,我们使用基于动量的特征来量化一支球队的线性比赛趋势,开发了一个数据管道,使用近期比赛的小样本来评估球队的比赛质量,并使用几种机器学习技术来衡量基于动量的特征的预测能力与几个联赛中更传统的基于频率的特征的预测能力。更准确地说,我们使用我们的管道来确定基于动量特征的预测能力与标准统计特征在美国国家冰球联盟(NHL)、美国国家篮球协会(NBA)和欧洲五大甲级足球联赛中的差异。我们的研究结果表明,几乎没有证据表明动量在 NBA 中具有更强的预测能力。尽管如此,我们还是发现了一些动量对国家曲棍球协会的影响,这些影响产生了更好的赛前预测结果,而我们在欧洲足球/橄榄球中也发现了类似的趋势。我们的研究结果表明,基于动量的特征与基于频率的特征相结合,可以改进赛前预测模型,而且未来应更多地从特征/性能指标的角度研究动量,而不是从连续结果的依赖性角度研究动量,从而尝试将动量与二元胜负观拉开距离。
{"title":"A Comprehensive Data Pipeline for Comparing the Effects of Momentum on Sports Leagues","authors":"Jordan Truman Paul Noel, Vinicius Prado da Fonseca, Amilcar Soares","doi":"10.3390/data9020029","DOIUrl":"https://doi.org/10.3390/data9020029","url":null,"abstract":"Momentum has been a consistently studied aspect of sports science for decades. Among the established literature, there has, at times, been a discrepancy between conclusions. However, if momentum is indeed an actual phenomenon, it would affect all aspects of sports, from player evaluation to pre-game prediction and betting. Therefore, using momentum-based features that quantify a team’s linear trend of play, we develop a data pipeline that uses a small sample of recent games to assess teams’ quality of play and measure the predictive power of momentum-based features versus the predictive power of more traditional frequency-based features across several leagues using several machine learning techniques. More precisely, we use our pipeline to determine the differences in the predictive power of momentum-based features and standard statistical features for the National Hockey League (NHL), National Basketball Association (NBA), and five major first-division European football leagues. Our findings show little evidence that momentum has superior predictive power in the NBA. Still, we found some instances of the effects of momentum on the NHL that produced better pre-game predictors, whereas we view a similar trend in European football/soccer. Our results indicate that momentum-based features combined with frequency-based features could improve pre-game prediction models and that, in the future, momentum should be studied more from a feature/performance indicator point-of-view and less from the view of the dependence of sequential outcomes, thus attempting to distance momentum from the binary view of winning and losing.","PeriodicalId":502371,"journal":{"name":"Data","volume":"257 20","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139821370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Organ-On-A-Chip (OOC) Image Dataset for Machine Learning and Tissue Model Evaluation 用于机器学习和组织模型评估的芯片上器官 (OOC) 图像数据集
Pub Date : 2024-02-01 DOI: 10.3390/data9020028
Valerija Movcana, Arnis Strods, Karīna Narbute, Fēlikss Rūmnieks, Roberts Rimša, G. Mozolevskis, Maksims Ivanovs, Roberts Kadiķis, Karlis Zviedris, Laura Leja, Anastasija Zujeva, Tamāra Laimiņa, A. Abols
Organ-on-a-chip (OOC) technology has emerged as a groundbreaking approach for emulating the physiological environment, revolutionizing biomedical research, drug development, and personalized medicine. OOC platforms offer more physiologically relevant microenvironments, enabling real-time monitoring of tissue, to develop functional tissue models. Imaging methods are the most common approach for daily monitoring of tissue development. Image-based machine learning serves as a valuable tool for enhancing and monitoring OOC models in real-time. This involves the classification of images generated through microscopy contributing to the refinement of model performance. This paper presents an image dataset, containing cell images generated from OOC setup with different cell types. There are 3072 images generated by an automated brightfield microscopy setup. For some images, parameters such as cell type, seeding density, time after seeding and flow rate are provided. These parameters along with predefined criteria can contribute to the evaluation of image quality and identification of potential artifacts. This dataset can be used as a basis for training machine learning classifiers for automated data analysis generated from an OOC setup providing more reliable tissue models, automated decision-making processes within the OOC framework and efficient research in the future.
芯片上器官(OOC)技术已成为模拟生理环境的开创性方法,为生物医学研究、药物开发和个性化医疗带来了革命性的变化。OOC 平台可提供更贴近生理的微环境,实现对组织的实时监测,从而开发出功能性组织模型。成像方法是日常监测组织发育的最常用方法。基于图像的机器学习是实时增强和监测 OOC 模型的重要工具。这包括对显微镜下生成的图像进行分类,从而提高模型的性能。本文介绍了一个图像数据集,其中包含由不同细胞类型的 OOC 设置生成的细胞图像。自动明视野显微镜装置生成了 3072 幅图像。对于某些图像,提供了细胞类型、播种密度、播种后时间和流速等参数。这些参数和预定义标准有助于评估图像质量和识别潜在伪影。该数据集可作为训练机器学习分类器的基础,用于对 OOC 设置生成的数据进行自动分析,从而提供更可靠的组织模型、OOC 框架内的自动决策过程以及未来的高效研究。
{"title":"Organ-On-A-Chip (OOC) Image Dataset for Machine Learning and Tissue Model Evaluation","authors":"Valerija Movcana, Arnis Strods, Karīna Narbute, Fēlikss Rūmnieks, Roberts Rimša, G. Mozolevskis, Maksims Ivanovs, Roberts Kadiķis, Karlis Zviedris, Laura Leja, Anastasija Zujeva, Tamāra Laimiņa, A. Abols","doi":"10.3390/data9020028","DOIUrl":"https://doi.org/10.3390/data9020028","url":null,"abstract":"Organ-on-a-chip (OOC) technology has emerged as a groundbreaking approach for emulating the physiological environment, revolutionizing biomedical research, drug development, and personalized medicine. OOC platforms offer more physiologically relevant microenvironments, enabling real-time monitoring of tissue, to develop functional tissue models. Imaging methods are the most common approach for daily monitoring of tissue development. Image-based machine learning serves as a valuable tool for enhancing and monitoring OOC models in real-time. This involves the classification of images generated through microscopy contributing to the refinement of model performance. This paper presents an image dataset, containing cell images generated from OOC setup with different cell types. There are 3072 images generated by an automated brightfield microscopy setup. For some images, parameters such as cell type, seeding density, time after seeding and flow rate are provided. These parameters along with predefined criteria can contribute to the evaluation of image quality and identification of potential artifacts. This dataset can be used as a basis for training machine learning classifiers for automated data analysis generated from an OOC setup providing more reliable tissue models, automated decision-making processes within the OOC framework and efficient research in the future.","PeriodicalId":502371,"journal":{"name":"Data","volume":"47 38","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139683915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Data Pipeline for Comparing the Effects of Momentum on Sports Leagues 比较动量对体育联盟影响的综合数据管道
Pub Date : 2024-02-01 DOI: 10.3390/data9020029
Jordan Truman Paul Noel, Vinicius Prado da Fonseca, Amilcar Soares
Momentum has been a consistently studied aspect of sports science for decades. Among the established literature, there has, at times, been a discrepancy between conclusions. However, if momentum is indeed an actual phenomenon, it would affect all aspects of sports, from player evaluation to pre-game prediction and betting. Therefore, using momentum-based features that quantify a team’s linear trend of play, we develop a data pipeline that uses a small sample of recent games to assess teams’ quality of play and measure the predictive power of momentum-based features versus the predictive power of more traditional frequency-based features across several leagues using several machine learning techniques. More precisely, we use our pipeline to determine the differences in the predictive power of momentum-based features and standard statistical features for the National Hockey League (NHL), National Basketball Association (NBA), and five major first-division European football leagues. Our findings show little evidence that momentum has superior predictive power in the NBA. Still, we found some instances of the effects of momentum on the NHL that produced better pre-game predictors, whereas we view a similar trend in European football/soccer. Our results indicate that momentum-based features combined with frequency-based features could improve pre-game prediction models and that, in the future, momentum should be studied more from a feature/performance indicator point-of-view and less from the view of the dependence of sequential outcomes, thus attempting to distance momentum from the binary view of winning and losing.
几十年来,动量一直是体育科学研究的一个方面。在已有的文献中,有时会出现结论不一致的情况。然而,如果动量确实是一种实际现象,那么它将影响体育的方方面面,从球员评估到赛前预测和投注。因此,我们使用基于动量的特征来量化一支球队的线性比赛趋势,开发了一个数据管道,使用近期比赛的小样本来评估球队的比赛质量,并使用几种机器学习技术来衡量基于动量的特征的预测能力与几个联赛中更传统的基于频率的特征的预测能力。更准确地说,我们使用我们的管道来确定基于动量特征的预测能力与标准统计特征在美国国家冰球联盟(NHL)、美国国家篮球协会(NBA)和欧洲五大甲级足球联赛中的差异。我们的研究结果表明,几乎没有证据表明动量在 NBA 中具有更强的预测能力。尽管如此,我们还是发现了一些动量对国家曲棍球协会的影响,这些影响产生了更好的赛前预测结果,而我们在欧洲足球/橄榄球中也发现了类似的趋势。我们的研究结果表明,基于动量的特征与基于频率的特征相结合,可以改进赛前预测模型,而且未来应更多地从特征/性能指标的角度研究动量,而不是从连续结果的依赖性角度研究动量,从而尝试将动量与二元胜负观拉开距离。
{"title":"A Comprehensive Data Pipeline for Comparing the Effects of Momentum on Sports Leagues","authors":"Jordan Truman Paul Noel, Vinicius Prado da Fonseca, Amilcar Soares","doi":"10.3390/data9020029","DOIUrl":"https://doi.org/10.3390/data9020029","url":null,"abstract":"Momentum has been a consistently studied aspect of sports science for decades. Among the established literature, there has, at times, been a discrepancy between conclusions. However, if momentum is indeed an actual phenomenon, it would affect all aspects of sports, from player evaluation to pre-game prediction and betting. Therefore, using momentum-based features that quantify a team’s linear trend of play, we develop a data pipeline that uses a small sample of recent games to assess teams’ quality of play and measure the predictive power of momentum-based features versus the predictive power of more traditional frequency-based features across several leagues using several machine learning techniques. More precisely, we use our pipeline to determine the differences in the predictive power of momentum-based features and standard statistical features for the National Hockey League (NHL), National Basketball Association (NBA), and five major first-division European football leagues. Our findings show little evidence that momentum has superior predictive power in the NBA. Still, we found some instances of the effects of momentum on the NHL that produced better pre-game predictors, whereas we view a similar trend in European football/soccer. Our results indicate that momentum-based features combined with frequency-based features could improve pre-game prediction models and that, in the future, momentum should be studied more from a feature/performance indicator point-of-view and less from the view of the dependence of sequential outcomes, thus attempting to distance momentum from the binary view of winning and losing.","PeriodicalId":502371,"journal":{"name":"Data","volume":"26 20","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139881413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping Hierarchical File Structures to Semantic Data Models for Efficient Data Integration into Research Data Management Systems 将分层文件结构映射为语义数据模型,以便将数据高效整合到研究数据管理系统中
Pub Date : 2024-01-26 DOI: 10.3390/data9020024
Henrik tom Wörden, Florian Spreckelsen, Stefan Luther, Ulrich Parlitz, A. Schlemmer
Although other methods exist to store and manage data in modern information technology, the standard solution is file systems. Therefore, keeping well-organized file structures and file system layouts can be key to a sustainable research data management infrastructure. However, file structures alone lack several important capabilities for FAIR data management: the two most significant being insufficient visualization of data and inadequate possibilities for searching and obtaining an overview. Research data management systems (RDMSs) can fill this gap, but many do not support the simultaneous use of the file system and RDMS. This simultaneous use can have many benefits, but keeping data in RDMS in synchrony with the file structure is challenging. Here, we present concepts that allow for keeping file structures and semantic data models (in RDMS) synchronous. Furthermore, we propose a specification in yaml format that allows for a structured and extensible declaration and implementation of a mapping between the file system and data models used in semantic research data management. Implementing these concepts will facilitate the re-use of specifications for multiple use cases. Furthermore, the specification can serve as a machine-readable and, at the same time, human-readable documentation of specific file system structures. We demonstrate our work using the Open Source RDMS LinkAhead (previously named “CaosDB”).
尽管现代信息技术中还有其他存储和管理数据的方法,但标准解决方案是文件系统。因此,保持良好的文件结构和文件系统布局是可持续研究数据管理基础设施的关键。然而,仅靠文件结构无法实现 FAIR 数据管理的几个重要功能:其中最重要的两个功能是数据可视化不足,以及搜索和获取概览的可能性不足。研究数据管理系统(RDMS)可以填补这一空白,但许多系统并不支持同时使用文件系统和 RDMS。这种同时使用的方式有很多好处,但让 RDMS 中的数据与文件结构保持同步却是一项挑战。在这里,我们提出了允许文件结构和语义数据模型(在 RDMS 中)保持同步的概念。此外,我们还提出了一种 yaml 格式的规范,可以结构化、可扩展地声明和实现文件系统与语义研究数据管理中使用的数据模型之间的映射。实施这些概念将有助于在多种用例中重复使用规范。此外,该规范可以作为特定文件系统结构的机器可读文档,同时也是人类可读文档。我们使用开源 RDMS LinkAhead(以前名为 "CaosDB")演示了我们的工作。
{"title":"Mapping Hierarchical File Structures to Semantic Data Models for Efficient Data Integration into Research Data Management Systems","authors":"Henrik tom Wörden, Florian Spreckelsen, Stefan Luther, Ulrich Parlitz, A. Schlemmer","doi":"10.3390/data9020024","DOIUrl":"https://doi.org/10.3390/data9020024","url":null,"abstract":"Although other methods exist to store and manage data in modern information technology, the standard solution is file systems. Therefore, keeping well-organized file structures and file system layouts can be key to a sustainable research data management infrastructure. However, file structures alone lack several important capabilities for FAIR data management: the two most significant being insufficient visualization of data and inadequate possibilities for searching and obtaining an overview. Research data management systems (RDMSs) can fill this gap, but many do not support the simultaneous use of the file system and RDMS. This simultaneous use can have many benefits, but keeping data in RDMS in synchrony with the file structure is challenging. Here, we present concepts that allow for keeping file structures and semantic data models (in RDMS) synchronous. Furthermore, we propose a specification in yaml format that allows for a structured and extensible declaration and implementation of a mapping between the file system and data models used in semantic research data management. Implementing these concepts will facilitate the re-use of specifications for multiple use cases. Furthermore, the specification can serve as a machine-readable and, at the same time, human-readable documentation of specific file system structures. We demonstrate our work using the Open Source RDMS LinkAhead (previously named “CaosDB”).","PeriodicalId":502371,"journal":{"name":"Data","volume":"77 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139593674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1