首页 > 最新文献

The 12th International Conference on Advances in Information Technology最新文献

英文 中文
A model of Cyber Threat Information Sharing with the Novel Network Topology 基于新型网络拓扑结构的网络威胁信息共享模型
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3468885
J. Hautamaki, T. Hamalainen
The digitized environments are particularly vulnerable to various attacks. In such a situation of a security attack, detecting and responding to attacks require effective actions. One of the most significant ways to improve resilience to security attacks is to obtain accurate and timely situational aspect of the security awareness. The efficient production and utilization of situation information is achieved by sharing information with other actors in the information sharing network quickly and reliably without compromising the confidential information of one's own organization. At the same time, it should also be possible to avoid a flood of irrelevant information in the sharing network, which wastes resources and slows down the implementation of security measures. In our study, we have investigated how security-related information can be shared online as efficiently as possible by building a security information sharing topology based on the two most widely used network optimization algorithms. In the article, we present a model of an information sharing network, in which three different parameters have been used to optimize the network topology: the activity level of organization, the similarity of information systems between different actors and the requirement for the level of information privacy generally in the organization.
数字化环境特别容易受到各种攻击。在这种安全攻击的情况下,检测和响应攻击需要采取有效的措施。提高对安全攻击的弹性的最重要的方法之一是获得准确和及时的安全意识的情景方面。在不泄露本组织机密信息的前提下,快速、可靠地与信息共享网络中的其他参与者共享信息,实现态势信息的高效生产和利用。同时,还应该能够避免共享网络中大量不相关的信息,从而浪费资源,减缓安全措施的实施。在我们的研究中,我们通过构建基于两种最广泛使用的网络优化算法的安全信息共享拓扑,研究了如何尽可能有效地在线共享安全相关信息。在本文中,我们提出了一个信息共享网络模型,其中使用了三个不同的参数来优化网络拓扑:组织的活动水平、不同参与者之间信息系统的相似性和组织中一般信息隐私水平的要求。
{"title":"A model of Cyber Threat Information Sharing with the Novel Network Topology","authors":"J. Hautamaki, T. Hamalainen","doi":"10.1145/3468784.3468885","DOIUrl":"https://doi.org/10.1145/3468784.3468885","url":null,"abstract":"The digitized environments are particularly vulnerable to various attacks. In such a situation of a security attack, detecting and responding to attacks require effective actions. One of the most significant ways to improve resilience to security attacks is to obtain accurate and timely situational aspect of the security awareness. The efficient production and utilization of situation information is achieved by sharing information with other actors in the information sharing network quickly and reliably without compromising the confidential information of one's own organization. At the same time, it should also be possible to avoid a flood of irrelevant information in the sharing network, which wastes resources and slows down the implementation of security measures. In our study, we have investigated how security-related information can be shared online as efficiently as possible by building a security information sharing topology based on the two most widely used network optimization algorithms. In the article, we present a model of an information sharing network, in which three different parameters have been used to optimize the network topology: the activity level of organization, the similarity of information systems between different actors and the requirement for the level of information privacy generally in the organization.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128257863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visualising Developing Nations Health Records: Opportunities, Challenges and Research Agenda 发展中国家健康记录可视化:机遇、挑战和研究议程
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3471607
Afamefuna Umejiaku, Tommy Dang
The benefits of effectively visualizing health records in huge volumes has resulted in health organizations, insurance companies, policy and decision makers, governments and drug manufactures’ transformation in the way research is conducted. This has also played a key role in determining investment of resources. Health records contain highly valuable information; processing these records in large volumes is now possible due to technological advancement which allows for the extraction of highly valuable knowledge that has resulted in breakthroughs in scientific communities. To visualize health records in large volumes, the records need to be stored in electronic forms, properly documented, processed, and analyzed. A good visualization technique is used to present the analyzed information, allowing for effective knowledge extraction which is done in a secured manner protecting the privacy of the patients whose health records were used. As research and technological advancement have improved, the quality of knowledge extracted from health records have also improved; unfortunately, the numerous benefits of visualizing health records have only been felt in developed nations, unlike other sectors where technological advancement in developed nations have had similar impact in developing nations. This paper identifies the characteristics of health records and the challenges involved in processing large volumes of health records. This is to identify possible steps that could be taken for developing nations to benefit from visualizing health records in huge volumes.
大量健康记录的有效可视化带来的好处,已经促使卫生组织、保险公司、政策和决策者、政府和药品制造商在开展研究的方式上发生了转变。这也在决定资源投资方面发挥了关键作用。健康记录包含非常有价值的信息;由于技术的进步,现在可以大量处理这些记录,从而可以提取导致科学界突破的极有价值的知识。为了可视化大量的健康记录,需要将记录存储在电子形式中,并进行适当的记录、处理和分析。使用良好的可视化技术来呈现分析的信息,允许以安全的方式进行有效的知识提取,从而保护使用其健康记录的患者的隐私。随着研究和技术进步的提高,从健康记录中提取的知识质量也有所提高;不幸的是,健康记录可视化的诸多好处只在发达国家得到了体现,而在其他领域,发达国家的技术进步对发展中国家产生了类似的影响。本文指出了健康记录的特点和处理大量健康记录所面临的挑战。这是为了确定发展中国家可以采取的可能步骤,以便从大量健康记录的可视化中受益。
{"title":"Visualising Developing Nations Health Records: Opportunities, Challenges and Research Agenda","authors":"Afamefuna Umejiaku, Tommy Dang","doi":"10.1145/3468784.3471607","DOIUrl":"https://doi.org/10.1145/3468784.3471607","url":null,"abstract":"The benefits of effectively visualizing health records in huge volumes has resulted in health organizations, insurance companies, policy and decision makers, governments and drug manufactures’ transformation in the way research is conducted. This has also played a key role in determining investment of resources. Health records contain highly valuable information; processing these records in large volumes is now possible due to technological advancement which allows for the extraction of highly valuable knowledge that has resulted in breakthroughs in scientific communities. To visualize health records in large volumes, the records need to be stored in electronic forms, properly documented, processed, and analyzed. A good visualization technique is used to present the analyzed information, allowing for effective knowledge extraction which is done in a secured manner protecting the privacy of the patients whose health records were used. As research and technological advancement have improved, the quality of knowledge extracted from health records have also improved; unfortunately, the numerous benefits of visualizing health records have only been felt in developed nations, unlike other sectors where technological advancement in developed nations have had similar impact in developing nations. This paper identifies the characteristics of health records and the challenges involved in processing large volumes of health records. This is to identify possible steps that could be taken for developing nations to benefit from visualizing health records in huge volumes.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127883668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Improving the Robustness of a Convolutional Neural Network with Out-of-Distribution Data Fine-Tuning and Image Preprocessing 用离分布数据微调和图像预处理提高卷积神经网络的鲁棒性
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3470655
Shafinul Haque, A. Liu, Serena Liu, Jonathan H. Chan
Deep convolutional neural networks trained on readily available datasets are often susceptible to decreases in performance when executing tasks on new data from a different domain. Making models generalize well on data in a new domain is the task of domain adaptation. Recently, a simple method, known as Out-of-Distribution Image Detector for Neural Networks (ODIN), was proposed for identifying out-of-distribution (OOD) images in a dataset. This paper proposes fine-tuning an image classifier model using OOD images detected in an ideal training set to improve the model's ability to classify real-life images. This work aims to investigate the effectiveness of such a technique, as well as image preprocessing methods like background removal and image cropping, at increasing the robustness of a ResNet50V2 baseline image classifier in the context of a multi-class classification task. It was observed that fine-tuning with OOD images identified by ODIN consistently increased the model's performance and that a combination of cropping images and fine-tuning with OOD images resulted in the greatest increase in the model's performance.
在现成可用的数据集上训练的深度卷积神经网络在对来自不同领域的新数据执行任务时,往往容易受到性能下降的影响。使模型能很好地泛化新领域的数据是领域自适应的任务。最近,一种简单的方法被称为神经网络的离分布图像检测器(ODIN),用于识别数据集中的离分布(OOD)图像。本文提出使用理想训练集中检测到的OOD图像对图像分类器模型进行微调,以提高模型对真实图像的分类能力。这项工作旨在研究这种技术的有效性,以及图像预处理方法,如背景去除和图像裁剪,在多类分类任务的背景下增加ResNet50V2基线图像分类器的鲁棒性。我们观察到,使用ODIN识别的OOD图像进行微调可以持续提高模型的性能,并且将裁剪图像与使用OOD图像进行微调相结合可以最大程度地提高模型的性能。
{"title":"Improving the Robustness of a Convolutional Neural Network with Out-of-Distribution Data Fine-Tuning and Image Preprocessing","authors":"Shafinul Haque, A. Liu, Serena Liu, Jonathan H. Chan","doi":"10.1145/3468784.3470655","DOIUrl":"https://doi.org/10.1145/3468784.3470655","url":null,"abstract":"Deep convolutional neural networks trained on readily available datasets are often susceptible to decreases in performance when executing tasks on new data from a different domain. Making models generalize well on data in a new domain is the task of domain adaptation. Recently, a simple method, known as Out-of-Distribution Image Detector for Neural Networks (ODIN), was proposed for identifying out-of-distribution (OOD) images in a dataset. This paper proposes fine-tuning an image classifier model using OOD images detected in an ideal training set to improve the model's ability to classify real-life images. This work aims to investigate the effectiveness of such a technique, as well as image preprocessing methods like background removal and image cropping, at increasing the robustness of a ResNet50V2 baseline image classifier in the context of a multi-class classification task. It was observed that fine-tuning with OOD images identified by ODIN consistently increased the model's performance and that a combination of cropping images and fine-tuning with OOD images resulted in the greatest increase in the model's performance.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122953040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
OutViz: Visualizing the Outliers of Multivariate Time Series OutViz:多变量时间序列的异常值可视化
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3471606
Jake Gonzalez, Tommy Dang
This paper proposes OutViz, a dual view framework for representing and filtering multivariate time series data to highlight abnormal patterns in a dataset. The first view of the proposed visualization incorporates a parallel coordinate chart that allows the user to analyze the scores of features extracted from a dimensionality reduction density-based clustering outlier detection algorithm to determine why a particular time series is predicted to be an outlier. Also included on the parallel coordinates chart is an outlier score rank axis that allows the user to select a range of time series data to be filtered and displayed on the second view of the framework. The second view of our proposed framework uses a multi-line chart to represent how each time series variable changes over a range of time. Each time series is represented as a line with the position on the horizontal axis representing a point in time, while the vertical axis encodes the data value. Use cases using real-world multivariate time series data are demonstrated to show the advantages of using the proposed framework for data analytics as well as some findings uncovered while using OutViz on life expectancy data from 236 countries between the year 1960 and 2018, and carbon dioxide emissions data from 210 countries between the year 1960 and 2016.
本文提出了一种双视图框架OutViz,用于表示和过滤多变量时间序列数据,以突出数据集中的异常模式。提出的可视化的第一个视图包含一个平行坐标图,允许用户分析从基于降维密度的聚类离群值检测算法提取的特征的分数,以确定为什么预测特定的时间序列是离群值。平行坐标图中还包括一个离群值排名轴,它允许用户选择要过滤的时间序列数据范围,并显示在框架的第二个视图上。我们提出的框架的第二个视图使用多线图来表示每个时间序列变量在一段时间内的变化情况。每个时间序列用一条线表示,横轴上的位置表示一个时间点,纵轴编码数据值。使用实际多变量时间序列数据的用例展示了使用所提出的数据分析框架的优势,以及使用OutViz对1960年至2018年236个国家的预期寿命数据和1960年至2016年210个国家的二氧化碳排放数据所发现的一些发现。
{"title":"OutViz: Visualizing the Outliers of Multivariate Time Series","authors":"Jake Gonzalez, Tommy Dang","doi":"10.1145/3468784.3471606","DOIUrl":"https://doi.org/10.1145/3468784.3471606","url":null,"abstract":"This paper proposes OutViz, a dual view framework for representing and filtering multivariate time series data to highlight abnormal patterns in a dataset. The first view of the proposed visualization incorporates a parallel coordinate chart that allows the user to analyze the scores of features extracted from a dimensionality reduction density-based clustering outlier detection algorithm to determine why a particular time series is predicted to be an outlier. Also included on the parallel coordinates chart is an outlier score rank axis that allows the user to select a range of time series data to be filtered and displayed on the second view of the framework. The second view of our proposed framework uses a multi-line chart to represent how each time series variable changes over a range of time. Each time series is represented as a line with the position on the horizontal axis representing a point in time, while the vertical axis encodes the data value. Use cases using real-world multivariate time series data are demonstrated to show the advantages of using the proposed framework for data analytics as well as some findings uncovered while using OutViz on life expectancy data from 236 countries between the year 1960 and 2018, and carbon dioxide emissions data from 210 countries between the year 1960 and 2016.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114804416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The 3-dimensional Plant Organs Point Clouds Classification for the Phenotyping Application based on CNNs. 基于cnn的植物器官点云三维分类在表型分析中的应用。
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3469949
Kanittha Rungyaem, K. Sukvichai, T. Phatrapornnant
The rice breeding produces the high-throughput via a genotyping technology. It can rapidly test and analyze on a large number of samples while the performance of phenotypic evaluation is still very low because of the manually evaluation. Therefore, this is the main barrier retarding the new rice varieties development. This research is aimed to develop a method for classifying plant organs from 3D point cloud in order to analyze plant morphology or architecture automatically. The rice plant was scanned with a 3D laser scan machine. The points in the cloud were reduced by the skeleton skimming method because the number of points in each cloud group is too large. Thus, it is necessary to preprocess before importing into neural networks for classification. The PointNet was selected as the 3D classifier in this research. The first experiment was conducted in order to evaluate the proposed method. The result showed that the proposed method can classify rice organs, regardless of rice varieties, with accuracy of 87.04%. Then, the second experiment was conducted in order to obtain the accuracy of the network for each rice variety to demonstrate the influence of rice cultivars in the classification due to their different shapes. The results showed that the SPRLR, which had large numbers of leaves and yield, has the lowest accuracy of 51.61% while the other varieties with the greater leaf and panicle distribution have a much better accuracy. The Nieow dum had 91.16% accuracy while Jae hwa, Kaow lueng and Kam had 89.06%, 86.52% and 75.22% accuracy respectively.
水稻育种通过基因分型技术产生高通量。它可以对大量样品进行快速测试和分析,但由于人工评估,表型评估的性能仍然很低。因此,这是制约水稻新品种培育的主要障碍。本研究旨在开发一种基于三维点云的植物器官分类方法,以实现植物形态或结构的自动分析。水稻植株用3D激光扫描仪进行了扫描。由于每个云组中的点数量太大,采用骨架略读法对云中的点进行了缩减。因此,在导入神经网络进行分类之前,有必要进行预处理。本研究选择PointNet作为三维分类器。为了评估所提出的方法,进行了第一次实验。结果表明,该方法可以对不同品种的水稻器官进行分类,准确率为87.04%。然后,进行第二次实验,以获得每个水稻品种的网络精度,以证明水稻品种因其形状不同而对分类的影响。结果表明,叶片数量多、产量大的SPRLR精度最低,为51.61%,而其他叶片和穗分布较大的品种精度较高。Nieow dum的准确率为91.16%,Jae hwa、Kaow lueng和Kam的准确率分别为89.06%、86.52%和75.22%。
{"title":"The 3-dimensional Plant Organs Point Clouds Classification for the Phenotyping Application based on CNNs.","authors":"Kanittha Rungyaem, K. Sukvichai, T. Phatrapornnant","doi":"10.1145/3468784.3469949","DOIUrl":"https://doi.org/10.1145/3468784.3469949","url":null,"abstract":"The rice breeding produces the high-throughput via a genotyping technology. It can rapidly test and analyze on a large number of samples while the performance of phenotypic evaluation is still very low because of the manually evaluation. Therefore, this is the main barrier retarding the new rice varieties development. This research is aimed to develop a method for classifying plant organs from 3D point cloud in order to analyze plant morphology or architecture automatically. The rice plant was scanned with a 3D laser scan machine. The points in the cloud were reduced by the skeleton skimming method because the number of points in each cloud group is too large. Thus, it is necessary to preprocess before importing into neural networks for classification. The PointNet was selected as the 3D classifier in this research. The first experiment was conducted in order to evaluate the proposed method. The result showed that the proposed method can classify rice organs, regardless of rice varieties, with accuracy of 87.04%. Then, the second experiment was conducted in order to obtain the accuracy of the network for each rice variety to demonstrate the influence of rice cultivars in the classification due to their different shapes. The results showed that the SPRLR, which had large numbers of leaves and yield, has the lowest accuracy of 51.61% while the other varieties with the greater leaf and panicle distribution have a much better accuracy. The Nieow dum had 91.16% accuracy while Jae hwa, Kaow lueng and Kam had 89.06%, 86.52% and 75.22% accuracy respectively.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122641235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HMaViz: Human-machine analytics for visual recommendation HMaViz:用于视觉推荐的人机分析
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3471601
Ngan V. T. Nguyen, Vung V. Pham, Tommy Dang
Visualizations are context-specific. Understanding the context of visualizations before deciding to use them is a daunting task since users have various backgrounds, and there are thousands of available visual representations (and their variances). To this end, this paper proposes a visual analytics framework to achieve the following research goals: (1) to automatically generate a number of suitable representations for visualizing the input data and present it to users as a catalog of visualizations with different levels of abstractions and data characteristics on one/two/multi-dimensional spaces (2) to infer aspects of the user’s interest based on their interactions (3) to narrow down a smaller set of visualizations that suit users analysis intention. The results of this process give our analytics system the means to better understand the user’s analysis process and enable it to better provide timely recommendations.
可视化是特定于上下文的。在决定使用可视化之前,理解它们的上下文是一项艰巨的任务,因为用户具有不同的背景,并且有数千种可用的可视化表示(及其差异)。为此,本文提出了可视化分析框架,以实现以下研究目标:(1)自动生成一些适合可视化输入数据的表示,并将其作为在一维/二维/多维空间上具有不同抽象级别和数据特征的可视化目录呈现给用户(2)根据用户的交互推断用户感兴趣的方面(3)缩小适合用户分析意图的较小的可视化集合。这个过程的结果使我们的分析系统能够更好地理解用户的分析过程,并使其能够更好地提供及时的建议。
{"title":"HMaViz: Human-machine analytics for visual recommendation","authors":"Ngan V. T. Nguyen, Vung V. Pham, Tommy Dang","doi":"10.1145/3468784.3471601","DOIUrl":"https://doi.org/10.1145/3468784.3471601","url":null,"abstract":"Visualizations are context-specific. Understanding the context of visualizations before deciding to use them is a daunting task since users have various backgrounds, and there are thousands of available visual representations (and their variances). To this end, this paper proposes a visual analytics framework to achieve the following research goals: (1) to automatically generate a number of suitable representations for visualizing the input data and present it to users as a catalog of visualizations with different levels of abstractions and data characteristics on one/two/multi-dimensional spaces (2) to infer aspects of the user’s interest based on their interactions (3) to narrow down a smaller set of visualizations that suit users analysis intention. The results of this process give our analytics system the means to better understand the user’s analysis process and enable it to better provide timely recommendations.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125865080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Phytochemicals as potential inhibitors for novel coronavirus 2019-nCoV/SARS-CoV-2: a graph-based computational analysis 植物化学物质作为新型冠状病毒2019-nCoV/SARS-CoV-2的潜在抑制剂:基于图的计算分析
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3468886
M. Mandal
Corona viruses (CoVs) are a group of infectious viruses that causes the regular cold to more extreme illnesses like Middle East Respiratory Syndrome (MERS-CoV), Severe Acute Respiratory Syndrome (SARS-CoV) and epic Covid (nCoV) is another strain that has been recently recognized in people. The formulation of effective drugs and treatment strategies are desperately required for 2019-nCoV/SARS-CoV-2 outbreak. Reducing the clinical trial period of existing as well as new drugs, the phytochemicals present in natural products would be helpful to get a quick treatment solution for this pandemic. Here, computationally some of the effective phytochemicals are identified for treating Covid. Publicly available databases have been used for collecting the phytochemicals and their associated genes that also interact with Corona viruses. Then a bipartite graph has been built with two sets of inputs; one set is the set of phytochemicals and the second set is the set of viruses. Thereafter, the eigen vector centrality which is the measure of most influential node in a graph has been calculated for each phytochemical. We found four such phytochemicals which have the top four eigen vector score. Then again, all possible cliques from the bipartite graph have been calculated and it has been seen that the same top four phytochemicals are present in almost all the bicliques. Finally, these top four phytochemicals have been investigated for their molecular and drug likeliness properties. Also the ADMET profile of the top phytochemicals are explored and analyzed.
冠状病毒(cov)是一组传染性病毒,可导致普通感冒到更极端的疾病,如中东呼吸综合征(MERS-CoV)、严重急性呼吸综合征(SARS-CoV),而新冠病毒(nCoV)是最近在人类中发现的另一种病毒。2019-nCoV/SARS-CoV-2疫情迫切需要制定有效的药物和治疗策略。减少现有药物和新药的临床试验期,天然产品中存在的植物化学物质将有助于获得针对此次大流行的快速治疗方案。在这里,通过计算确定了一些治疗Covid的有效植物化学物质。公开可用的数据库已用于收集也与冠状病毒相互作用的植物化学物质及其相关基因。然后建立了具有两组输入的二部图;一组是植物化学物质,另一组是病毒。然后,计算了每种植物化学物质的特征向量中心性,即图中最具影响力节点的度量。我们发现了四种这样的植物化学物质,它们具有前四个特征向量得分。然后,从二部图中计算出所有可能的派系,并且已经看到几乎所有的自行车中都存在相同的前四种植物化学物质。最后,对这四种植物化学物质的分子和药物可能性进行了研究。并对顶级植物化学物质的ADMET谱进行了探讨和分析。
{"title":"Phytochemicals as potential inhibitors for novel coronavirus 2019-nCoV/SARS-CoV-2: a graph-based computational analysis","authors":"M. Mandal","doi":"10.1145/3468784.3468886","DOIUrl":"https://doi.org/10.1145/3468784.3468886","url":null,"abstract":"Corona viruses (CoVs) are a group of infectious viruses that causes the regular cold to more extreme illnesses like Middle East Respiratory Syndrome (MERS-CoV), Severe Acute Respiratory Syndrome (SARS-CoV) and epic Covid (nCoV) is another strain that has been recently recognized in people. The formulation of effective drugs and treatment strategies are desperately required for 2019-nCoV/SARS-CoV-2 outbreak. Reducing the clinical trial period of existing as well as new drugs, the phytochemicals present in natural products would be helpful to get a quick treatment solution for this pandemic. Here, computationally some of the effective phytochemicals are identified for treating Covid. Publicly available databases have been used for collecting the phytochemicals and their associated genes that also interact with Corona viruses. Then a bipartite graph has been built with two sets of inputs; one set is the set of phytochemicals and the second set is the set of viruses. Thereafter, the eigen vector centrality which is the measure of most influential node in a graph has been calculated for each phytochemical. We found four such phytochemicals which have the top four eigen vector score. Then again, all possible cliques from the bipartite graph have been calculated and it has been seen that the same top four phytochemicals are present in almost all the bicliques. Finally, these top four phytochemicals have been investigated for their molecular and drug likeliness properties. Also the ADMET profile of the top phytochemicals are explored and analyzed.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126516354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Learning from Others: A Data Driven Transfer Learning based Daily New COVID-19 Case Prediction in India using an Ensemble of LSTM-RNNs 向他人学习:使用lstm - rnn集合的基于数据驱动的迁移学习的印度每日新冠肺炎病例预测
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3470769
Debasrita Chakraborty, Debayan Goswami, Ashish Ghosh, Jonathan H. Chan, Susmita K. Ghosh
Accurate prediction of the number of COVID-19 infected cases per day is fast becoming a critical necessity globally to mitigate the burden on various health systems. In a densely populated country like India which has currently the second highest number of infections and limited medical support, it is a need for the authorities to know the statistics beforehand to address these issues more effectively. In this article, a data driven transfer learning based model is proposed that takes into account the conditions of different countries which have witnessed the COVID-19 infection. We have taken four countries to be the source domain for transfer learning scenario namely, the United States of America, Spain, Brazil and Bangladesh. We have pre-trained four different LSTM-RNN models with each of the country’s data and have re-trained (fine tuned) each of the models using only a very small portion of Indian data on COVID-19. Predictions of these four models are averaged to get the actual prediction. It is seen that such an ensemble model outperforms all the compared models and accurately predicts even the daily cases. This may be due to the fact that the four LSTM-RNNs used here could successfully take into account the diversities of conditions. As India is a diverse nation with variety of climates, it makes more sense to incorporate such transfer learning techniques.
准确预测每天COVID-19感染病例的数量正迅速成为全球减轻各种卫生系统负担的关键必要条件。在印度这样一个人口稠密的国家,目前感染人数第二高,医疗支持有限,当局需要事先了解统计数据,以便更有效地解决这些问题。本文提出了一种基于数据驱动的迁移学习模型,该模型考虑了不同国家的COVID-19感染情况。我们选取了四个国家作为迁移学习情景的源域,即美国、西班牙、巴西和孟加拉国。我们使用每个国家的数据预训练了四个不同的LSTM-RNN模型,并仅使用印度关于COVID-19的一小部分数据对每个模型进行了重新训练(微调)。对这四种模型的预测值进行平均,得到实际的预测值。可以看出,这种集成模型优于所有比较的模型,甚至可以准确地预测日常情况。这可能是因为这里使用的四个lstm - rnn可以成功地考虑到条件的多样性。由于印度是一个气候多样的多元化国家,因此采用这种迁移学习技术更有意义。
{"title":"Learning from Others: A Data Driven Transfer Learning based Daily New COVID-19 Case Prediction in India using an Ensemble of LSTM-RNNs","authors":"Debasrita Chakraborty, Debayan Goswami, Ashish Ghosh, Jonathan H. Chan, Susmita K. Ghosh","doi":"10.1145/3468784.3470769","DOIUrl":"https://doi.org/10.1145/3468784.3470769","url":null,"abstract":"Accurate prediction of the number of COVID-19 infected cases per day is fast becoming a critical necessity globally to mitigate the burden on various health systems. In a densely populated country like India which has currently the second highest number of infections and limited medical support, it is a need for the authorities to know the statistics beforehand to address these issues more effectively. In this article, a data driven transfer learning based model is proposed that takes into account the conditions of different countries which have witnessed the COVID-19 infection. We have taken four countries to be the source domain for transfer learning scenario namely, the United States of America, Spain, Brazil and Bangladesh. We have pre-trained four different LSTM-RNN models with each of the country’s data and have re-trained (fine tuned) each of the models using only a very small portion of Indian data on COVID-19. Predictions of these four models are averaged to get the actual prediction. It is seen that such an ensemble model outperforms all the compared models and accurately predicts even the daily cases. This may be due to the fact that the four LSTM-RNNs used here could successfully take into account the diversities of conditions. As India is a diverse nation with variety of climates, it makes more sense to incorporate such transfer learning techniques.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131277219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Real-time Sound Visualization via Multidimensional Clustering and Projections 基于多维聚类和投影的实时声音可视化
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3471604
N. Le, Ngan V. T. Nguyen, Tommy Dang
Sound plays a vital role in every aspect of human life since it is one of the primary sensory information that our auditory system collects and allows us to perceive the world. Sound clustering and visualization is the process of collecting and analyzing audio samples; that process is a prerequisite of sound classification, which is the core of automatic speech recognition, virtual assistants, and text to speech applications. Nevertheless, understanding how to recognize and properly interpret complex, high-dimensional audio data is the most significant challenge in sound clustering and visualization. This paper proposed a web-based platform to visualize and cluster similar sound samples of musical notes and human speech in real-time. For visualizing high-dimensional data like audio, Mel-Frequency Cepstral Coefficients (MFCCs) were initially developed to represent the sounds made by the human vocal tract are extracted. Then, t-distributed Stochastic Neighbor Embedding (t-SNE), a dimensionality reduction technique, was designed for high dimensional datasets is applied. This paper focuses on both data clustering and high-dimensional visualization methods to properly present the clustering results in the most meaningful way to uncover potentially interesting behavioral patterns of musical notes played by different instruments.
声音在人类生活的各个方面都起着至关重要的作用,因为它是听觉系统收集的主要感官信息之一,使我们能够感知世界。声音聚类和可视化是收集和分析音频样本的过程;这个过程是声音分类的先决条件,而声音分类是自动语音识别、虚拟助手和文本到语音应用程序的核心。然而,理解如何识别和正确解释复杂的高维音频数据是声音聚类和可视化中最重要的挑战。本文提出了一个基于web的平台,可以实时地对音符和人类语音的相似声音样本进行可视化和聚类。为了可视化音频等高维数据,最初开发了Mel-Frequency倒谱系数(MFCCs)来表示提取的人类声道发出的声音。然后,针对高维数据集,设计了t分布随机邻域嵌入(t-SNE)降维技术。本文重点研究了数据聚类和高维可视化两种方法,以最有意义的方式恰当地呈现聚类结果,以揭示不同乐器演奏的音符潜在的有趣行为模式。
{"title":"Real-time Sound Visualization via Multidimensional Clustering and Projections","authors":"N. Le, Ngan V. T. Nguyen, Tommy Dang","doi":"10.1145/3468784.3471604","DOIUrl":"https://doi.org/10.1145/3468784.3471604","url":null,"abstract":"Sound plays a vital role in every aspect of human life since it is one of the primary sensory information that our auditory system collects and allows us to perceive the world. Sound clustering and visualization is the process of collecting and analyzing audio samples; that process is a prerequisite of sound classification, which is the core of automatic speech recognition, virtual assistants, and text to speech applications. Nevertheless, understanding how to recognize and properly interpret complex, high-dimensional audio data is the most significant challenge in sound clustering and visualization. This paper proposed a web-based platform to visualize and cluster similar sound samples of musical notes and human speech in real-time. For visualizing high-dimensional data like audio, Mel-Frequency Cepstral Coefficients (MFCCs) were initially developed to represent the sounds made by the human vocal tract are extracted. Then, t-distributed Stochastic Neighbor Embedding (t-SNE), a dimensionality reduction technique, was designed for high dimensional datasets is applied. This paper focuses on both data clustering and high-dimensional visualization methods to properly present the clustering results in the most meaningful way to uncover potentially interesting behavioral patterns of musical notes played by different instruments.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132710308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Study of Relationship Between Music Streaming Behavior and Big Five Personality Traits of Spotify Users 音乐流媒体行为与Spotify用户五大人格特征的关系研究
Pub Date : 2021-06-29 DOI: 10.1145/3468784.3469854
Thanit Hongpanarak, J. Mongkolnavin
Personality Traits are important customer insights for business. Persuasive messages in advertising campaigns are more effective when customized to fit the customers' personalities. Researches suggested that music preference can reflect personality traits. However, those studies collected music listening history by using self-report of which the data obtained can be incomplete. This research aims to increase the completeness of music listening data by conducting a study on the three-month music streaming history of volunteers recorded automatically by Spotify. The eight audio features of each song (Acousticness, Danceability, Energy, Instrumentalness, Liveness, Speechiness, Valence, and Tempo) were extracted using Spotify's Application Programming Interface. The averages of these features calculated from songs in the music streaming history of each volunteer were used to represent his music preference. Pearson's Correlation method was employed to analyze relationships between the Big 5 Personality Traits and the music preference of 40 volunteers. The result shows a positive correlation between Openness-to-Experience and Liveness, a positive correlation between Extraversion and Acousticness, and a negative correlation between Extraversion with Energy and Speechiness. Agreeableness shows a positive correlation with Tempo. Instrumentalness is the only song feature that has a positive correlation with Neuroticism.
个性特征是商业中重要的客户洞察。在广告活动中,有说服力的信息如果根据顾客的个性进行定制,效果会更好。研究表明,音乐偏好可以反映个性特征。然而,这些研究采用自我报告的方式收集音乐听史,所得数据可能不完整。本次研究的目的是通过对Spotify自动录制的志愿者三个月的音乐流媒体历史进行研究,提高音乐收听数据的完整性。每首歌的八个音频特征(声学,舞蹈性,能量,器乐性,活泼,言语性,价和节奏)是使用Spotify的应用程序编程接口提取的。从每个志愿者的音乐流媒体历史中计算出的这些特征的平均值被用来表示他的音乐偏好。采用Pearson相关法分析了40名志愿者的五大人格特征与音乐偏好之间的关系。结果表明,开放性与活泼度呈正相关,外向性与声学性呈正相关,外向性与能量和言语性呈负相关。宜人性与节奏呈正相关。器乐性是唯一与神经质呈正相关的歌曲特征。
{"title":"A Study of Relationship Between Music Streaming Behavior and Big Five Personality Traits of Spotify Users","authors":"Thanit Hongpanarak, J. Mongkolnavin","doi":"10.1145/3468784.3469854","DOIUrl":"https://doi.org/10.1145/3468784.3469854","url":null,"abstract":"Personality Traits are important customer insights for business. Persuasive messages in advertising campaigns are more effective when customized to fit the customers' personalities. Researches suggested that music preference can reflect personality traits. However, those studies collected music listening history by using self-report of which the data obtained can be incomplete. This research aims to increase the completeness of music listening data by conducting a study on the three-month music streaming history of volunteers recorded automatically by Spotify. The eight audio features of each song (Acousticness, Danceability, Energy, Instrumentalness, Liveness, Speechiness, Valence, and Tempo) were extracted using Spotify's Application Programming Interface. The averages of these features calculated from songs in the music streaming history of each volunteer were used to represent his music preference. Pearson's Correlation method was employed to analyze relationships between the Big 5 Personality Traits and the music preference of 40 volunteers. The result shows a positive correlation between Openness-to-Experience and Liveness, a positive correlation between Extraversion and Acousticness, and a negative correlation between Extraversion with Energy and Speechiness. Agreeableness shows a positive correlation with Tempo. Instrumentalness is the only song feature that has a positive correlation with Neuroticism.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133097368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
The 12th International Conference on Advances in Information Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1