首页 > 最新文献

Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)最新文献

英文 中文
DT2I: Dense Text-to-Image Generation from Region Descriptions 从区域描述生成密集文本到图像
Stanislav Frolov, Prateek Bansal, Jörn Hees, A. Dengel
Despite astonishing progress, generating realistic images of complex scenes remains a challenging problem. Recently, layout-to-image synthesis approaches have attracted much interest by conditioning the generator on a list of bounding boxes and corresponding class labels. However, previous approaches are very restrictive because the set of labels is fixed a priori. Meanwhile, text-to-image synthesis methods have substantially improved and provide a flexible way for conditional image generation. In this work, we introduce dense text-to-image (DT2I) synthesis as a new task to pave the way toward more intuitive image generation. Furthermore, we propose DTC-GAN, a novel method to generate images from semantically rich region descriptions, and a multi-modal region feature matching loss to encourage semantic image-text matching. Our results demonstrate the capability of our approach to generate plausible images of complex scenes using region captions.
尽管取得了惊人的进步,但生成复杂场景的逼真图像仍然是一个具有挑战性的问题。最近,布局到图像的合成方法通过在边界框列表和相应的类标签上调节生成器,引起了人们的极大兴趣。然而,以前的方法是非常严格的,因为标签集是先验固定的。同时,文本到图像的合成方法也有了很大的改进,为条件图像生成提供了一种灵活的方式。在这项工作中,我们引入密集文本到图像(DT2I)合成作为一项新任务,为更直观的图像生成铺平道路。此外,我们提出了一种从语义丰富的区域描述生成图像的新方法DTC-GAN,以及一种多模态区域特征匹配损失来促进语义图像-文本匹配。我们的结果证明了我们的方法能够使用区域说明生成复杂场景的可信图像。
{"title":"DT2I: Dense Text-to-Image Generation from Region Descriptions","authors":"Stanislav Frolov, Prateek Bansal, Jörn Hees, A. Dengel","doi":"10.48550/arXiv.2204.02035","DOIUrl":"https://doi.org/10.48550/arXiv.2204.02035","url":null,"abstract":"Despite astonishing progress, generating realistic images of complex scenes remains a challenging problem. Recently, layout-to-image synthesis approaches have attracted much interest by conditioning the generator on a list of bounding boxes and corresponding class labels. However, previous approaches are very restrictive because the set of labels is fixed a priori. Meanwhile, text-to-image synthesis methods have substantially improved and provide a flexible way for conditional image generation. In this work, we introduce dense text-to-image (DT2I) synthesis as a new task to pave the way toward more intuitive image generation. Furthermore, we propose DTC-GAN, a novel method to generate images from semantically rich region descriptions, and a multi-modal region feature matching loss to encourage semantic image-text matching. Our results demonstrate the capability of our approach to generate plausible images of complex scenes using region captions.","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"35 1","pages":"395-406"},"PeriodicalIF":0.0,"publicationDate":"2022-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79365337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
MetaAudio: A Few-Shot Audio Classification Benchmark MetaAudio:少量音频分类基准
Calum Heggan, S. Budgett, Timothy M. Hospedales, Mehrdad Yaghoobi
Currently available benchmarks for few-shot learning (machine learning with few training examples) are limited in the domains they cover, primarily focusing on image classification. This work aims to alleviate this reliance on image-based benchmarks by offering the first comprehensive, public and fully reproducible audio based alternative, covering a variety of sound domains and experimental settings. We compare the few-shot classification performance of a variety of techniques on seven audio datasets (spanning environmental sounds to human-speech). Extending this, we carry out in-depth analyses of joint training (where all datasets are used during training) and cross-dataset adaptation protocols, establishing the possibility of a generalised audio few-shot classification algorithm. Our experimentation shows gradient-based meta-learning methods such as MAML and Meta-Curvature consistently outperform both metric and baseline methods. We also demonstrate that the joint training routine helps overall generalisation for the environmental sound databases included, as well as being a somewhat-effective method of tackling the cross-dataset/domain setting.
目前可用的few-shot学习(具有少量训练样例的机器学习)基准在其涵盖的领域中是有限的,主要集中在图像分类上。这项工作旨在通过提供第一个全面、公开和完全可复制的基于音频的替代方案,覆盖各种声音域和实验设置,减轻对基于图像的基准的依赖。我们比较了各种技术在七个音频数据集(从环境声音到人类语音)上的少镜头分类性能。在此基础上,我们对联合训练(在训练期间使用所有数据集)和跨数据集适应协议进行了深入分析,建立了通用音频少镜头分类算法的可能性。我们的实验表明,基于梯度的元学习方法(如MAML和meta-曲率)始终优于度量和基线方法。我们还证明,联合训练例程有助于所包括的环境声音数据库的总体泛化,同时也是处理跨数据集/域设置的有效方法。
{"title":"MetaAudio: A Few-Shot Audio Classification Benchmark","authors":"Calum Heggan, S. Budgett, Timothy M. Hospedales, Mehrdad Yaghoobi","doi":"10.48550/arXiv.2204.02121","DOIUrl":"https://doi.org/10.48550/arXiv.2204.02121","url":null,"abstract":"Currently available benchmarks for few-shot learning (machine learning with few training examples) are limited in the domains they cover, primarily focusing on image classification. This work aims to alleviate this reliance on image-based benchmarks by offering the first comprehensive, public and fully reproducible audio based alternative, covering a variety of sound domains and experimental settings. We compare the few-shot classification performance of a variety of techniques on seven audio datasets (spanning environmental sounds to human-speech). Extending this, we carry out in-depth analyses of joint training (where all datasets are used during training) and cross-dataset adaptation protocols, establishing the possibility of a generalised audio few-shot classification algorithm. Our experimentation shows gradient-based meta-learning methods such as MAML and Meta-Curvature consistently outperform both metric and baseline methods. We also demonstrate that the joint training routine helps overall generalisation for the environmental sound databases included, as well as being a somewhat-effective method of tackling the cross-dataset/domain setting.","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"41 1","pages":"219-230"},"PeriodicalIF":0.0,"publicationDate":"2022-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86763804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
SAM-kNN Regressor for Online Learning in Water Distribution Networks 基于SAM-kNN回归器的配水网络在线学习
Jonathan Jakob, André Artelt, M. Hasenjäger, Barbara Hammer
. Water distribution networks are a key component of modern infrastructure for housing and industry. They transport and distribute water via widely branched networks from sources to consumers. In order to guarantee a working network at all times, the water supply company continuously monitors the network and takes actions when necessary – e.g. reacting to leakages, sensor faults and drops in water quality. Since real world networks are too large and complex to be monitored by a human, algorithmic monitoring systems have been developed. A popular type of such systems are residual based anomaly detection systems that can detect events such as leakages and sensor faults. For a continuous high quality monitoring, it is necessary for these systems to adapt to changed demands and presence of various anomalies. In this work, we propose an adaption of the incremental SAM-kNN classifier for regression to build a residual based anomaly detection system for water distribution networks that is able to adapt to any kind of change.
.配水网络是现代住宅和工业基础设施的重要组成部分。它们通过广泛的分支网络将水从水源输送到消费者。为了保证供水网络始终正常工作,供水公司持续监控供水网络,并在必要时采取行动,例如对泄漏、传感器故障和水质下降做出反应。由于现实世界的网络过于庞大和复杂,无法由人类进行监控,因此开发了算法监控系统。这种系统的一种流行类型是基于残余的异常检测系统,它可以检测泄漏和传感器故障等事件。为了实现持续的高质量监测,这些系统必须适应不断变化的需求和各种异常的存在。在这项工作中,我们提出将增量SAM-kNN分类器用于回归,以构建一个能够适应任何类型变化的基于残差的配水网络异常检测系统。
{"title":"SAM-kNN Regressor for Online Learning in Water Distribution Networks","authors":"Jonathan Jakob, André Artelt, M. Hasenjäger, Barbara Hammer","doi":"10.48550/arXiv.2204.01436","DOIUrl":"https://doi.org/10.48550/arXiv.2204.01436","url":null,"abstract":". Water distribution networks are a key component of modern infrastructure for housing and industry. They transport and distribute water via widely branched networks from sources to consumers. In order to guarantee a working network at all times, the water supply company continuously monitors the network and takes actions when necessary – e.g. reacting to leakages, sensor faults and drops in water quality. Since real world networks are too large and complex to be monitored by a human, algorithmic monitoring systems have been developed. A popular type of such systems are residual based anomaly detection systems that can detect events such as leakages and sensor faults. For a continuous high quality monitoring, it is necessary for these systems to adapt to changed demands and presence of various anomalies. In this work, we propose an adaption of the incremental SAM-kNN classifier for regression to build a residual based anomaly detection system for water distribution networks that is able to adapt to any kind of change.","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"91 6 1","pages":"752-762"},"PeriodicalIF":0.0,"publicationDate":"2022-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89443601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Message Passing Neural Networks for Hypergraphs 超图的消息传递神经网络
Sajjad Heydari, L. Livi
{"title":"Message Passing Neural Networks for Hypergraphs","authors":"Sajjad Heydari, L. Livi","doi":"10.1007/978-3-031-15931-2_48","DOIUrl":"https://doi.org/10.1007/978-3-031-15931-2_48","url":null,"abstract":"","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"64 1","pages":"583-592"},"PeriodicalIF":0.0,"publicationDate":"2022-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80344310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection 基于注意机制的改进轻量级YOLOv5人脸检测模型
Sheng Xu
Coronavirus 2019 has brought severe challenges to social stability and public health worldwide. One effective way of curbing the epidemic is to require people to wear masks in public places and monitor mask-wearing states by utilizing suitable automatic detectors. However, existing deep learning based models struggle to simultaneously achieve the requirements of both high precision and real-time performance. To solve this problem, we propose an improved lightweight face mask detector based on YOLOv5, which can achieve an excellent balance of precision and speed. Firstly, a novel backbone ShuffleCANet that combines ShuffleNetV2 network with Coordinate Attention mechanism is proposed as the backbone. Afterwards, an efficient path aggression network BiFPN is applied as the feature fusion neck. Furthermore, the localization loss is replaced with α-CIoU in model training phase to obtain higher-quality anchors. Some valuable strategies such as data augmentation, adaptive image scaling, and anchor cluster operation are also utilized. Experimental results on AIZOO face mask dataset show the superiority of the proposed model. Compared with the original YOLOv5, the proposed model increases the inference speed by 28.3% while still improving the precision by 0.58%. It achieves the best mean average precision of 95.2% compared with other seven existing models, which is 4.4% higher than the baseline.
新冠肺炎疫情给全球社会稳定和公共卫生带来严峻挑战。控制疫情的有效途径之一是要求人们在公共场所佩戴口罩,并利用适当的自动探测器监测佩戴口罩的国家。然而,现有的基于深度学习的模型很难同时满足高精度和实时性的要求。为了解决这一问题,我们提出了一种基于YOLOv5的改进型轻型口罩检测器,它可以实现精度和速度的良好平衡。首先,提出了一种将ShuffleNetV2网络与坐标注意机制相结合的新型骨干网络ShuffleCANet;然后,采用一种高效的路径攻击网络BiFPN作为特征融合颈部。在模型训练阶段用α-CIoU代替局部化损失,获得更高质量的锚点。此外,还采用了一些有价值的策略,如数据增强、自适应图像缩放和锚簇操作。在AIZOO人脸数据集上的实验结果表明了该模型的优越性。与原来的YOLOv5模型相比,该模型的推理速度提高了28.3%,精度仍提高了0.58%。与其他7种现有模型相比,该模型的平均精度达到95.2%,比基线高4.4%。
{"title":"An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection","authors":"Sheng Xu","doi":"10.48550/arXiv.2203.16506","DOIUrl":"https://doi.org/10.48550/arXiv.2203.16506","url":null,"abstract":"Coronavirus 2019 has brought severe challenges to social stability and public health worldwide. One effective way of curbing the epidemic is to require people to wear masks in public places and monitor mask-wearing states by utilizing suitable automatic detectors. However, existing deep learning based models struggle to simultaneously achieve the requirements of both high precision and real-time performance. To solve this problem, we propose an improved lightweight face mask detector based on YOLOv5, which can achieve an excellent balance of precision and speed. Firstly, a novel backbone ShuffleCANet that combines ShuffleNetV2 network with Coordinate Attention mechanism is proposed as the backbone. Afterwards, an efficient path aggression network BiFPN is applied as the feature fusion neck. Furthermore, the localization loss is replaced with α-CIoU in model training phase to obtain higher-quality anchors. Some valuable strategies such as data augmentation, adaptive image scaling, and anchor cluster operation are also utilized. Experimental results on AIZOO face mask dataset show the superiority of the proposed model. Compared with the original YOLOv5, the proposed model increases the inference speed by 28.3% while still improving the precision by 0.58%. It achieves the best mean average precision of 95.2% compared with other seven existing models, which is 4.4% higher than the baseline.","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"3 1","pages":"531-543"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84130603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A Spiking Neural Network based on Neural Manifold for Augmenting Intracortical Brain-Computer Interface Data 基于神经流形的脉冲神经网络增强脑机接口数据
Shengjie Zheng, Wenyi Li, Lang Qian, Che He, Xiaojian Li
ql20@mails.tsinghua.edu.cn Abstract. Brain-computer interfaces (BCIs), transform neural signals in the brain into instructions to control external devices. However, obtaining sufficient training data is difficult as well as limited. With the advent of advanced machine learning methods, the capability of brain-computer interfaces has been enhanced like never before, however, these methods require a large amount of data for training and thus require data augmentation of the limited data available. Here, we use spiking neural networks (SNN) as data generators. It is touted as the next-generation neural network and is considered as one of the algorithms oriented to general artificial intelligence because it borrows the neural information processing from biological neurons. We use the SNN to generate neural spike information that is bio-interpretable and conforms to the intrinsic patterns in the original neural data. Experiments show that the model can direct-ly synthesize new spike trains, which in turn improves the generalization ability of the BCI decoder. Both the input and output of the spiking neural model are spike information, which is a brain-inspired intelligence approach that can be better integrated with BCI in the specific patterns of neural population activity rather than on individual neurons[4]. The neural population dynamics exist in low-dimensional neural manifolds in a high-dimensional neural space[5]. Here, we employ a bio-interpretive SNN that mimics the neural information generation as well as the com-munication of biological neural populations. We analyze motor cortical neural population data recorded from monkeys to derive motor-related neural population dynamics. The neural spike properties of the SNN itself allow the direct generation of biologically meaningful spike trains that match the activity of real biological neural populations. We explored the interaction between the spike train synthesizer and the BCI decoder. Our results show that based on a small amount of training data as a template, data conforming to the dynamics of neural populations are generated, thus enhancing the decoding ability of the BCI decoder.
ql20@mails.tsinghua.edu.cn抽象。脑机接口(bci),将大脑中的神经信号转换为控制外部设备的指令。然而,获得足够的训练数据是困难的,也是有限的。随着先进的机器学习方法的出现,脑机接口的能力得到了前所未有的增强,然而,这些方法需要大量的数据进行训练,因此需要对有限的可用数据进行数据扩充。在这里,我们使用尖峰神经网络(SNN)作为数据生成器。它借鉴了生物神经元的神经信息处理,被誉为下一代神经网络,被认为是面向通用人工智能的算法之一。我们使用SNN生成生物可解释的神经尖峰信息,并符合原始神经数据中的固有模式。实验表明,该模型可以直接合成新的尖峰序列,从而提高了BCI解码器的泛化能力。尖峰神经模型的输入和输出都是尖峰信息,这是一种大脑启发的智能方法,可以在神经群体活动的特定模式中更好地与BCI集成,而不是在单个神经元上[4]。神经种群动态存在于高维神经空间中的低维神经流形中[5]。在这里,我们采用了一种生物解释SNN,它模拟了神经信息的生成以及生物神经群体的交流。我们分析了从猴子记录的运动皮质神经种群数据,得出运动相关的神经种群动态。SNN本身的神经尖峰特性允许直接产生与真实生物神经群的活动相匹配的具有生物学意义的尖峰序列。我们探索了尖峰串合成器和BCI解码器之间的相互作用。我们的研究结果表明,以少量的训练数据为模板,生成符合神经种群动态的数据,从而增强了BCI解码器的解码能力。
{"title":"A Spiking Neural Network based on Neural Manifold for Augmenting Intracortical Brain-Computer Interface Data","authors":"Shengjie Zheng, Wenyi Li, Lang Qian, Che He, Xiaojian Li","doi":"10.48550/arXiv.2204.05132","DOIUrl":"https://doi.org/10.48550/arXiv.2204.05132","url":null,"abstract":"ql20@mails.tsinghua.edu.cn Abstract. Brain-computer interfaces (BCIs), transform neural signals in the brain into instructions to control external devices. However, obtaining sufficient training data is difficult as well as limited. With the advent of advanced machine learning methods, the capability of brain-computer interfaces has been enhanced like never before, however, these methods require a large amount of data for training and thus require data augmentation of the limited data available. Here, we use spiking neural networks (SNN) as data generators. It is touted as the next-generation neural network and is considered as one of the algorithms oriented to general artificial intelligence because it borrows the neural information processing from biological neurons. We use the SNN to generate neural spike information that is bio-interpretable and conforms to the intrinsic patterns in the original neural data. Experiments show that the model can direct-ly synthesize new spike trains, which in turn improves the generalization ability of the BCI decoder. Both the input and output of the spiking neural model are spike information, which is a brain-inspired intelligence approach that can be better integrated with BCI in the specific patterns of neural population activity rather than on individual neurons[4]. The neural population dynamics exist in low-dimensional neural manifolds in a high-dimensional neural space[5]. Here, we employ a bio-interpretive SNN that mimics the neural information generation as well as the com-munication of biological neural populations. We analyze motor cortical neural population data recorded from monkeys to derive motor-related neural population dynamics. The neural spike properties of the SNN itself allow the direct generation of biologically meaningful spike trains that match the activity of real biological neural populations. We explored the interaction between the spike train synthesizer and the BCI decoder. Our results show that based on a small amount of training data as a template, data conforming to the dynamics of neural populations are generated, thus enhancing the decoding ability of the BCI decoder.","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"11 1","pages":"519-530"},"PeriodicalIF":0.0,"publicationDate":"2022-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88606360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Using Orientation to Distinguish Overlapping Chromosomes 利用取向区分重叠染色体
Daniel Kluvanec, Thomas B. Phillips, K. McCaffrey, N. A. Moubayed
A difficult step in the process of karyotyping is segmenting chromosomes that touch or overlap. In an attempt to automate the process, previous studies turned to Deep Learning methods, with some formulating the task as a semantic segmentation problem. These models treat separate chromosome instances as semantic classes, which we show to be problematic, since it is uncertain which chromosome should be classed as #1 and #2. Assigning class labels based on comparison rules, such as the shorter/longer chromosome alleviates, but does not fully re-solve the issue. Instead, we separate the chromosome instances in a second stage, predict-ing the orientation of the chromosomes by the model and use it as one of the key distinguishing factors of the chromosomes. We demonstrate this method to be effective. Furthermore, we introduce a novel Double-Angle representation that a neural network can use to predict the orientation. The representation maps any direction and its reverse to the same point. Lastly, we present a new expanded synthetic dataset, which is based on Pommier’s dataset, but ad-dresses its issues with insufficient separation between its training and testing sets.
染色体组型的一个困难步骤是分割接触或重叠的染色体。为了使这一过程自动化,之前的研究转向了深度学习方法,其中一些将任务表述为语义分割问题。这些模型将单独的染色体实例视为语义类,我们认为这是有问题的,因为不确定哪条染色体应该被分类为#1和#2。根据比较规则分配类标签,例如染色体较短/较长,可以缓解,但不能完全解决这个问题。相反,我们在第二阶段分离染色体实例,通过模型预测染色体的方向,并将其作为染色体的关键区分因素之一。我们证明这种方法是有效的。此外,我们引入了一种新的双角度表示,神经网络可以使用它来预测方向。表示将任何方向及其反向映射到同一点。最后,我们提出了一个新的扩展合成数据集,该数据集基于Pommier的数据集,但解决了其训练集和测试集之间分离不足的问题。
{"title":"Using Orientation to Distinguish Overlapping Chromosomes","authors":"Daniel Kluvanec, Thomas B. Phillips, K. McCaffrey, N. A. Moubayed","doi":"10.48550/arXiv.2203.13004","DOIUrl":"https://doi.org/10.48550/arXiv.2203.13004","url":null,"abstract":"A difficult step in the process of karyotyping is segmenting chromosomes that touch or overlap. In an attempt to automate the process, previous studies turned to Deep Learning methods, with some formulating the task as a semantic segmentation problem. These models treat separate chromosome instances as semantic classes, which we show to be problematic, since it is uncertain which chromosome should be classed as #1 and #2. Assigning class labels based on comparison rules, such as the shorter/longer chromosome alleviates, but does not fully re-solve the issue. Instead, we separate the chromosome instances in a second stage, predict-ing the orientation of the chromosomes by the model and use it as one of the key distinguishing factors of the chromosomes. We demonstrate this method to be effective. Furthermore, we introduce a novel Double-Angle representation that a neural network can use to predict the orientation. The representation maps any direction and its reverse to the same point. Lastly, we present a new expanded synthetic dataset, which is based on Pommier’s dataset, but ad-dresses its issues with insufficient separation between its training and testing sets.","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"98 1","pages":"391-403"},"PeriodicalIF":0.0,"publicationDate":"2022-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80872938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Self-Supervised Anomaly Detection by Self-Distillation and Negative Sampling 基于自蒸馏和负采样的自监督异常检测
Nima Rafiee, Rahil Gholamipoorfard, Nikolas Adaloglou, Simon Jaxy, Julius Ramakers, M. Kollmann
{"title":"Self-Supervised Anomaly Detection by Self-Distillation and Negative Sampling","authors":"Nima Rafiee, Rahil Gholamipoorfard, Nikolas Adaloglou, Simon Jaxy, Julius Ramakers, M. Kollmann","doi":"10.1007/978-3-031-15937-4_39","DOIUrl":"https://doi.org/10.1007/978-3-031-15937-4_39","url":null,"abstract":"","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"282 1","pages":"459-470"},"PeriodicalIF":0.0,"publicationDate":"2022-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86392509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Chinese Character Style Transfer Model Based on Convolutional Neural Network 基于卷积神经网络的汉字风格迁移模型
Weiran Chen, Chunping Liu, Yi Ji
{"title":"Chinese Character Style Transfer Model Based on Convolutional Neural Network","authors":"Weiran Chen, Chunping Liu, Yi Ji","doi":"10.1007/978-3-031-15937-4_47","DOIUrl":"https://doi.org/10.1007/978-3-031-15937-4_47","url":null,"abstract":"","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"30 1","pages":"558-569"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74001850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attentional Local Contrastive Learning for Face Forgery Detection 人脸伪造检测中的注意局部对比学习
Yunshu Dai, Jianwei Fei, Huaming Wang, Zhihua Xia
{"title":"Attentional Local Contrastive Learning for Face Forgery Detection","authors":"Yunshu Dai, Jianwei Fei, Huaming Wang, Zhihua Xia","doi":"10.1007/978-3-031-15919-0_59","DOIUrl":"https://doi.org/10.1007/978-3-031-15919-0_59","url":null,"abstract":"","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"41 1","pages":"709-721"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73185795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1