首页 > 最新文献

Procedings of the British Machine Vision Conference 2016最新文献

英文 中文
Next-Best Stereo: Extending Next-Best View Optimisation For Collaborative Sensors 次优立体:扩展协同传感器的次优视图优化
Pub Date : 2016-08-01 DOI: 10.5244/C.30.65
Oscar Alejandro Mendez Maldonado, Simon Hadfield, N. Pugeault, R. Bowden
Most 3D reconstruction approaches passively optimise over all data, exhaustively matching pairs, rather than actively selecting data to process. This is costly both in terms of time and computer resources, and quickly becomes intractable for large datasets. This work proposes an approach to intelligently filter large amounts of data for 3D reconstructions of unknown scenes using monocular cameras. Our contributions are twofold: First, we present a novel approach to efficiently optimise the Next-Best View ( NBV ) in terms of accuracy and coverage using partial scene geometry. Second, we extend this to intelligently selecting stereo pairs by jointly optimising the baseline and vergence to find the NBV ’s best stereo pair to perform reconstruction. Both contributions are extremely efficient, taking 0.8ms and 0.3ms per pose, respectively. Experimental evaluation shows that the proposed method allows efficient selection of stereo pairs for reconstruction, such that a dense model can be obtained with only a small number of images. Once a complete model has been obtained, the remaining computational budget is used to intelligently refine areas of uncertainty, achieving results comparable to state-of-the-art batch approaches on the Middlebury dataset, using as little as 3.8% of the views.
大多数3D重建方法被动地优化所有数据,彻底匹配成对,而不是主动选择数据进行处理。这在时间和计算机资源方面都是昂贵的,并且对于大型数据集来说很快就变得难以处理。这项工作提出了一种使用单目相机智能过滤大量数据用于未知场景的3D重建的方法。我们的贡献有两个方面:首先,我们提出了一种利用局部场景几何结构在精度和覆盖范围方面有效优化次优视图(NBV)的新方法。其次,我们将其扩展到智能选择立体对,通过联合优化基线和收敛来找到NBV的最佳立体对进行重建。这两种贡献都非常有效,每个姿势分别花费0.8ms和0.3ms。实验结果表明,该方法可以有效地选择用于重建的立体对,从而在少量图像的情况下获得密集的模型。一旦获得完整的模型,剩余的计算预算将用于智能地细化不确定区域,从而获得与Middlebury数据集上最先进的批处理方法相当的结果,仅使用3.8%的视图。
{"title":"Next-Best Stereo: Extending Next-Best View Optimisation For Collaborative Sensors","authors":"Oscar Alejandro Mendez Maldonado, Simon Hadfield, N. Pugeault, R. Bowden","doi":"10.5244/C.30.65","DOIUrl":"https://doi.org/10.5244/C.30.65","url":null,"abstract":"Most 3D reconstruction approaches passively optimise over all data, exhaustively matching pairs, rather than actively selecting data to process. This is costly both in terms of time and computer resources, and quickly becomes intractable for large datasets. This work proposes an approach to intelligently filter large amounts of data for 3D reconstructions of unknown scenes using monocular cameras. Our contributions are twofold: First, we present a novel approach to efficiently optimise the Next-Best View ( NBV ) in terms of accuracy and coverage using partial scene geometry. Second, we extend this to intelligently selecting stereo pairs by jointly optimising the baseline and vergence to find the NBV ’s best stereo pair to perform reconstruction. Both contributions are extremely efficient, taking 0.8ms and 0.3ms per pose, respectively. Experimental evaluation shows that the proposed method allows efficient selection of stereo pairs for reconstruction, such that a dense model can be obtained with only a small number of images. Once a complete model has been obtained, the remaining computational budget is used to intelligently refine areas of uncertainty, achieving results comparable to state-of-the-art batch approaches on the Middlebury dataset, using as little as 3.8% of the views.","PeriodicalId":125761,"journal":{"name":"Procedings of the British Machine Vision Conference 2016","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130817690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Learning Neural Network Architectures using Backpropagation 使用反向传播学习神经网络架构
Pub Date : 2015-11-17 DOI: 10.5244/C.30.104
Suraj Srinivas, R. Venkatesh Babu
Deep neural networks with millions of parameters are at the heart of many state of the art machine learning models today. However, recent works have shown that models with much smaller number of parameters can also perform just as well. In this work, we introduce the problem of architecture-learning, i.e; learning the architecture of a neural network along with weights. We introduce a new trainable parameter called tri-state ReLU, which helps in eliminating unnecessary neurons. We also propose a smooth regularizer which encourages the total number of neurons after elimination to be small. The resulting objective is differentiable and simple to optimize. We experimentally validate our method on both small and large networks, and show that it can learn models with a considerably small number of parameters without affecting prediction accuracy.
拥有数百万个参数的深度神经网络是当今许多最先进的机器学习模型的核心。然而,最近的研究表明,参数数量少得多的模型也可以表现得很好。在这项工作中,我们引入了建筑学习的问题,即;学习神经网络的结构和权重。我们引入了一个新的可训练参数,称为三状态ReLU,它有助于消除不必要的神经元。我们还提出了一个平滑正则器,它鼓励消除后的神经元总数变小。得到的目标是可微的,易于优化。我们在小型和大型网络上实验验证了我们的方法,并表明它可以在不影响预测精度的情况下学习具有相当少参数的模型。
{"title":"Learning Neural Network Architectures using Backpropagation","authors":"Suraj Srinivas, R. Venkatesh Babu","doi":"10.5244/C.30.104","DOIUrl":"https://doi.org/10.5244/C.30.104","url":null,"abstract":"Deep neural networks with millions of parameters are at the heart of many state of the art machine learning models today. However, recent works have shown that models with much smaller number of parameters can also perform just as well. In this work, we introduce the problem of architecture-learning, i.e; learning the architecture of a neural network along with weights. We introduce a new trainable parameter called tri-state ReLU, which helps in eliminating unnecessary neurons. We also propose a smooth regularizer which encourages the total number of neurons after elimination to be small. The resulting objective is differentiable and simple to optimize. We experimentally validate our method on both small and large networks, and show that it can learn models with a considerably small number of parameters without affecting prediction accuracy.","PeriodicalId":125761,"journal":{"name":"Procedings of the British Machine Vision Conference 2016","volume":"449 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131799223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Multi-H: Efficient recovery of tangent planes in stereo images Multi-H:立体图像中切平面的高效恢复
Pub Date : 1900-01-01 DOI: 10.5244/C.30.13
D. Baráth, Jiri Matas, Levente Hajder
Multi-H – an efficient method for the recovery of the tangent planes of a set of point correspondences satisfying the epipolar constraint is proposed. The problem is formulated as a search for a labeling minimizing an energy that includes a data and spatial regularization terms. The number of planes is controlled by a combination of MeanShift [6] and α-expansion [3]. Experiments on the fountain-P11 3D dataset show that Multi-H provides highly accurate tangent plane estimates. It also outperforms all state-of-the-art techniques for multihomography estimation on the publicly available AdelaideRMF dataset. Since Multi-H achieves nearly error-free performance, we introduce and make public a more challenging dataset for multi-plane fitting evaluation.
提出了一种恢复满足极外约束的一组点对应的切平面的有效方法- Multi-H。这个问题被表述为寻找一个包含数据和空间正则化项的最小化能量的标记。平面数由MeanShift[6]和α-expansion[3]联合控制。在fountain-P11三维数据集上的实验表明,Multi-H算法提供了高精度的切平面估计。它还优于所有最先进的技术,在公开可用的AdelaideRMF数据集上进行多单应性估计。由于Multi-H实现了几乎无误差的性能,我们引入并公开了一个更具挑战性的数据集,用于多平面拟合评估。
{"title":"Multi-H: Efficient recovery of tangent planes in stereo images","authors":"D. Baráth, Jiri Matas, Levente Hajder","doi":"10.5244/C.30.13","DOIUrl":"https://doi.org/10.5244/C.30.13","url":null,"abstract":"Multi-H – an efficient method for the recovery of the tangent planes of a set of point correspondences satisfying the epipolar constraint is proposed. The problem is formulated as a search for a labeling minimizing an energy that includes a data and spatial regularization terms. The number of planes is controlled by a combination of MeanShift [6] and α-expansion [3]. Experiments on the fountain-P11 3D dataset show that Multi-H provides highly accurate tangent plane estimates. It also outperforms all state-of-the-art techniques for multihomography estimation on the publicly available AdelaideRMF dataset. Since Multi-H achieves nearly error-free performance, we introduce and make public a more challenging dataset for multi-plane fitting evaluation.","PeriodicalId":125761,"journal":{"name":"Procedings of the British Machine Vision Conference 2016","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134261174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Localizing Periodicity in Time Series and Videos 时间序列和视频的周期性局部化
Pub Date : 1900-01-01 DOI: 10.5244/C.30.47
Giorgos Karvounas, I. Oikonomidis, Antonis A. Argyros
Periodicity detection is a problem that has received a lot of attention, thus several important tools exist to analyse purely periodic signals. However, in many real world scenarios (time series, videos of human activities, etc) periodic signals appear in the context of non-periodic ones. In this work we propose a method that, given a time series representing a periodic signal that has a non-periodic prefix and tail, estimates the start, the end and the period of the periodic part of the signal. We formulate this as an optimization problem that is solved based on evolutionary optimization techniques. Quantitative experiments on synthetic data demonstrate that the proposed method is successful in localizing the periodic part of a signal and exhibits robustness in the presence of noisy measurements. Also, it does so even when the periodic part of the signal is too short compared to its non-periodic prefix and tail. We also provide quantitative and qualitative results obtained from the application of the proposed method to the problem of unsupervised localization and segmentation of periodic activities in real world videos.
周期性检测是一个受到广泛关注的问题,因此存在一些重要的工具来分析纯周期信号。然而,在许多现实世界的场景中(时间序列、人类活动的视频等),周期性信号出现在非周期性信号的背景下。在这项工作中,我们提出了一种方法,给定一个时间序列表示一个具有非周期前缀和尾部的周期信号,估计信号周期部分的开始,结束和周期。我们将其表述为基于进化优化技术解决的优化问题。合成数据的定量实验表明,该方法能够成功地定位信号的周期部分,并且在存在噪声测量的情况下具有鲁棒性。此外,即使信号的周期部分与非周期前缀和尾部相比太短,它也会这样做。我们还提供了将该方法应用于现实世界视频中周期性活动的无监督定位和分割问题所获得的定量和定性结果。
{"title":"Localizing Periodicity in Time Series and Videos","authors":"Giorgos Karvounas, I. Oikonomidis, Antonis A. Argyros","doi":"10.5244/C.30.47","DOIUrl":"https://doi.org/10.5244/C.30.47","url":null,"abstract":"Periodicity detection is a problem that has received a lot of attention, thus several important tools exist to analyse purely periodic signals. However, in many real world scenarios (time series, videos of human activities, etc) periodic signals appear in the context of non-periodic ones. In this work we propose a method that, given a time series representing a periodic signal that has a non-periodic prefix and tail, estimates the start, the end and the period of the periodic part of the signal. We formulate this as an optimization problem that is solved based on evolutionary optimization techniques. Quantitative experiments on synthetic data demonstrate that the proposed method is successful in localizing the periodic part of a signal and exhibits robustness in the presence of noisy measurements. Also, it does so even when the periodic part of the signal is too short compared to its non-periodic prefix and tail. We also provide quantitative and qualitative results obtained from the application of the proposed method to the problem of unsupervised localization and segmentation of periodic activities in real world videos.","PeriodicalId":125761,"journal":{"name":"Procedings of the British Machine Vision Conference 2016","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115066343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Variational Weakly Supervised Gaussian Processes 变分弱监督高斯过程
Pub Date : 1900-01-01 DOI: 10.5244/C.30.71
M. Kandemir, Manuel Haussmann, Ferran Diego, K. Rajamani, J. Laak, F. Hamprecht
We introduce the first model to perform weakly supervised learning with Gaussian processes on up to millions of instances. The key ingredient to achieve this scalability is to replace the standard assumption of MIL that the bag-level prediction is the maximum of instance-level estimates with the accumulated evidence of instances within a bag. This enables us to devise a novel variational inference scheme that operates solely by closedform updates. Keeping all its parameters but one fixed, our model updates the remaining parameter to the global optimum. This virtue leads to charmingly fast convergence, fitting perfectly to large-scale learning setups. Our model performs significantly better in two medical applications than adaptation of GPMIL to scalable inference and various scalable MIL algorithms. It also proves to be very competitive in object classification against state-of-the-art adaptations of deep learning to weakly supervised learning.
我们引入了第一个在多达数百万个实例上使用高斯过程执行弱监督学习的模型。实现这种可伸缩性的关键因素是将MIL的标准假设(即包级预测是实例级估计的最大值)替换为包内实例的累积证据。这使我们能够设计一种新的变分推理方案,该方案仅通过封闭形式的更新来操作。模型保持所有参数不变,只保留一个参数不变,将剩余参数更新为全局最优。这种优点导致了迷人的快速收敛,完全适合大规模的学习设置。我们的模型在两种医疗应用中表现明显优于GPMIL对可扩展推理和各种可扩展MIL算法的适应。它也被证明在对象分类方面与深度学习对弱监督学习的最新适应非常有竞争力。
{"title":"Variational Weakly Supervised Gaussian Processes","authors":"M. Kandemir, Manuel Haussmann, Ferran Diego, K. Rajamani, J. Laak, F. Hamprecht","doi":"10.5244/C.30.71","DOIUrl":"https://doi.org/10.5244/C.30.71","url":null,"abstract":"We introduce the first model to perform weakly supervised learning with Gaussian processes on up to millions of instances. The key ingredient to achieve this scalability is to replace the standard assumption of MIL that the bag-level prediction is the maximum of instance-level estimates with the accumulated evidence of instances within a bag. This enables us to devise a novel variational inference scheme that operates solely by closedform updates. Keeping all its parameters but one fixed, our model updates the remaining parameter to the global optimum. This virtue leads to charmingly fast convergence, fitting perfectly to large-scale learning setups. Our model performs significantly better in two medical applications than adaptation of GPMIL to scalable inference and various scalable MIL algorithms. It also proves to be very competitive in object classification against state-of-the-art adaptations of deep learning to weakly supervised learning.","PeriodicalId":125761,"journal":{"name":"Procedings of the British Machine Vision Conference 2016","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133244300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Attention Networks for Weakly Supervised Object Localization 弱监督对象定位的注意网络
Pub Date : 1900-01-01 DOI: 10.5244/C.30.52
Eu Wern Teh, Mrigank Rochan, Yang Wang
We consider the problem of weakly supervised learning for object localization. Given a collection of images with image-level annotations indicating the presence/absence of an object, our goal is to localize the object in each image. We propose a neural network architecture called the attention network for this problem. Given a set of candidate regions in an image, the attention network first computes an attention score on each candidate region in the image. Then these candidate regions are combined together with their attention scores to form a whole-image feature vector. This feature vector is used for classifying the image. The object localization is implicitly achieved via the attention scores on candidate regions. We demonstrate that our approach achieves superior performance on several benchmark datasets.
研究了用于目标定位的弱监督学习问题。给定一组带有图像级注释的图像集合,指示对象的存在/不存在,我们的目标是在每个图像中定位对象。针对这一问题,我们提出了一种称为注意力网络的神经网络架构。给定图像中的一组候选区域,注意力网络首先计算图像中每个候选区域的注意力得分。然后将这些候选区域与它们的注意力分数组合在一起,形成一个完整的图像特征向量。该特征向量用于对图像进行分类。通过对候选区域的注意力得分隐式地实现目标定位。我们证明了我们的方法在几个基准数据集上取得了卓越的性能。
{"title":"Attention Networks for Weakly Supervised Object Localization","authors":"Eu Wern Teh, Mrigank Rochan, Yang Wang","doi":"10.5244/C.30.52","DOIUrl":"https://doi.org/10.5244/C.30.52","url":null,"abstract":"We consider the problem of weakly supervised learning for object localization. Given a collection of images with image-level annotations indicating the presence/absence of an object, our goal is to localize the object in each image. We propose a neural network architecture called the attention network for this problem. Given a set of candidate regions in an image, the attention network first computes an attention score on each candidate region in the image. Then these candidate regions are combined together with their attention scores to form a whole-image feature vector. This feature vector is used for classifying the image. The object localization is implicitly achieved via the attention scores on candidate regions. We demonstrate that our approach achieves superior performance on several benchmark datasets.","PeriodicalId":125761,"journal":{"name":"Procedings of the British Machine Vision Conference 2016","volume":"109 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121291423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 64
期刊
Procedings of the British Machine Vision Conference 2016
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1