Pub Date : 2022-07-07 DOI: 10.48550/arXiv.2207.03172
Gabriele Lagani, C. Gennaro, Hannes Fassold, G. Amato
Learning algorithms for Deep Neural Networks are typically based on supervised end-to-end Stochastic Gradient Descent (SGD) training with error backpropagation (backprop). Backprop algorithms require a large number of labelled training samples to achieve high performance. However, in many realistic applications, even when image samples are plentiful, very few of them are labelled, and semi-supervised, sample-efficient training strategies have to be used. Hebbian learning represents a possible approach towards sample-efficient training; however, current solutions do not scale well to large datasets. In this paper, we present FastHebb, an efficient and scalable solution for Hebbian learning which achieves higher efficiency by 1) merging update computation and aggregation over a batch of inputs, and 2) leveraging efficient matrix multiplication algorithms on GPU. We validate our approach on different computer vision benchmarks in a semi-supervised learning scenario. FastHebb outperforms previous solutions by up to 50 times in terms of training speed, and, notably, for the first time we are able to bring Hebbian algorithms to ImageNet scale.
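The batching trick in point 1) can be illustrated with a minimal NumPy sketch (our own simplification, not the FastHebb implementation): the sum of per-sample Hebbian outer products over a batch collapses into a single matrix product, which is exactly the kind of GEMM operation that GPUs execute efficiently. The Oja-style decay term is a common bounded Hebbian variant, chosen here only for illustration.

```python
import numpy as np

def hebbian_batch_update(W, X, lr=0.01):
    """One batched Hebbian update (illustrative sketch, not FastHebb's code).

    W: (out, in) weight matrix; X: (batch, in) input batch.
    The per-sample outer products y_i x_i^T are never materialized:
    their sum over the batch is the single matrix product Y^T X,
    i.e. update computation and batch aggregation merged into one GEMM.
    """
    Y = X @ W.T                         # (batch, out) activations
    dW = Y.T @ X / X.shape[0]           # aggregated Hebbian term, one GEMM
    # Oja-style decay keeps the weights bounded (a standard Hebbian variant)
    dW -= (Y ** 2).mean(axis=0)[:, None] * W
    return W + lr * dW
```

The key design point is that the naive loop over samples and the single matrix product compute the same update, but only the latter maps onto optimized GPU matrix-multiplication kernels.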
Title: FastHebb: Scaling Hebbian Training of Deep Neural Networks to ImageNet Level
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 251-264
Pub Date : 2022-04-14 DOI: 10.48550/arXiv.2204.07221
Paras Sheth, Ruocheng Guo, Lu Cheng, Huan Liu, K. Candan
Recommender systems aim to recommend new items to users by learning user and item representations. In practice, these representations are highly entangled, as they consist of information about multiple factors, including users' interests and item attributes, along with confounding factors such as user conformity and item popularity. Relying on these entangled representations to infer user preference may lead to biased recommendations (e.g., the recommender model recommends popular items even if they do not align with the user's interests). Recent research proposes to debias by modeling a recommender system from a causal perspective. The exposure and the ratings are analogous to the treatment and the outcome in the causal inference framework, respectively. The critical challenge in this setting is accounting for the hidden confounders. These confounders are unobserved, making them hard to measure. On the other hand, since these confounders affect both the exposure and the ratings, it is essential to account for them when generating debiased recommendations. To better approximate hidden confounders, we propose to leverage network information (i.e., user-social and user-item networks), which has been shown to influence how users discover and interact with items. Aside from user conformity, aspects of confounding such as item popularity present in the network information are also captured in our method with the aid of causal disentanglement, which unravels the learned representations into independent factors that are responsible for (a) modeling the exposure of an item to the user, (b) predicting the ratings, and (c) controlling the hidden confounders. Experiments on real-world datasets validate the effectiveness of the proposed model for debiasing recommender systems.
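The three-way factor split in (a)-(c) can be sketched abstractly. All names, block sizes, and scoring functions below are our own illustrative assumptions, not the paper's architecture; the point is only the structure: each embedding is partitioned into an exposure block, a rating block, and a confounder block, and the confounder block feeds both the exposure and the rating scores, mirroring how a hidden confounder affects both treatment and outcome.

```python
import numpy as np

# Illustrative sketch of the disentangled-embedding idea; the block
# sizes and dot-product scores are assumptions, not the paper's model.
SPLITS = (4, 4, 4)  # exposure | rating | confounder block sizes

def split_factors(z):
    """Partition an embedding into its three independent factor blocks."""
    a, b, _ = SPLITS
    return z[:a], z[a:a + b], z[a + b:]

def exposure_score(z_user, z_item):
    e_u, _, c_u = split_factors(z_user)
    e_i, _, c_i = split_factors(z_item)
    # exposure (the "treatment") depends on exposure factors AND confounders
    return float(e_u @ e_i + c_u @ c_i)

def rating_score(z_user, z_item):
    _, r_u, c_u = split_factors(z_user)
    _, r_i, c_i = split_factors(z_item)
    # rating (the "outcome") depends on rating factors and the SAME confounders
    return float(r_u @ r_i + c_u @ c_i)
```

Because the confounder block enters both scores while the exposure and rating blocks stay separate, debiasing can, in this toy picture, rank items by the rating factors alone while the confounder block absorbs popularity and conformity effects.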
Title: Causal Disentanglement with Network Information for Debiased Recommendations
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 265-273
Pub Date : 2022-01-01 DOI: 10.48550/arXiv.2208.08910
Jaroslav Olha, Terézia Slanináková, Martin Gendiar, Matej Antol, Vlastislav Dohnal
Despite the constant evolution of similarity search research, it continues to face the same challenges stemming from the complexity of the data, such as the curse of dimensionality and computationally expensive distance functions. Various machine learning techniques have proven capable of replacing elaborate mathematical models with combinations of simple linear functions, often gaining speed and simplicity at the cost of formal guarantees of accuracy and correctness of querying. The authors explore the potential of this research trend by presenting a lightweight solution for the complex problem of 3D protein structure search. The solution consists of three steps: (i) transformation of 3D protein structural information into very compact vectors, (ii) use of a probabilistic model to group these vectors and respond to queries by returning a given number of similar objects, and (iii) a final filtering step which applies basic vector distance functions to refine the result.
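The three-step pipeline above can be sketched end to end in a few lines. This is a simplification under our own assumptions: random vectors stand in for the compact protein embeddings of step (i), and plain k-means stands in for the probabilistic grouping model of step (ii).

```python
import numpy as np

rng = np.random.default_rng(0)
db = rng.normal(size=(200, 16))      # step (i): compact vectors (stand-in data)

def kmeans(X, k=4, iters=10):
    """Tiny Lloyd's k-means, a stand-in for the probabilistic grouping model."""
    C = X[rng.choice(len(X), k, replace=False)]
    for _ in range(iters):
        lab = np.argmin(((X[:, None] - C[None]) ** 2).sum(-1), axis=1)
        C = np.stack([X[lab == j].mean(0) if (lab == j).any() else C[j]
                      for j in range(k)])
    lab = np.argmin(((X[:, None] - C[None]) ** 2).sum(-1), axis=1)
    return C, lab

C, labels = kmeans(db)               # step (ii): group the vectors

def query(q, n=5):
    bucket = np.argmin(((C - q) ** 2).sum(-1))   # model predicts a group
    cand = np.where(labels == bucket)[0]         # candidate similar objects
    d = np.linalg.norm(db[cand] - q, axis=1)     # step (iii): cheap refinement
    return cand[np.argsort(d)[:n]]
```

The design trade-off matches the abstract: the model answers queries by scanning only one predicted group rather than the whole database, so correctness is no longer guaranteed when the true neighbors fall in another group, but each query touches far fewer vectors.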
Title: Learned Indexing in Proteins: Substituting Complex Distance Calculations with Embedding and Clustering Techniques
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 274-282
Pub Date : 2022-01-01 DOI: 10.1007/978-3-031-17849-8_23
G. Moro, Lorenzo Valgimigli, Alex Rossi, Cristiano Casadei, Andrea Montefiori
Title: Self-supervised Information Retrieval Trained from Self-generated Sets of Queries and Relevant Documents
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 283-290
Pub Date : 2022-01-01 DOI: 10.1007/978-3-031-17849-8_25
Alessandro Abluton
Title: Visual Recommendation and Visual Search for Fashion E-Commerce
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 299-304
Pub Date : 2022-01-01 DOI: 10.1007/978-3-031-17849-8_9
Lucia Vadicamo, A. Dearle, R. Connor
Title: On the Expected Exclusion Power of Binary Partitions for Metric Search
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 104-117
Pub Date : 2022-01-01 DOI: 10.1007/978-3-031-17849-8_8
Vladimir Mic, P. Zezula
Title: Concept of Relational Similarity Search
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 89-103
Pub Date : 2022-01-01 DOI: 10.1007/978-3-031-17849-8_17
F. Carrara, Lucia Vadicamo, C. Gennaro, G. Amato
Title: Approximate Nearest Neighbor Search on Standard Search Engines
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 214-221
Pub Date : 2022-01-01 DOI: 10.1007/978-3-031-17849-8_14
Ibrahim Chegrane, Imane Hocine, Saïd Yahiaoui, A. Bendjoudi, Nadia Nouali-Taboudjemat
Title: Graph Edit Distance Compacted Search Tree
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 181-189
Pub Date : 2022-01-01 DOI: 10.1007/978-3-031-17849-8_4
G. Moro, S. Salvatori
Title: Deep Vision-Language Model for Efficient Multi-modal Similarity Search in Fashion Retrieval
Published in: Similarity search and applications : proceedings of the ... International Conference on Similarity Search and Applications, pp. 40-53