首页 > 最新文献

Proceedings of the 6th Annual ACM Lifelog Search Challenge最新文献

英文 中文
MyEachtra: Event-Based Interactive Lifelog Retrieval System for LSC’23 基于事件的交互式生命日志检索系统[j]
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593100
Ly-Duyen Tran, Binh T. Nguyen, Liting Zhou, C. Gurrin
Retrieval is a fundamental challenge within the research community of lifelog and the Lifelog Search Challenge (LSC) has been an important annual benchmarking activity for interactive lifelog retrieval systems since 2018. This paper proposes MyEachtra (/mai-AK-truh/), a system designed for the upcoming LSC’23 workshop. Improved upon MyScéal, which was the top performing system from LSC’20 to LSC’22, MyEachtra includes modifications to address the challenges of non-owner user understanding of lifelog contexts and open-ended lifelog question answering. Specifically, MyEachtra shifts the focus from images to events as retrieval units. Events are segmented using location metadata as well as visual and time differences between successive images. A pilot study on different approaches to aggregate images into events was conducted to test the automatic performance of the system, which showed promising results. For known-item queries, showing only the top 3 events proved to be adequate to find relevant images. However, future evaluation of the performance for ad-hoc and question-answering queries is necessary for a complete analysis of the MyEachtra.
检索是生命日志研究界的一个基本挑战,自2018年以来,生命日志搜索挑战(LSC)一直是交互式生命日志检索系统的重要年度基准测试活动。本文提出了MyEachtra (/mai-AK-truh/),这是一个为即将到来的LSC ' 23研讨会设计的系统。MyEachtra改进了从LSC ' 20到LSC ' 22表现最好的mysc系统,包括修改,以解决非所有者用户对生活日志上下文的理解和开放式生活日志问题回答的挑战。具体来说,MyEachtra将焦点从图像转移到作为检索单元的事件。使用位置元数据以及连续图像之间的视觉和时间差异来分割事件。为了测试系统的自动性能,对不同的将图像聚合成事件的方法进行了初步研究,结果显示出很好的效果。对于已知项目查询,仅显示前3个事件已被证明足以找到相关图像。但是,为了对MyEachtra进行完整的分析,有必要对临时查询和问答查询的性能进行将来的评估。
{"title":"MyEachtra: Event-Based Interactive Lifelog Retrieval System for LSC’23","authors":"Ly-Duyen Tran, Binh T. Nguyen, Liting Zhou, C. Gurrin","doi":"10.1145/3592573.3593100","DOIUrl":"https://doi.org/10.1145/3592573.3593100","url":null,"abstract":"Retrieval is a fundamental challenge within the research community of lifelog and the Lifelog Search Challenge (LSC) has been an important annual benchmarking activity for interactive lifelog retrieval systems since 2018. This paper proposes MyEachtra (/mai-AK-truh/), a system designed for the upcoming LSC’23 workshop. Improved upon MyScéal, which was the top performing system from LSC’20 to LSC’22, MyEachtra includes modifications to address the challenges of non-owner user understanding of lifelog contexts and open-ended lifelog question answering. Specifically, MyEachtra shifts the focus from images to events as retrieval units. Events are segmented using location metadata as well as visual and time differences between successive images. A pilot study on different approaches to aggregate images into events was conducted to test the automatic performance of the system, which showed promising results. For known-item queries, showing only the top 3 events proved to be adequate to find relevant images. However, future evaluation of the performance for ad-hoc and question-answering queries is necessary for a complete analysis of the MyEachtra.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133036739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
MemoriEase: An Interactive Lifelog Retrieval System for LSC’23 记忆库:一种交互式生活日志检索系统
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593101
Quang-Linh Tran, Ly-Duyen Tran, Binh T. Nguyen, C. Gurrin
Lifelogging is an activity of recording all events that happen in the daily life of an individual. The events can contain images, audio, health index, etc which are collected through various devices such as wearable cameras, smartwatches, and other digital services. Exploiting lifelog data can bring significant benefits for lifeloggers from creating personalized healthcare plans to retrieving events in the past. In recent years, there has been a growing development of interactive lifelog retrieval systems, such as competitors at the annual Lifelog Search Challenge (LSC), to assist lifeloggers in finding events from the past. This paper introduces an interactive lifelog image retrieval called MemoriEase for the LSC’23 challenge. This system combines concept-based and embedding-based retrieval approaches to answer accurate images for LSC’23 queries. This system uses BLIP for the embedding-based retrieval approach to reduce the semantic gap between images and text queries. The concept-based retrieval approach uses full-text search in Elasticsearch to retrieve images having visual concepts similar to keywords in the query. Regarding the user interface, we make it as simple as possible to make novices users can use it with only a small effort. This is the first version of MemoriEase and we expect this can help users perform well in the LSC’23 competition.
生活日志是一种记录个人日常生活中发生的所有事件的活动。这些事件可以包含图像、音频、健康指数等,通过各种设备(如可穿戴相机、智能手表和其他数字服务)收集。利用生活日志数据可以为生活记录者带来巨大的好处,从创建个性化的医疗保健计划到检索过去的事件。近年来,交互式生活日志检索系统的发展越来越多,例如在年度生活日志搜索挑战赛(LSC)上的竞争对手,以帮助生活日志记录者查找过去的事件。为了应对LSC’23的挑战,本文介绍了一种名为MemoriEase的交互式生活日志图像检索方法。该系统结合了基于概念和基于嵌入的检索方法来回答LSC ' 23查询的准确图像。该系统采用基于嵌入的BLIP检索方法来减少图像和文本查询之间的语义差距。基于概念的检索方法使用Elasticsearch中的全文搜索来检索具有与查询中的关键字相似的视觉概念的图像。在用户界面方面,我们尽量使它简单,使新手用户可以使用它只需很小的努力。这是MemoriEase的第一个版本,我们希望它能帮助用户在LSC ' 23比赛中表现出色。
{"title":"MemoriEase: An Interactive Lifelog Retrieval System for LSC’23","authors":"Quang-Linh Tran, Ly-Duyen Tran, Binh T. Nguyen, C. Gurrin","doi":"10.1145/3592573.3593101","DOIUrl":"https://doi.org/10.1145/3592573.3593101","url":null,"abstract":"Lifelogging is an activity of recording all events that happen in the daily life of an individual. The events can contain images, audio, health index, etc which are collected through various devices such as wearable cameras, smartwatches, and other digital services. Exploiting lifelog data can bring significant benefits for lifeloggers from creating personalized healthcare plans to retrieving events in the past. In recent years, there has been a growing development of interactive lifelog retrieval systems, such as competitors at the annual Lifelog Search Challenge (LSC), to assist lifeloggers in finding events from the past. This paper introduces an interactive lifelog image retrieval called MemoriEase for the LSC’23 challenge. This system combines concept-based and embedding-based retrieval approaches to answer accurate images for LSC’23 queries. This system uses BLIP for the embedding-based retrieval approach to reduce the semantic gap between images and text queries. The concept-based retrieval approach uses full-text search in Elasticsearch to retrieve images having visual concepts similar to keywords in the query. Regarding the user interface, we make it as simple as possible to make novices users can use it with only a small effort. This is the first version of MemoriEase and we expect this can help users perform well in the LSC’23 competition.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129519858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Lifelog Discovery Assistant: Suggesting Prompts and Indexing Event Sequences for FIRST at LSC 2023 生活日志发现助理:建议提示和索引事件序列在LSC 2023
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593104
N. Hoang-Xuan, Thang-Long Nguyen-Ho, C. Gurrin, Minh-Triet Tran
AI-assisted tools have become more prevalent than ever in the last few years. However, applying them to build a lifelog retrieval system is still non-trivial due to the disparity in interfaces and interactions. The Lifelog Search Challenge (LSC) aims to provide a testing ground where systems can be benchmarked in a highly competitive setting. In this paper, we present the fourth iteration of our participating system FIRST. For this year, we adopt generative models to equip the system with predictive ability rather than entirely relying on the user to input the query. We also index a sequence of images as an event for improved search speed. Finally, we demonstrate how the additional features can assist users in searching.
在过去的几年里,人工智能辅助工具变得比以往任何时候都更加普遍。然而,由于界面和交互的差异,将它们应用于构建生活日志检索系统仍然不是一件容易的事情。Lifelog搜索挑战赛(LSC)旨在提供一个测试平台,让系统可以在高度竞争的环境中进行基准测试。在本文中,我们首先提出了我们参与系统的第四次迭代。今年,我们采用生成模型使系统具备预测能力,而不是完全依靠用户输入查询。我们还将一系列图像作为事件索引,以提高搜索速度。最后,我们将演示附加功能如何帮助用户进行搜索。
{"title":"Lifelog Discovery Assistant: Suggesting Prompts and Indexing Event Sequences for FIRST at LSC 2023","authors":"N. Hoang-Xuan, Thang-Long Nguyen-Ho, C. Gurrin, Minh-Triet Tran","doi":"10.1145/3592573.3593104","DOIUrl":"https://doi.org/10.1145/3592573.3593104","url":null,"abstract":"AI-assisted tools have become more prevalent than ever in the last few years. However, applying them to build a lifelog retrieval system is still non-trivial due to the disparity in interfaces and interactions. The Lifelog Search Challenge (LSC) aims to provide a testing ground where systems can be benchmarked in a highly competitive setting. In this paper, we present the fourth iteration of our participating system FIRST. For this year, we adopt generative models to equip the system with predictive ability rather than entirely relying on the user to input the query. We also index a sequence of images as an event for improved search speed. Finally, we demonstrate how the additional features can assist users in searching.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123430160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
LifeLens: Transforming Lifelog Search with Innovative UX/UI Design LifeLens:用创新的UX/UI设计改变生活日志搜索
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593096
Maria Tysse Hordvik, Julie Sophie Teilstad Østby, M. Kesavulu, Thao-Nhu Nguyen, Tu-Khiem Le, Duc-Tien Dang-Nguyen
One of the important components of the lifelog systems is the user interface which provides the ability to quickly and easily find a specific image or set of images. Although lifelogging is a mature field in the information retrieval domain, the focus on user interfaces is not explored extensively. We start by identifying the common issues with existing lifelog systems from the user interface and user experience perspective. Following the exploration, we present a set of guidelines for designing a user interface for Lifelog systems. We introduce LifeLens- a novel minimalist user interface design specifically designed to improve the usability and ease of use of an interactive lifelog system. The initial version of the LifeLens system provides several improvements over existing lifelog systems addressing the design issues identified during the exploration. The proposed system presents several features that not only enable the users of the system to easily navigate the interface with minimal effort on the user’s part to learn and understand the features offered but also provide a minimal way to gather user feedback.
生命日志系统的重要组成部分之一是用户界面,它提供了快速轻松地找到特定图像或一组图像的能力。虽然生活日志是信息检索领域中一个成熟的领域,但对用户界面的关注并没有得到广泛的探讨。我们首先从用户界面和用户体验的角度确定现有生活日志系统的常见问题。在探索之后,我们提出了一套为Lifelog系统设计用户界面的指导方针。我们介绍LifeLens-一种新颖的极简用户界面设计,专门用于提高交互式生活日志系统的可用性和易用性。LifeLens系统的初始版本在现有生命日志系统的基础上进行了一些改进,解决了勘探过程中发现的设计问题。所提出的系统提供了几个功能,不仅使系统的用户能够轻松地导航界面,而用户学习和理解所提供的功能的努力最少,而且还提供了一种收集用户反馈的最小方法。
{"title":"LifeLens: Transforming Lifelog Search with Innovative UX/UI Design","authors":"Maria Tysse Hordvik, Julie Sophie Teilstad Østby, M. Kesavulu, Thao-Nhu Nguyen, Tu-Khiem Le, Duc-Tien Dang-Nguyen","doi":"10.1145/3592573.3593096","DOIUrl":"https://doi.org/10.1145/3592573.3593096","url":null,"abstract":"One of the important components of the lifelog systems is the user interface which provides the ability to quickly and easily find a specific image or set of images. Although lifelogging is a mature field in the information retrieval domain, the focus on user interfaces is not explored extensively. We start by identifying the common issues with existing lifelog systems from the user interface and user experience perspective. Following the exploration, we present a set of guidelines for designing a user interface for Lifelog systems. We introduce LifeLens- a novel minimalist user interface design specifically designed to improve the usability and ease of use of an interactive lifelog system. The initial version of the LifeLens system provides several improvements over existing lifelog systems addressing the design issues identified during the exploration. The proposed system presents several features that not only enable the users of the system to easily navigate the interface with minimal effort on the user’s part to learn and understand the features offered but also provide a minimal way to gather user feedback.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121028987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Voxento 4.0: A More Flexible Visualisation and Control for Lifelogs Voxento 4.0:一个更灵活的可视化和控制的生活日志
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593097
Ahmed Alateeq, M. Roantree, C. Gurrin
In this paper, we introduce Voxento 4.0 – an interactive voice-based retrieval system for lifelogs which has been developed to participate in the sixth Lifelog Search Challenge LSC’23, at ACM ICMR’23. Voxento has participated three times in the LSC editions and achieved the rank of 4th in LSC21 and 5th in LSC22 respectively. In this version, Voxento 4.0, we have focused on improving the previous system’s interface, voice interaction and retrieval functionality. The current version has implemented some processing and cleaning of the dataset and employs the CLIP model to extract image features. In addition, the system’s interface was redesigned for better visualisation of the elements and the images for effective interaction. This improvement in the interface will help to support voice interaction in future work. The interface developments include logging voice interaction and images displayed, submitted, selected and starred to enhance user experience with the system. The voice interaction part has also been enhanced in the workflow of the voice lifecycle interaction and with additional voice commands.
在本文中,我们介绍了Voxento 4.0——一个交互式的基于语音的生活日志检索系统,该系统是为参加ACM ICMR ' 23的第六届生活日志搜索挑战LSC ' 23而开发的。Voxento曾三次参加LSC,分别获得LSC21第4名和LSC22第5名。在Voxento 4.0这个版本中,我们重点改进了之前系统的界面、语音交互和检索功能。当前版本对数据集进行了一些处理和清理,并采用CLIP模型提取图像特征。此外,该系统的界面进行了重新设计,以更好地可视化元素和有效互动的图像。这种界面上的改进将有助于在未来的工作中支持语音交互。界面开发包括日志语音交互和图像显示、提交、选择和打星,以增强用户对系统的体验。语音交互部分也在语音生命周期交互的工作流程中得到了增强,并增加了语音命令。
{"title":"Voxento 4.0: A More Flexible Visualisation and Control for Lifelogs","authors":"Ahmed Alateeq, M. Roantree, C. Gurrin","doi":"10.1145/3592573.3593097","DOIUrl":"https://doi.org/10.1145/3592573.3593097","url":null,"abstract":"In this paper, we introduce Voxento 4.0 – an interactive voice-based retrieval system for lifelogs which has been developed to participate in the sixth Lifelog Search Challenge LSC’23, at ACM ICMR’23. Voxento has participated three times in the LSC editions and achieved the rank of 4th in LSC21 and 5th in LSC22 respectively. In this version, Voxento 4.0, we have focused on improving the previous system’s interface, voice interaction and retrieval functionality. The current version has implemented some processing and cleaning of the dataset and employs the CLIP model to extract image features. In addition, the system’s interface was redesigned for better visualisation of the elements and the images for effective interaction. This improvement in the interface will help to support voice interaction in future work. The interface developments include logging voice interaction and images displayed, submitted, selected and starred to enhance user experience with the system. The voice interaction part has also been enhanced in the workflow of the voice lifecycle interaction and with additional voice commands.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124601491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
MEMORIA: A Memory Enhancement and MOment RetrIeval Application for LSC 2023 记忆:lsc2023的记忆增强和瞬间检索应用
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593099
Ricardo F. Ribeiro, Luísa Amaral, Wei Ye, A. Trifan, António J. R. Neves, Pedro Iglésias
The continuous collection and storage of personal data, denoted Lifelogging, has gained popularity in recent years as a means of monitoring and improving personal health. One important aspect of lifelogging is the collection and analysis of image data, which can provide valuable insights into an individual’s lifestyle, dietary habits, and physical activity. The Lifelog Search Challenge provides a unique opportunity to explore the state-of-the-art in lifelogging research, particularly in the area of egocentric image retrieval and analysis. Researchers can propose their approaches and compete to solve lifelog retrieval challenges and evaluate the effectiveness of their systems on a rich multimodal dataset generated by an active lifelogger with 18 months of continuous capture of lifelogging data. This paper presents the second version of MEMORIA, a computational tool developed to participate in the Lifelog Search Challenge 2023. In this new version, the information retrieval is based on the use of natural language search with the possibility to filter the results based on keywords and time periods. The system applies image analysis algorithms to process visual lifelogs, from pre-processing algorithms to feature extraction methods, in order to enrich the annotation of the lifelogs. This new version explores the use of a graph database, more detailed image annotation, and event segmentation, in order to improve the performance and user interaction. Experimental results of the user interaction with our retrieval module are presented, confirming the effectiveness of the proposed approach and showing the most relevant functionalities of the system.
近年来,作为监测和改善个人健康状况的一种手段,不断收集和储存个人数据(称为“生活日志”)越来越受欢迎。生活记录的一个重要方面是图像数据的收集和分析,这可以为个人的生活方式、饮食习惯和体育活动提供有价值的见解。生命日志搜索挑战赛提供了一个独特的机会来探索生命日志研究的最新技术,特别是在以自我为中心的图像检索和分析领域。研究人员可以提出他们的方法,并竞争解决生命日志检索挑战,并评估他们的系统在一个丰富的多模态数据集上的有效性,该数据集是由一个活跃的生命记录者连续捕获18个月的生命记录数据生成的。本文介绍了MEMORIA的第二个版本,这是一个为参加2023年生活日志搜索挑战而开发的计算工具。在这个新版本中,信息检索基于使用自然语言搜索,并可以根据关键字和时间段过滤结果。本系统采用图像分析算法对视觉生命日志进行处理,从预处理算法到特征提取方法,以丰富生命日志的注释。这个新版本探索了图形数据库的使用,更详细的图像注释和事件分割,以提高性能和用户交互。给出了用户与检索模块交互的实验结果,验证了所提出方法的有效性,并展示了系统最相关的功能。
{"title":"MEMORIA: A Memory Enhancement and MOment RetrIeval Application for LSC 2023","authors":"Ricardo F. Ribeiro, Luísa Amaral, Wei Ye, A. Trifan, António J. R. Neves, Pedro Iglésias","doi":"10.1145/3592573.3593099","DOIUrl":"https://doi.org/10.1145/3592573.3593099","url":null,"abstract":"The continuous collection and storage of personal data, denoted Lifelogging, has gained popularity in recent years as a means of monitoring and improving personal health. One important aspect of lifelogging is the collection and analysis of image data, which can provide valuable insights into an individual’s lifestyle, dietary habits, and physical activity. The Lifelog Search Challenge provides a unique opportunity to explore the state-of-the-art in lifelogging research, particularly in the area of egocentric image retrieval and analysis. Researchers can propose their approaches and compete to solve lifelog retrieval challenges and evaluate the effectiveness of their systems on a rich multimodal dataset generated by an active lifelogger with 18 months of continuous capture of lifelogging data. This paper presents the second version of MEMORIA, a computational tool developed to participate in the Lifelog Search Challenge 2023. In this new version, the information retrieval is based on the use of natural language search with the possibility to filter the results based on keywords and time periods. The system applies image analysis algorithms to process visual lifelogs, from pre-processing algorithms to feature extraction methods, in order to enrich the annotation of the lifelogs. This new version explores the use of a graph database, more detailed image annotation, and event segmentation, in order to improve the performance and user interaction. Experimental results of the user interaction with our retrieval module are presented, confirming the effectiveness of the proposed approach and showing the most relevant functionalities of the system.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114227462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Best of Both Worlds: Lifelog Retrieval with a Desktop-Virtual Reality Hybrid System 两全其美:使用桌面-虚拟现实混合系统检索生活日志
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593107
Florian Spiess, Ralph Gasser, H. Schuldt, Luca Rossetto
Personal lifelog data collections are becoming more common as a memory aid, as well as for analytical tasks, such as health and fitness analysis. Due to the multimodal and personal nature of lifelog data, interactive multimedia retrieval approaches are required to facilitate flexible and iterative query formulation and result exploration for retrieval and analysis. In recent years, novel user interface modalities have emerged, that allow new ways for users to interact with a retrieval system. Virtual reality, one such new modality, provides advantages as well as challenges for interactive multimedia retrieval in comparison to conventional desktop-based interfaces. This paper describes a novel desktop-virtual reality hybrid system participating in the Lifelog Search Challenge 2023. The system, which is based on the components of the vitrivr stack, is described with a focus on query formulation in the web-based desktop user interface vitrivr-ng, and result exploration in the virtual reality-based vitrivr-VR.
个人生活日志数据收集作为一种记忆辅助工具,以及健康和健身分析等分析任务,正变得越来越普遍。由于生活日志数据的多模态和个性化,需要交互式多媒体检索方法来实现灵活迭代的查询公式和结果探索,以便检索和分析。近年来,出现了新的用户界面模式,为用户提供了与检索系统交互的新方法。与传统的基于桌面的界面相比,虚拟现实技术为交互式多媒体检索提供了优势,同时也带来了挑战。本文介绍了一种新型的桌面-虚拟现实混合系统,该系统参与了2023年生活日志搜索挑战赛。该系统以vitrivr栈的组件为基础,重点介绍了基于web的桌面用户界面vitrivr-ng中的查询公式,以及基于虚拟现实的vitrivr- vr中的结果探索。
{"title":"The Best of Both Worlds: Lifelog Retrieval with a Desktop-Virtual Reality Hybrid System","authors":"Florian Spiess, Ralph Gasser, H. Schuldt, Luca Rossetto","doi":"10.1145/3592573.3593107","DOIUrl":"https://doi.org/10.1145/3592573.3593107","url":null,"abstract":"Personal lifelog data collections are becoming more common as a memory aid, as well as for analytical tasks, such as health and fitness analysis. Due to the multimodal and personal nature of lifelog data, interactive multimedia retrieval approaches are required to facilitate flexible and iterative query formulation and result exploration for retrieval and analysis. In recent years, novel user interface modalities have emerged, that allow new ways for users to interact with a retrieval system. Virtual reality, one such new modality, provides advantages as well as challenges for interactive multimedia retrieval in comparison to conventional desktop-based interfaces. This paper describes a novel desktop-virtual reality hybrid system participating in the Lifelog Search Challenge 2023. The system, which is based on the components of the vitrivr stack, is described with a focus on query formulation in the web-based desktop user interface vitrivr-ng, and result exploration in the virtual reality-based vitrivr-VR.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114864668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Memento 3.0: An Enhanced Lifelog Search Engine for LSC’23 纪念品3.0:LSC ' 23的增强生活日志搜索引擎
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593103
Naushad Alam, Yvette Graham, C. Gurrin
In this work, we present our system Memento 3.0 for participation in the Lifelog Search Challenge 2023, which is a successor to the previous 2 iterations of our system called Memento 1.0 [1] and Memento 2.0 [2]. Memento 3.0 employs image-text embeddings derived from OpenAI CLIP models as well as larger OpenCLIP models trained on ∼ 5x more data. Our system also significantly reduces the query processing time by almost 75% when compared to its predecessor systems by employing a cluster-based search technique. We additionally make important updates to the system’s user interface to offer more flexibility to the user and at the same time be better suited to efficiently handle new query types introduced in the Lifelog Search Challenge.
在这项工作中,我们展示了我们的系统Memento 3.0,用于参与2023年的生活日志搜索挑战,它是我们系统的前两个迭代的继任者,称为Memento 1.0[1]和Memento 2.0[2]。Memento 3.0采用了源自OpenAI CLIP模型的图像-文本嵌入,以及在大约5倍以上的数据上训练的更大的OpenCLIP模型。通过采用基于集群的搜索技术,我们的系统与之前的系统相比,查询处理时间也显著减少了近75%。此外,我们还对系统的用户界面进行了重要的更新,为用户提供了更大的灵活性,同时更适合于有效地处理Lifelog搜索挑战中引入的新查询类型。
{"title":"Memento 3.0: An Enhanced Lifelog Search Engine for LSC’23","authors":"Naushad Alam, Yvette Graham, C. Gurrin","doi":"10.1145/3592573.3593103","DOIUrl":"https://doi.org/10.1145/3592573.3593103","url":null,"abstract":"In this work, we present our system Memento 3.0 for participation in the Lifelog Search Challenge 2023, which is a successor to the previous 2 iterations of our system called Memento 1.0 [1] and Memento 2.0 [2]. Memento 3.0 employs image-text embeddings derived from OpenAI CLIP models as well as larger OpenCLIP models trained on ∼ 5x more data. Our system also significantly reduces the query processing time by almost 75% when compared to its predecessor systems by employing a cluster-based search technique. We additionally make important updates to the system’s user interface to offer more flexibility to the user and at the same time be better suited to efficiently handle new query types introduced in the Lifelog Search Challenge.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124082907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
lifeXplore at the Lifelog Search Challenge 2023 lifeexplore在2023年生命日志搜索挑战赛上
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593105
Klaus Schoeffmann
Searching substantial data archives of lifeloggers is a challenging task. The Lifelog Search Challenge (LSC) is an annually held competition with the aim of encouraging international teams to develop interactive content retrieval systems capable of searching large lifelog databases. LSC takes place as a live event co-located with the ACM International Conference on Multimedia Retrieval (ICMR), where teams compete against each other by solving retrieval tasks issued by the lifelogger. This paper presents our newest version of lifeXplore, a lifelog retrieval system that has been participating in LSC since 2018. For this year, we significantly redesign the entire system (backend, middleware, and frontend) and integrate free text-search using embeddings from vision transformers trained with large sets of text-image pairs. We present a novel architecture for multi-source search, where results from image embeddings are used together with results from traditional content analysis (for objects, concepts, and recognized text). We also perform intensive analysis of vision transformer models in order to know which one fits best to the requirements of the LSC.
搜寻大量的生命记录者数据档案是一项具有挑战性的任务。生命日志搜索挑战赛(LSC)是一项每年举办的竞赛,旨在鼓励国际团队开发能够搜索大型生命日志数据库的交互式内容检索系统。LSC是与ACM多媒体检索国际会议(ICMR)同时举办的现场活动,团队通过解决由生命记录员发出的检索任务相互竞争。本文介绍了我们最新版本的lifeXplore,这是一个自2018年以来一直参与LSC的生活日志检索系统。今年,我们对整个系统(后端、中间件和前端)进行了重大的重新设计,并使用由大量文本图像对训练的视觉转换器嵌入来集成免费的文本搜索。我们提出了一种新的多源搜索架构,其中图像嵌入的结果与传统内容分析(对象、概念和可识别文本)的结果一起使用。我们还对视觉变压器模型进行了深入的分析,以了解哪一个最适合LSC的要求。
{"title":"lifeXplore at the Lifelog Search Challenge 2023","authors":"Klaus Schoeffmann","doi":"10.1145/3592573.3593105","DOIUrl":"https://doi.org/10.1145/3592573.3593105","url":null,"abstract":"Searching substantial data archives of lifeloggers is a challenging task. The Lifelog Search Challenge (LSC) is an annually held competition with the aim of encouraging international teams to develop interactive content retrieval systems capable of searching large lifelog databases. LSC takes place as a live event co-located with the ACM International Conference on Multimedia Retrieval (ICMR), where teams compete against each other by solving retrieval tasks issued by the lifelogger. This paper presents our newest version of lifeXplore, a lifelog retrieval system that has been participating in LSC since 2018. For this year, we significantly redesign the entire system (backend, middleware, and frontend) and integrate free text-search using embeddings from vision transformers trained with large sets of text-image pairs. We present a novel architecture for multi-source search, where results from image embeddings are used together with results from traditional content analysis (for objects, concepts, and recognized text). We also perform intensive analysis of vision transformer models in order to know which one fits best to the requirements of the LSC.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124930471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multi-Mode Clustering for Graph-Based Lifelog Retrieval 基于图的生活日志检索的多模式聚类
Pub Date : 2023-06-12 DOI: 10.1145/3592573.3593102
Luca Rossetto, O. Inel, Svenja Lange, Florian Ruosch, Ruijie Wang, Abraham Bernstein
As part of the 6th Lifelog Search Challenge, this paper presents an approach to arrange Lifelog data in a multi-modal knowledge graph based on cluster hierarchies. We use multiple sequence clustering approaches to address the multi-modal nature of Lifelogs in relation to temporal, spatial, and visual factors. The resulting clusters, along with semantic metadata captions and augmentations based on OpenCLIP, provide for the semantic structure of a graph including all Lifelogs as entries. Textual queries on this hierarchical graph can be expressed to retrieve individual Lifelogs, as well as clusters of Lifelogs.
作为第六届生活日志搜索挑战赛的一部分,本文提出了一种基于聚类层次结构的多模态知识图排列生活日志数据的方法。我们使用多序列聚类方法来解决与时间、空间和视觉因素相关的Lifelogs的多模态性质。生成的集群,以及基于OpenCLIP的语义元数据标题和增强,提供了包含所有Lifelogs作为条目的图的语义结构。对这个层次图的文本查询可以表示为检索单个Lifelogs,以及Lifelogs的集群。
{"title":"Multi-Mode Clustering for Graph-Based Lifelog Retrieval","authors":"Luca Rossetto, O. Inel, Svenja Lange, Florian Ruosch, Ruijie Wang, Abraham Bernstein","doi":"10.1145/3592573.3593102","DOIUrl":"https://doi.org/10.1145/3592573.3593102","url":null,"abstract":"As part of the 6th Lifelog Search Challenge, this paper presents an approach to arrange Lifelog data in a multi-modal knowledge graph based on cluster hierarchies. We use multiple sequence clustering approaches to address the multi-modal nature of Lifelogs in relation to temporal, spatial, and visual factors. The resulting clusters, along with semantic metadata captions and augmentations based on OpenCLIP, provide for the semantic structure of a graph including all Lifelogs as entries. Textual queries on this hierarchical graph can be expressed to retrieve individual Lifelogs, as well as clusters of Lifelogs.","PeriodicalId":147486,"journal":{"name":"Proceedings of the 6th Annual ACM Lifelog Search Challenge","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125970453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 6th Annual ACM Lifelog Search Challenge
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1