Consumers are increasingly using the web to find answers to their health-related queries. Unfortunately, they often struggle with formulating the questions, further compounded by the burden of having to traverse long documents returned by the search engine to look for reliable answers. To ease these burdens for users, automated consumer health question answering systems try to simulate a human professional by refining the queries and giving the most pertinent answers. This article surveys state-of-the-art approaches, resources, and evaluation methods used for automatic consumer health question answering. We summarize the main achievements in the research community and industry, discuss their strengths and limitations, and finally come up with recommendations to further improve these systems in terms of quality, engagement, and human-likeness.
{"title":"A survey of consumer health question answering systems","authors":"Anuradha Welivita, Pearl Pu","doi":"10.1002/aaai.12140","DOIUrl":"https://doi.org/10.1002/aaai.12140","url":null,"abstract":"<p>Consumers are increasingly using the web to find answers to their health-related queries. Unfortunately, they often struggle with formulating the questions, further compounded by the burden of having to traverse long documents returned by the search engine to look for reliable answers. To ease these burdens for users, automated consumer health question answering systems try to simulate a human professional by refining the queries and giving the most pertinent answers. This article surveys state-of-the-art approaches, resources, and evaluation methods used for automatic consumer health question answering. We summarize the main achievements in the research community and industry, discuss their strengths and limitations, and finally come up with recommendations to further improve these systems in terms of quality, engagement, and human-likeness.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"482-507"},"PeriodicalIF":0.9,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12140","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138558236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chao Zheng, Xu Cao, Kun Tang, Zhipeng Cao, Elena Sizikova, Tong Zhou, Erlong Li, Ao Liu, Shengtao Zou, Xinrui Yan, Shuqi Mei
As autonomous vehicle technology advances, high-definition (HD) maps have become essential for ensuring safety and navigation accuracy. However, creating HD maps with accurate annotations demands substantial human effort, leading to a time-consuming and costly process. Although artificial intelligence (AI) and computer vision (CV) algorithms have been developed for prelabeling HD maps, a significant gap remains in accuracy and robustness between AI-based methods and traditional manual pipelines. Additionally, building large-scale annotated datasets and advanced machine learning algorithms for AI-based HD map labeling systems can be resource-intensive. In this paper, we present and summarize the Tencent HD Map AI (THMA) system, an innovative end-to-end, AI-based, active learning HD map labeling system designed to produce HD map labels for hundreds of thousands of kilometers while employing active learning to enhance product iteration. Utilizing a combination of supervised, self-supervised, and weakly supervised learning, THMA is trained directly on massive HD map datasets to achieve the high accuracy and efficiency required by downstream users. Deployed by the Tencent Map team, THMA serves over 1000 labeling workers and generates more than 30,000 km of HD map data per day at its peak. With over 90% of Tencent Map's HD map data labeled automatically by THMA, the system accelerates traditional HD map labeling processes by more than tenfold, significantly reducing manual annotation burdens and paving the way for more efficient HD map production.
随着自动驾驶汽车技术的发展,高清(HD)地图已成为确保安全和导航准确性的关键。然而,绘制带有准确注释的高清地图需要大量人力,导致整个过程耗时且成本高昂。虽然已经开发出了人工智能(AI)和计算机视觉(CV)算法来对高清地图进行预标注,但基于 AI 的方法与传统的人工管道相比,在准确性和鲁棒性方面仍存在很大差距。此外,为基于人工智能的高清地图标注系统建立大规模注释数据集和先进的机器学习算法可能是资源密集型的。在本文中,我们介绍并总结了腾讯高清地图 AI(THMA)系统,这是一个创新的端到端、基于 AI 的主动学习高清地图标注系统,旨在生成数十万公里的高清地图标注,同时采用主动学习来加强产品迭代。THMA 采用监督学习、自监督学习和弱监督学习相结合的方式,直接在海量高清地图数据集上进行训练,以达到下游用户所需的高精度和高效率。THMA 由腾讯地图团队部署,服务于 1000 多名标注人员,高峰时每天生成超过 3 万公里的高清地图数据。腾讯地图 90% 以上的高清地图数据由 THMA 自动标注,该系统将传统的高清地图标注流程加快了十倍以上,大大减轻了人工标注负担,为更高效的高清地图生产铺平了道路。
{"title":"High-definition map automatic annotation system based on active learning","authors":"Chao Zheng, Xu Cao, Kun Tang, Zhipeng Cao, Elena Sizikova, Tong Zhou, Erlong Li, Ao Liu, Shengtao Zou, Xinrui Yan, Shuqi Mei","doi":"10.1002/aaai.12139","DOIUrl":"https://doi.org/10.1002/aaai.12139","url":null,"abstract":"<p>As autonomous vehicle technology advances, high-definition (HD) maps have become essential for ensuring safety and navigation accuracy. However, creating HD maps with accurate annotations demands substantial human effort, leading to a time-consuming and costly process. Although artificial intelligence (AI) and computer vision (CV) algorithms have been developed for prelabeling HD maps, a significant gap remains in accuracy and robustness between AI-based methods and traditional manual pipelines. Additionally, building large-scale annotated datasets and advanced machine learning algorithms for AI-based HD map labeling systems can be resource-intensive. In this paper, we present and summarize the Tencent HD Map AI (THMA) system, an innovative end-to-end, AI-based, active learning HD map labeling system designed to produce HD map labels for hundreds of thousands of kilometers while employing active learning to enhance product iteration. Utilizing a combination of supervised, self-supervised, and weakly supervised learning, THMA is trained directly on massive HD map datasets to achieve the high accuracy and efficiency required by downstream users. Deployed by the Tencent Map team, THMA serves over 1000 labeling workers and generates more than 30,000 km of HD map data per day at its peak. With over 90% of Tencent Map's HD map data labeled automatically by THMA, the system accelerates traditional HD map labeling processes by more than tenfold, significantly reducing manual annotation burdens and paving the way for more efficient HD map production.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"418-430"},"PeriodicalIF":0.9,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12139","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138558227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Social and moral norms are a fabric for holding human societies together and helping them to function. As such they will also become a means of evaluating the performance of future human–machine systems. While machine ethics has offered various approaches to endowing machines with normative competence, from the more logic-based to the more data-based, none of the proposals so far have considered the challenge of capturing the “spirit of a norm,” which often eludes rigid interpretation and complicates doing the right thing. We present some paradigmatic scenarios across contexts to illustrate why the spirit of a norm can be critical to make explicit and why it exposes the inadequacies of mere data-driven “value alignment” techniques such as reinforcement learning RL for interactive, real-time human–robot interaction. Instead, we argue that norm learning, in particular, learning to capture the spirit of a norm, requires combining common-sense inference-based and data-driven approaches.
{"title":"Understanding the spirit of a norm: Challenges for norm-learning agents","authors":"Thomas Arnold, Matthias Scheutz","doi":"10.1002/aaai.12138","DOIUrl":"10.1002/aaai.12138","url":null,"abstract":"<p>Social and moral norms are a fabric for holding human societies together and helping them to function. As such they will also become a means of evaluating the performance of future human–machine systems. While machine ethics has offered various approaches to endowing machines with normative competence, from the more logic-based to the more data-based, none of the proposals so far have considered the challenge of capturing the “spirit of a norm,” which often eludes rigid interpretation and complicates doing the right thing. We present some paradigmatic scenarios across contexts to illustrate why the spirit of a norm can be critical to make explicit and why it exposes the inadequacies of mere data-driven “value alignment” techniques such as reinforcement learning <i>RL</i> for interactive, real-time human–robot interaction. Instead, we argue that norm learning, in particular, learning to capture the spirit of a norm, requires combining common-sense inference-based and data-driven approaches.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"524-536"},"PeriodicalIF":0.9,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12138","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135928142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Machine Learning systems rely heavily on annotated instances. Such annotations are frequently done by human experts, or by tools developed by experts, and so the central message of this book, Noise: A Flaw in Human Judgment (Kahneman, Sibony, and Sunstein 2021) is of considerable importance to AI/Machine Learning community. The core message is that if a number of experts are asked to annotate tasks that involve judgments, these responses will frequently differ. This observation poses a problem for how analysts choose a particular annotated dataset (from the group), or process the set of responses to give a “balanced” response, or whether to reject all the annotated datasets. A further important aspect of this book is the case studies which demonstrate that differences in judgments between fellow experts have been reported in a significant number of disciplines including, business, the law, government, and medicine. Kahneman, Sibony and Sunstein (2021), referred to as KSS subsequently, discuss how Expert Biases can be reduced, but the main focus of this book is a discussion of Noise, that is, differences that often occur between fellow experts, and how Noise can often be reduced. To address the last point KSS have formulated a set of six decision hygiene principles which include the recommendation that complex tasks should be subdivided, and then each subtask should be solved separately. A further principle is that each task should be solved by individual experts before the various judgments are discussed with fellow experts. Effectively, the book being reviewed covers three main topics: First, it reports several motivating studies that show how judgments of fellow experts varied significantly in the pricing of insurance premiums, and in setting the lengths of custodial sentences. These motivating studies very effectively illustrate the central concepts of Judgment, Noise, and Bias; that section also provides definitions of these core concepts and discusses how Noise is often amplified in group meetings. Secondly, the authors provide detailed discussion of further studies, in a variety of domains, which report the levels of disagreement between experts. Thirdly, KSS discusses how to reduce the levels of Noise between experts, as noted above, the authors refer to these as Principles of Noise Hygiene. These three parts are interwoven in a complex way throughout the book; in our view, the best overview of the book is given in the section Review and Conclusions: Taking Noise Seriously (KSS, p. 361).
{"title":"Groups of experts often differ in their decisions: What are the implications for AI and machine learning? A commentary on Noise: A Flaw in Human Judgment, by Kahneman, Sibony, and Sunstein (2021)","authors":"Derek H. Sleeman, Ken Gilhooly","doi":"10.1002/aaai.12135","DOIUrl":"10.1002/aaai.12135","url":null,"abstract":"<p>Machine Learning systems rely heavily on annotated instances. Such annotations are frequently done by human experts, or by tools developed by experts, and so the central message of this book, <i>Noise: A Flaw in Human Judgment</i> (Kahneman, Sibony, and Sunstein 2021) is of considerable importance to AI/Machine Learning community. The core message is that if a number of experts are asked to annotate tasks that involve judgments, these responses will frequently differ. This observation poses a problem for how analysts choose a particular annotated dataset (from the group), or process the set of responses to give a “balanced” response, or whether to reject all the annotated datasets. A further important aspect of this book is the case studies which demonstrate that differences in judgments between fellow experts have been reported in a significant number of disciplines including, business, the law, government, and medicine. Kahneman, Sibony and Sunstein (2021), referred to as KSS subsequently, discuss how Expert Biases can be reduced, but the main focus of this book is a discussion of Noise, that is, differences that often occur between fellow experts, and how Noise can often be reduced. To address the last point KSS have formulated a set of six decision hygiene principles which include the recommendation that complex tasks should be subdivided, and then each subtask should be solved separately. A further principle is that each task should be solved by individual experts before the various judgments are discussed with fellow experts. Effectively, the book being reviewed covers three main topics: First, it reports several motivating studies that show how judgments of fellow experts varied significantly in the pricing of insurance premiums, and in setting the lengths of custodial sentences. These motivating studies very effectively illustrate the central concepts of Judgment, Noise, and Bias; that section also provides definitions of these core concepts and discusses how Noise is often amplified in group meetings. Secondly, the authors provide detailed discussion of further studies, in a variety of domains, which report the levels of disagreement between experts. Thirdly, KSS discusses how to reduce the levels of Noise between experts, as noted above, the authors refer to these as Principles of Noise Hygiene. These three parts are interwoven in a complex way throughout the book; in our view, the best overview of the book is given in the section Review and Conclusions: Taking Noise Seriously (KSS, p. 361).</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"555-567"},"PeriodicalIF":0.9,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12135","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136376496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper, which is part of the New Faculty Highlights Invited Speaker Program of AAAI'23, serves as a comprehensive survey of my research in transfer learning by utilizing embedding spaces. The work reviewed in this paper specifically revolves around the inherent challenges associated with continual learning and limited availability of labeled data. By providing an overview of my past and ongoing contributions, this paper aims to present a holistic understanding of my research, paving the way for future explorations and advancements in the field. My research delves into the various settings of transfer learning, including, few-shot learning, zero-shot learning, continual learning, domain adaptation, and distributed learning. I hope this survey provides a forward-looking perspective for researchers who would like to focus on similar research directions.
{"title":"Robust internal representations for domain generalization","authors":"Mohammad Rostami","doi":"10.1002/aaai.12137","DOIUrl":"10.1002/aaai.12137","url":null,"abstract":"<p>This paper, which is part of the New Faculty Highlights Invited Speaker Program of AAAI'23, serves as a comprehensive survey of my research in transfer learning by utilizing embedding spaces. The work reviewed in this paper specifically revolves around the inherent challenges associated with continual learning and limited availability of labeled data. By providing an overview of my past and ongoing contributions, this paper aims to present a holistic understanding of my research, paving the way for future explorations and advancements in the field. My research delves into the various settings of transfer learning, including, few-shot learning, zero-shot learning, continual learning, domain adaptation, and distributed learning. I hope this survey provides a forward-looking perspective for researchers who would like to focus on similar research directions.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"467-481"},"PeriodicalIF":0.9,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12137","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134908326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mihye Kim, Jimyung Choi, Jaehyun Kim, Wooyoung Kim, Yeonung Baek, Gisuk Bang, Kwangwoon Son, Yeonman Ryou, Kee-Eung Kim
The residual value (RV) of a vehicle refers to its estimated worth at some point in the future. It is a core component in every auto finance product, used to determine the credit lines and the leasing rates. As such, an accurate prediction of RV is critical for the auto finance industry, since it can pose a risk of revenue loss by over-prediction or make the financial product incompetent through under-prediction. Although there are a number of prior studies on training machine learning models on a large amount of used car sales data, we had to cope with real-world operational requirements such as compliance with regulations (i.e., monotonicity of output with respect to a subset of features) and generalization to unseen input (i.e., new and rare car models). In this paper, we describe how we addressed these practical challenges and created value for our business at Hyundai Capital Services, the top auto financial service provider in Korea.
{"title":"Trustworthy residual vehicle value prediction for auto finance","authors":"Mihye Kim, Jimyung Choi, Jaehyun Kim, Wooyoung Kim, Yeonung Baek, Gisuk Bang, Kwangwoon Son, Yeonman Ryou, Kee-Eung Kim","doi":"10.1002/aaai.12136","DOIUrl":"10.1002/aaai.12136","url":null,"abstract":"<p>The residual value (RV) of a vehicle refers to its estimated worth at some point in the future. It is a core component in every auto finance product, used to determine the credit lines and the leasing rates. As such, an accurate prediction of RV is critical for the auto finance industry, since it can pose a risk of revenue loss by over-prediction or make the financial product incompetent through under-prediction. Although there are a number of prior studies on training machine learning models on a large amount of used car sales data, we had to cope with real-world operational requirements such as compliance with regulations (i.e., monotonicity of output with respect to a subset of features) and generalization to unseen input (i.e., new and rare car models). In this paper, we describe how we addressed these practical challenges and created value for our business at Hyundai Capital Services, the top auto financial service provider in Korea.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"394-405"},"PeriodicalIF":0.9,"publicationDate":"2023-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12136","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135513286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The development of artificial intelligence (AI) agents capable of human-level understanding of video content and conducting conversations with humans on this basis is a promising application that people expect. However, this is a challenging task that requires the holistic integration of multimodal information with temporal dependencies and reasoning, as well as social and physical commonsense. In addition, the development of appropriate systematic evaluation methods is essential. In this context, we introduce the Video Turing Test (VTT), a blind test used to evaluate human-likeness in terms of video comprehension ability. Moreover, we propose Vincent as a video understanding AI. We explain the configuration of VTT, the architecture of Vincent to prepare for VTT and the proposed evaluation methods for video comprehension. We also estimate the current intelligence level of AI based on our results and discuss future research directions.
{"title":"Video Turing Test: A first step towards human-level AI","authors":"Minsu Lee, Yu-Jung Heo, Seongho Choi, Woo Suk Choi, Byoung-Tak Zhang","doi":"10.1002/aaai.12128","DOIUrl":"10.1002/aaai.12128","url":null,"abstract":"<p>The development of artificial intelligence (AI) agents capable of human-level understanding of video content and conducting conversations with humans on this basis is a promising application that people expect. However, this is a challenging task that requires the holistic integration of multimodal information with temporal dependencies and reasoning, as well as social and physical commonsense. In addition, the development of appropriate systematic evaluation methods is essential. In this context, we introduce the Video Turing Test (VTT), a blind test used to evaluate human-likeness in terms of video comprehension ability. Moreover, we propose Vincent as a video understanding AI. We explain the configuration of VTT, the architecture of Vincent to prepare for VTT and the proposed evaluation methods for video comprehension. We also estimate the current intelligence level of AI based on our results and discuss future research directions.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"537-554"},"PeriodicalIF":0.9,"publicationDate":"2023-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12128","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136034011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hayden Gunraj, Paul Guerrier, Sheldon Fernandez, Alexander Wong
In electronics manufacturing, solder joint defects are a common problem affecting a variety of printed circuit board components. To identify and correct solder joint defects, the solder joints on a circuit board are typically inspected manually by trained human inspectors, which is a very time-consuming and error-prone process. To improve both inspection efficiency and accuracy, in this work, we describe an explainable deep learning-based visual quality inspection system tailored for visual inspection of solder joints in electronics manufacturing environments. At the core of this system is an explainable solder joint defect identification system called SolderNet that we design and implement with trust and transparency in mind. While several challenges remain before the full system can be developed and deployed, this study presents important progress towards trustworthy visual inspection of solder joints in electronics manufacturing.
{"title":"SolderNet: Towards trustworthy visual inspection of solder joints in electronics manufacturing using explainable artificial intelligence","authors":"Hayden Gunraj, Paul Guerrier, Sheldon Fernandez, Alexander Wong","doi":"10.1002/aaai.12129","DOIUrl":"10.1002/aaai.12129","url":null,"abstract":"<p>In electronics manufacturing, solder joint defects are a common problem affecting a variety of printed circuit board components. To identify and correct solder joint defects, the solder joints on a circuit board are typically inspected manually by trained human inspectors, which is a very time-consuming and error-prone process. To improve both inspection efficiency and accuracy, in this work, we describe an explainable deep learning-based visual quality inspection system tailored for visual inspection of solder joints in electronics manufacturing environments. At the core of this system is an explainable solder joint defect identification system called <b>SolderNet</b> that we design and implement with trust and transparency in mind. While several challenges remain before the full system can be developed and deployed, this study presents important progress towards trustworthy visual inspection of solder joints in electronics manufacturing.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"442-452"},"PeriodicalIF":0.9,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12129","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136185639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Rabia Ali, Muhammad Sarmad, Jawad Tayyub, Alexander Vogel
Welding is a fabrication process used to join or fuse two mechanical parts. Modern welding machines have automated lasers that follow a predefined weld seam path between the two parts to create a bond. Previous efforts have used simple computer vision edge detectors to automatically detect the weld seam on an image at the junction of two metals to be welded. However, these systems lack reliability and accuracy resulting in manual human verification of the detected edges. This paper presents a neural network architecture that automatically detects the weld seam edge between two metals with high accuracy. We augment this system with a preclassifier that filters out anomalous workpieces (e.g., incorrect placement). Finally, we justify our design choices by evaluating against several existing deep network pipelines as well as proof through real-world use. We also describe in detail the process of deploying the system in a real-world shop floor including evaluation and monitoring. We make public a large well-labeled laser seam dataset to perform deep learning-based edge detection in industrial settings.
{"title":"Accurate detection of weld seams for laser welding in real-world manufacturing","authors":"Rabia Ali, Muhammad Sarmad, Jawad Tayyub, Alexander Vogel","doi":"10.1002/aaai.12134","DOIUrl":"10.1002/aaai.12134","url":null,"abstract":"<p>Welding is a fabrication process used to join or fuse two mechanical parts. Modern welding machines have automated lasers that follow a predefined weld seam path between the two parts to create a bond. Previous efforts have used simple computer vision edge detectors to automatically detect the weld seam on an image at the junction of two metals to be welded. However, these systems lack reliability and accuracy resulting in manual human verification of the detected edges. This paper presents a neural network architecture that automatically detects the weld seam edge between two metals with high accuracy. We augment this system with a preclassifier that filters out anomalous workpieces (e.g., incorrect placement). Finally, we justify our design choices by evaluating against several existing deep network pipelines as well as proof through real-world use. We also describe in detail the process of deploying the system in a real-world shop floor including evaluation and monitoring. We make public a large well-labeled laser seam dataset to perform deep learning-based edge detection in industrial settings.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"431-441"},"PeriodicalIF":0.9,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12134","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135856069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Underserved communities face critical health challenges due to lack of access to timely and reliable information. Nongovernmental organizations are leveraging the widespread use of cellphones to combat these healthcare challenges and spread preventative awareness. The health workers at these organizations reach out individually to beneficiaries; however, such programs still suffer from declining engagement. We have deployed Saheli, a system to efficiently utilize the limited availability of health workers for improving maternal and child health in India. Saheli uses the Restless Multi-armed Bandit (RMAB) framework to identify beneficiaries for outreach. It is the first deployed application for RMABs in public health, and is already in continuous use by our partner NGO, ARMMAN. We have already reached ∼130K beneficiaries with Saheli, and are on track to serve one million beneficiaries by the end of 2023. This scale and impact has been achieved through multiple innovations in the RMAB model and its development, in preparation of real world data, and in deployment practices; and through careful consideration of responsible AI practices. Specifically, in this paper, we describe our approach to learn from past data to improve the performance of Saheli's RMAB model, the real-world challenges faced during deployment and adoption of Saheli, and the end-to-end pipeline.
{"title":"Expanding impact of mobile health programs: SAHELI for maternal and child care","authors":"Shresth Verma, Gargi Singh, Aditya Mate, Paritosh Verma, Sruthi Gorantla, Neha Madhiwalla, Aparna Hegde, Divy Thakkar, Manish Jain, Milind Tambe, Aparna Taneja","doi":"10.1002/aaai.12126","DOIUrl":"10.1002/aaai.12126","url":null,"abstract":"<p>Underserved communities face critical health challenges due to lack of access to timely and reliable information. Nongovernmental organizations are leveraging the widespread use of cellphones to combat these healthcare challenges and spread preventative awareness. The health workers at these organizations reach out individually to beneficiaries; however, such programs still suffer from declining engagement. We have deployed <span>Saheli</span>, a system to efficiently utilize the limited availability of health workers for improving maternal and child health in India. <span>Saheli</span> uses the Restless Multi-armed Bandit (RMAB) framework to identify beneficiaries for outreach. It is the <i>first deployed application</i> for RMABs in public health, and is already <i>in continuous use</i> by our partner NGO, ARMMAN. We have already reached ∼130K beneficiaries with <span>Saheli</span>, and are on track to serve one million beneficiaries by the end of 2023. This scale and impact has been achieved through multiple innovations in the RMAB model and its development, in preparation of real world data, and in deployment practices; and through careful consideration of responsible AI practices. Specifically, in this paper, we describe our approach to learn from past data to improve the performance of <span>Saheli</span>'s RMAB model, the real-world challenges faced during deployment and adoption of <span>Saheli</span>, and the end-to-end pipeline.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"44 4","pages":"363-376"},"PeriodicalIF":0.9,"publicationDate":"2023-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12126","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136013330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}