首页 > 最新文献

CEUR workshop proceedings最新文献

英文 中文
A Privacy-Preserving Unsupervised Speaker Disentanglement Method for Depression Detection from Speech. 一种用于从语音中检测抑郁的隐私保护无监督扬声器纠错方法
Pub Date : 2024-02-01
Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan

The proposed method focuses on speaker disentanglement in the context of depression detection from speech signals. Previous approaches require patient/speaker labels, encounter instability due to loss maximization, and introduce unnecessary parameters for adversarial domain prediction. In contrast, the proposed unsupervised approach reduces cosine similarity between latent spaces of depression and pre-trained speaker classification models. This method outperforms baseline models, matches or exceeds adversarial methods in performance, and does so without relying on speaker labels or introducing additional model parameters, leading to a reduction in model complexity. The higher the speaker de-identification score (DeID), the better the depression detection system is in masking a patient's identity thereby enhancing the privacy attributes of depression detection systems. On the DAIC-WOZ dataset with ComparE16 features and an LSTM-only model, our method achieves an F1-Score of 0.776 and a DeID score of 92.87%, outperforming its adversarial counterpart which has an F1Score of 0.762 and 68.37% DeID, respectively. Furthermore, we demonstrate that speaker-disentanglement methods are complementary to text-based approaches, and a score-level fusion with a Word2vec-based depression detection model further enhances the overall performance to an F1-Score of 0.830.

所提出的方法侧重于在从语音信号中检测抑郁的背景下进行扬声器分离。以往的方法需要患者/说话人标签,会因损失最大化而导致不稳定性,并为对抗域预测引入不必要的参数。相比之下,所提出的无监督方法降低了抑郁潜空间与预训练说话人分类模型之间的余弦相似度。这种方法的性能优于基线模型,在性能上与对抗方法不相上下,甚至更胜一筹,而且无需依赖说话者标签或引入额外的模型参数,从而降低了模型的复杂性。说话者去身份化得分(DeID)越高,抑郁检测系统在掩盖患者身份方面的表现就越好,从而提高了抑郁检测系统的隐私属性。在使用 ComparE16 特征和纯 LSTM 模型的 DAIC-WOZ 数据集上,我们的方法获得了 0.776 的 F1 分数和 92.87% 的 DeID 分数,优于其 F1 分数为 0.762 和 DeID 分数为 68.37% 的对抗方法。此外,我们还证明了扬声器分离方法与基于文本的方法是互补的,而与基于 Word2vec 的抑郁检测模型进行分数级融合则进一步提高了整体性能,使 F1 分数达到 0.830。
{"title":"A Privacy-Preserving Unsupervised Speaker Disentanglement Method for Depression Detection from Speech.","authors":"Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The proposed method focuses on speaker disentanglement in the context of depression detection from speech signals. Previous approaches require patient/speaker labels, encounter instability due to loss maximization, and introduce unnecessary parameters for adversarial domain prediction. In contrast, the proposed unsupervised approach reduces cosine similarity between latent spaces of depression and pre-trained speaker classification models. This method outperforms baseline models, matches or exceeds adversarial methods in performance, and does so without relying on speaker labels or introducing additional model parameters, leading to a reduction in model complexity. The higher the speaker de-identification score (<i>DeID</i>), the better the depression detection system is in masking a patient's identity thereby enhancing the privacy attributes of depression detection systems. On the DAIC-WOZ dataset with ComparE16 features and an LSTM-only model, our method achieves an F1-Score of 0.776 and a <i>DeID</i> score of 92.87%, outperforming its adversarial counterpart which has an F1Score of 0.762 and 68.37% <i>DeID</i>, respectively. Furthermore, we demonstrate that speaker-disentanglement methods are complementary to text-based approaches, and a score-level fusion with a Word2vec-based depression detection model further enhances the overall performance to an F1-Score of 0.830.</p>","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"3649 ","pages":"57-63"},"PeriodicalIF":0.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11034881/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140875057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning to Generate Context-Sensitive Backchannel Smiles for Embodied AI Agents with Applications in Mental Health Dialogues.
Pub Date : 2024-02-01
Maneesh Bilalpur, Mert Inan, Dorsa Zeinali, Jeffrey F Cohn, Malihe Alikhani

Addressing the critical shortage of mental health resources for effective screening, diagnosis, and treatment remains a significant challenge. This scarcity underscores the need for innovative solutions, particularly in enhancing the accessibility and efficacy of therapeutic support. Embodied agents with advanced interactive capabilities emerge as a promising and cost-effective supplement to traditional caregiving methods. Crucial to these agents' effectiveness is their ability to simulate non-verbal behaviors, like backchannels, that are pivotal in establishing rapport and understanding in therapeutic contexts but remain under-explored. To improve the rapport-building capabilities of embodied agents we annotated backchannel smiles in videos of intimate face-to-face conversations over topics such as mental health, illness, and relationships. We hypothesized that both speaker and listener behaviors affect the duration and intensity of backchannel smiles. Using cues from speech prosody and language along with the demographics of the speaker and listener, we found them to contain significant predictors of the intensity of backchannel smiles. Based on our findings, we introduce backchannel smile production in embodied agents as a generation problem. Our attention-based generative model suggests that listener information offers performance improvements over the baseline speaker-centric generation approach. Conditioned generation using the significant predictors of smile intensity provides statistically significant improvements in empirical measures of generation quality. Our user study by transferring generated smiles to an embodied agent suggests that agent with backchannel smiles is perceived to be more human-like and is an attractive alternative for non-personal conversations over agent without backchannel smiles.

{"title":"Learning to Generate Context-Sensitive Backchannel Smiles for Embodied AI Agents with Applications in Mental Health Dialogues.","authors":"Maneesh Bilalpur, Mert Inan, Dorsa Zeinali, Jeffrey F Cohn, Malihe Alikhani","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Addressing the critical shortage of mental health resources for effective screening, diagnosis, and treatment remains a significant challenge. This scarcity underscores the need for innovative solutions, particularly in enhancing the accessibility and efficacy of therapeutic support. Embodied agents with advanced interactive capabilities emerge as a promising and cost-effective supplement to traditional caregiving methods. Crucial to these agents' effectiveness is their ability to simulate non-verbal behaviors, like backchannels, that are pivotal in establishing rapport and understanding in therapeutic contexts but remain under-explored. To improve the rapport-building capabilities of embodied agents we annotated backchannel smiles in videos of intimate face-to-face conversations over topics such as mental health, illness, and relationships. We hypothesized that both speaker and listener behaviors affect the duration and intensity of backchannel smiles. Using cues from speech prosody and language along with the demographics of the speaker and listener, we found them to contain significant predictors of the intensity of backchannel smiles. Based on our findings, we introduce backchannel smile production in embodied agents as a generation problem. Our attention-based generative model suggests that listener information offers performance improvements over the baseline speaker-centric generation approach. Conditioned generation using the significant predictors of smile intensity provides statistically significant improvements in empirical measures of generation quality. Our user study by transferring generated smiles to an embodied agent suggests that agent with backchannel smiles is perceived to be more human-like and is an attractive alternative for non-personal conversations over agent without backchannel smiles.</p>","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"3649 ","pages":"12-22"},"PeriodicalIF":0.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11608428/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142775368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Internet resources for foreign language education in primary school: challenges and opportunities 小学外语教育的网络资源:挑战与机遇
Pub Date : 2023-09-12 DOI: 10.55056/ceur-ws.org/vol-3482/paper041
Inna A. Kravtsova, Alina O. Mankuta, Vita A. Hamaniuk, Olga S. Bilozir, Andrei V. Voznyak
The paper explores the challenges and opportunities of developing professional competence of primary school teachers in teaching foreign languages according to the New Ukrainian School concept. The paper analyzes and describes various Internet resources that can facilitate and enhance foreign language learning outcomes in primary school. The paper argues that Internet resources can help modernize foreign language education in primary school and align it with the New Ukrainian School concept. The paper also discusses the importance of training primary school teachers in the methods of organizing distance learning, which is a priority for higher education institutions in the context of continuous education.
根据新乌克兰学校的理念,探讨了小学外语教师专业能力发展的挑战与机遇。本文分析和描述了促进和提高小学外语学习成果的各种网络资源。本文认为,网络资源可以帮助小学外语教育现代化,并使其与新乌克兰学校的理念保持一致。本文还讨论了培训小学教师在组织远程学习方法中的重要性,这是高等教育机构在继续教育背景下优先考虑的问题。
{"title":"Internet resources for foreign language education in primary school: challenges and opportunities","authors":"Inna A. Kravtsova, Alina O. Mankuta, Vita A. Hamaniuk, Olga S. Bilozir, Andrei V. Voznyak","doi":"10.55056/ceur-ws.org/vol-3482/paper041","DOIUrl":"https://doi.org/10.55056/ceur-ws.org/vol-3482/paper041","url":null,"abstract":"The paper explores the challenges and opportunities of developing professional competence of primary school teachers in teaching foreign languages according to the New Ukrainian School concept. The paper analyzes and describes various Internet resources that can facilitate and enhance foreign language learning outcomes in primary school. The paper argues that Internet resources can help modernize foreign language education in primary school and align it with the New Ukrainian School concept. The paper also discusses the importance of training primary school teachers in the methods of organizing distance learning, which is a priority for higher education institutions in the context of continuous education.","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135935615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
YouTube as an open resource for foreign language learning: a case study of German YouTube作为外语学习的开放资源:以德语为例
Pub Date : 2023-09-12 DOI: 10.55056/ceur-ws.org/vol-3482/paper116
Olha V. Chorna, Vita A. Hamaniuk, Oksana Y. Markheva, Andrei V. Voznyak
The integration of information and communication technologies (ICT) in education has increased the possibilities and expanded the boundaries of the learning process. It is also a prerequisite for implementing distance learning. Various online resources, such as e-mail, blogs, forums, online applications, and video hosting sites, can be used to create open learning and education environments. This study focuses on the use of informational educational technologies for learning foreign languages, especially German. The article presents the results of a theoretical analysis of the content of YouTube video materials in terms of their personal and didactic relevance for teaching German as a first or second foreign language in higher education, specifically at a pedagogical university. Based on the practical experience of using several popular thematic YouTube channels with a large and stable audience, a brief didactic analysis of their products is provided and suggestions are made on how to transform video content into methodological material for the practical course of German language for future teachers. The article also explores the potential of using alternative YouTube resources for distance learning with regard to the development of mediation skills as defined by the authors of the CEFR Companion Volume with New Descriptors. Four types of resources that can serve as teaching materials are identified and analyzed; some examples of their preparation and use for the training of future foreign language teachers are given. The article also discusses the open resources ONCOO and TWINE, which can be used to foster the autonomy of future foreign language teachers, and describes their features. The proposed recommendations can help to achieve the following objectives: enriching vocabulary; semanticizing phraseological units, fixed expressions, clichés; developing pronunciation skills; enhancing linguistic and ICT competencies; improving listening and speaking skills; increasing motivation to learn, etc.
信息和通信技术(ICT)在教育中的整合增加了学习过程的可能性并扩大了学习过程的边界。这也是实施远程教育的先决条件。各种在线资源,如电子邮件、博客、论坛、在线应用程序和视频托管站点,都可以用来创建开放的学习和教育环境。本研究的重点是利用信息教育技术学习外语,特别是德语。本文介绍了对YouTube视频材料内容的理论分析结果,即它们与高等教育中德语作为第一或第二外语教学的个人和教学相关性,特别是在一所师范大学。本文结合几个热门主题YouTube频道的使用实践经验,对其产品进行了简要的教学分析,并就如何将视频内容转化为面向未来教师的德语实践课程的方法论材料提出了建议。本文还探讨了使用替代YouTube资源进行远程学习的潜力,这些资源与CEFR新描述符配套卷的作者所定义的调解技能的发展有关。确定并分析了四种可作为教材的资源;并举例说明了如何将其用于未来外语教师的培训。文章还讨论了ONCOO和TWINE这两种开放资源,它们可以用来培养未来外语教师的自主性,并描述了它们的特点。提出的建议有助于实现以下目标:丰富词汇量;对词汇单位、固定表达、陈词滥调进行语义化;发展发音技能;提高语言和信息通信技术能力;提高听说能力;增加学习的动力等等。
{"title":"YouTube as an open resource for foreign language learning: a case study of German","authors":"Olha V. Chorna, Vita A. Hamaniuk, Oksana Y. Markheva, Andrei V. Voznyak","doi":"10.55056/ceur-ws.org/vol-3482/paper116","DOIUrl":"https://doi.org/10.55056/ceur-ws.org/vol-3482/paper116","url":null,"abstract":"The integration of information and communication technologies (ICT) in education has increased the possibilities and expanded the boundaries of the learning process. It is also a prerequisite for implementing distance learning. Various online resources, such as e-mail, blogs, forums, online applications, and video hosting sites, can be used to create open learning and education environments. This study focuses on the use of informational educational technologies for learning foreign languages, especially German. The article presents the results of a theoretical analysis of the content of YouTube video materials in terms of their personal and didactic relevance for teaching German as a first or second foreign language in higher education, specifically at a pedagogical university. Based on the practical experience of using several popular thematic YouTube channels with a large and stable audience, a brief didactic analysis of their products is provided and suggestions are made on how to transform video content into methodological material for the practical course of German language for future teachers. The article also explores the potential of using alternative YouTube resources for distance learning with regard to the development of mediation skills as defined by the authors of the CEFR Companion Volume with New Descriptors. Four types of resources that can serve as teaching materials are identified and analyzed; some examples of their preparation and use for the training of future foreign language teachers are given. The article also discusses the open resources ONCOO and TWINE, which can be used to foster the autonomy of future foreign language teachers, and describes their features. The proposed recommendations can help to achieve the following objectives: enriching vocabulary; semanticizing phraseological units, fixed expressions, clichés; developing pronunciation skills; enhancing linguistic and ICT competencies; improving listening and speaking skills; increasing motivation to learn, etc.","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135935773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ontology-based representation and analysis of conditional vaccine immune responses using Omics data. 基于本体的表征和分析使用 Omics 数据的条件疫苗免疫反应。
Pub Date : 2023-08-01
Anthony Huffman, Edison Ong, Tim Brunson, Nasim Sanati, Jie Zheng, Anna Maria Masci, Guanming Wu, Yongqun He

ImmPort, the world's largest repository of immunology data, includes many vaccine immune response datasets. ImmPort maps the metadata of these studies to ontology and database schema. As of February 28, 2023, our ImmPort data analysis identified 6.258 immune exposures using 47 vaccines in 4,607 human subjects, and 324 cohort studies from the ImmPort. We hypothesized that an integrative ontological representation of the data from these studies would enhance our understanding and analysis of these ImmPort vaccine studies, and with ontological classification and tools such as VIGET, we could further study the effects of different conditions such as vaccine types and host biological sex on the vaccine response gene expression profiles. Our Vaccine Ontology (VO) analysis classified these 37 vaccines into bacterial, viral, and protozoan vaccine types with different vaccine properties. The ImmPort metadata types were modeled with the Vaccine Investigation Ontology (VIO). Our new ontology-based pipeline extracted vaccine response data from the ImmPort database, annotated them based on ontology, obtained corresponding gene expression data from the GEO, and performed consistent omics data analysis. Our use case found gene profiles shared and differed from live and killed inactivated influenza vaccines. Furthermore, our Omics data analysis using the VIGET tool found that female and male human subjects have differential host responses for influenza vaccines. For example, our study showed much stronger early female responses to influenza vaccination than males, and males was able to show active immune responses at a later stage. Interestingly, the female (but not male) human subject group also showed significantly enriched neutrophil degranulation at Day 3 after influenza vaccination; however, males (but not females) displayed significantly enriched neutrophil degranulation at Day 14 after influenza vaccination. These mechanisms have been used to find differences between the gene lists and pathways of host responses to different vaccines conditional to different factors including vaccine types and host biological sex. Moreover, this framework can be expanded to other vaccines and vaccine categories easily.

ImmPort 是世界上最大的免疫学数据储存库,其中包括许多疫苗免疫反应数据集。ImmPort 将这些研究的元数据映射到本体论和数据库模式。截至 2023 年 2 月 28 日,我们的 ImmPort 数据分析确定了在 4607 名人类受试者中使用 47 种疫苗的 6.258 次免疫暴露,以及 ImmPort 中的 324 项队列研究。我们假设,对这些研究的数据进行本体论整合表述,将能增强我们对这些 ImmPort 疫苗研究的理解和分析,通过本体论分类和 VIGET 等工具,我们可以进一步研究疫苗类型和宿主生物性别等不同条件对疫苗反应基因表达谱的影响。我们的疫苗本体(VO)分析将这 37 种疫苗分为细菌、病毒和原生动物疫苗类型,它们具有不同的疫苗特性。ImmPort 元数据类型采用疫苗调查本体(VIO)建模。我们基于本体的新管道从 ImmPort 数据库中提取疫苗反应数据,根据本体对其进行注释,从 GEO 获取相应的基因表达数据,并进行一致的 omics 数据分析。我们的使用案例发现了流感活疫苗和灭活疫苗的基因图谱的共同点和不同点。此外,我们使用 VIGET 工具进行的全局数据分析发现,女性和男性人体受试者对流感疫苗的宿主反应存在差异。例如,我们的研究显示,女性对流感疫苗的早期反应比男性强烈得多,而男性则能在后期表现出活跃的免疫反应。有趣的是,女性(而非男性)人类受试者组在接种流感疫苗后第 3 天也表现出明显的中性粒细胞脱颗粒现象;然而,男性(而非女性)在接种流感疫苗后第 14 天表现出明显的中性粒细胞脱颗粒现象。这些机制已被用于发现宿主对不同疫苗反应的基因列表和途径之间的差异,这些差异取决于不同的因素,包括疫苗类型和宿主的生物性别。此外,这一框架还可以很容易地扩展到其他疫苗和疫苗类别。
{"title":"Ontology-based representation and analysis of conditional vaccine immune responses using Omics data.","authors":"Anthony Huffman, Edison Ong, Tim Brunson, Nasim Sanati, Jie Zheng, Anna Maria Masci, Guanming Wu, Yongqun He","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>ImmPort, the world's largest repository of immunology data, includes many vaccine immune response datasets. ImmPort maps the metadata of these studies to ontology and database schema. As of February 28, 2023, our ImmPort data analysis identified 6.258 immune exposures using 47 vaccines in 4,607 human subjects, and 324 cohort studies from the ImmPort. We hypothesized that an integrative ontological representation of the data from these studies would enhance our understanding and analysis of these ImmPort vaccine studies, and with ontological classification and tools such as VIGET, we could further study the effects of different conditions such as vaccine types and host biological sex on the vaccine response gene expression profiles. Our Vaccine Ontology (VO) analysis classified these 37 vaccines into bacterial, viral, and protozoan vaccine types with different vaccine properties. The ImmPort metadata types were modeled with the Vaccine Investigation Ontology (VIO). Our new ontology-based pipeline extracted vaccine response data from the ImmPort database, annotated them based on ontology, obtained corresponding gene expression data from the GEO, and performed consistent omics data analysis. Our use case found gene profiles shared and differed from live and killed inactivated influenza vaccines. Furthermore, our Omics data analysis using the VIGET tool found that female and male human subjects have differential host responses for influenza vaccines. For example, our study showed much stronger early female responses to influenza vaccination than males, and males was able to show active immune responses at a later stage. Interestingly, the female (but not male) human subject group also showed significantly enriched neutrophil degranulation at Day 3 after influenza vaccination; however, males (but not females) displayed significantly enriched neutrophil degranulation at Day 14 after influenza vaccination. These mechanisms have been used to find differences between the gene lists and pathways of host responses to different vaccines conditional to different factors including vaccine types and host biological sex. Moreover, this framework can be expanded to other vaccines and vaccine categories easily.</p>","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"3603 ","pages":"1-12"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11584146/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142711244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of Human Experts and AI in Predicting Autism from Facial Behavior. 人类专家和人工智能从面部行为预测自闭症的比较。
Pub Date : 2023-03-01 Epub Date: 2023-03-16
Evangelos Sariyanidi, Casey J Zampella, Ellis DeJardin, John D Herrington, Robert T Schultz, Birkan Tunc

Advances in computational behavior analysis via artificial intelligence (AI) promise to improve mental healthcare services by providing clinicians with tools to assist diagnosis or measurement of treatment outcomes. This potential has spurred an increasing number of studies in which automated pipelines predict diagnoses of mental health conditions. However, a fundamental question remains unanswered: How do the predictions of the AI algorithms correspond and compare with the predictions of humans? This is a critical question if AI technology is to be used as an assistive tool, because the utility of an AI algorithm would be negligible if it provides little information beyond what clinicians can readily infer. In this paper, we compare the performance of 19 human raters (8 autism experts and 11 non-experts) and that of an AI algorithm in terms of predicting autism diagnosis from short (3-minute) videos of N = 42 participants in a naturalistic conversation. Results show that the AI algorithm achieves an average accuracy of 80.5%, which is comparable to that of clinicians with expertise in autism (83.1%) and clinical research staff without specialized expertise (78.3%). Critically, diagnoses that were inaccurately predicted by most humans (experts and non-experts, alike) were typically correctly predicted by AI. Our results highlight the potential of AI as an assistive tool that can augment clinician diagnostic decision-making.

人工智能(AI)在计算行为分析方面的进展有望改善精神卫生保健服务,为临床医生提供辅助诊断或衡量治疗结果的工具。这种潜力刺激了越来越多的研究,在这些研究中,自动化管道预测心理健康状况的诊断。然而,一个基本的问题仍然没有得到回答:人工智能算法的预测与人类的预测相对应并进行比较?如果人工智能技术被用作辅助工具,这是一个关键问题,因为如果人工智能算法提供的信息很少,超出临床医生可以轻易推断的范围,那么它的效用就可以忽略不计。在本文中,我们比较了19名人类评分者(8名自闭症专家和11名非专家)和人工智能算法在预测自闭症诊断方面的表现,这些评分者来自N = 42名参与者的自然对话中的短视频(3分钟)。结果表明,人工智能算法的平均准确率为80.5%,与具有自闭症专业知识的临床医生(83.1%)和没有专业知识的临床研究人员(78.3%)相当。关键是,大多数人(专家和非专家都一样)预测不准确的诊断通常会被人工智能正确预测。我们的研究结果强调了人工智能作为辅助工具的潜力,可以增强临床医生的诊断决策。
{"title":"Comparison of Human Experts and AI in Predicting Autism from Facial Behavior.","authors":"Evangelos Sariyanidi, Casey J Zampella, Ellis DeJardin, John D Herrington, Robert T Schultz, Birkan Tunc","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Advances in computational behavior analysis via artificial intelligence (AI) promise to improve mental healthcare services by providing clinicians with tools to assist diagnosis or measurement of treatment outcomes. This potential has spurred an increasing number of studies in which automated pipelines predict diagnoses of mental health conditions. However, a fundamental question remains unanswered: How do the predictions of the AI algorithms correspond and compare with the predictions of humans? This is a critical question if AI technology is to be used as an assistive tool, because the utility of an AI algorithm would be negligible if it provides little information beyond what clinicians can readily infer. In this paper, we compare the performance of 19 human raters (8 autism experts and 11 non-experts) and that of an AI algorithm in terms of predicting autism diagnosis from short (3-minute) videos of <i>N</i> = 42 participants in a naturalistic conversation. Results show that the AI algorithm achieves an average accuracy of 80.5%, which is comparable to that of clinicians with expertise in autism (83.1%) and clinical research staff without specialized expertise (78.3%). Critically, diagnoses that were inaccurately predicted by most humans (experts and non-experts, alike) were typically correctly predicted by AI. Our results highlight the potential of AI as an assistive tool that can augment clinician diagnostic decision-making.</p>","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"3359 ITAH","pages":"48-57"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10687770/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138464759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Extendible Realism-Based Ontology for Kinship. 基于现实主义的可扩展亲缘关系本体论。
Pub Date : 2023-01-01
Michael Rabenberg, Anuwat Pengput, Werner Ceusters

Adequately representing kinship relations is crucial for a variety of medical and biomedical applications. Several kinship ontologies have been proposed but none of them have been designed thus far in line with the Basic Formal Ontology. In this paper, we propose a novel kinship ontology that exhibits the following characteristics: (1) it is fully axiomatized in First Order Logic following the rules governing predicate formation as proposed in BFO2020-FOL, (2) it is modularized in 6 separate files written in the Common Logic Interface Format (CLIF) each one of which can be imported based on specific needs, (3) it provides bridging axioms to and from SNOMED CT, and (4) it contains an extra module with axioms which would not be literally true when phrased naively but are crafted in such a way that they highlight the unusual kinship relations they represent and can be used to generate alerts on possible data entry mistakes. We describe design considerations and challenges encountered.

充分表达亲属关系对于各种医学和生物医学应用至关重要。目前已经有多个亲属关系本体被提出,但迄今为止还没有一个是按照基本形式本体设计的。在本文中,我们提出了一种新的亲属关系本体,它具有以下特点:(1) 它完全用一阶逻辑公理化,遵循 BFO2020-FOL 中提出的谓词形成规则;(2) 它被模块化为 6 个用通用逻辑接口格式(CLIF)编写的独立文件,每个文件都可以根据具体需要导入、(3) 它提供了连接 SNOMED CT 和 SNOMED CT 的桥接公理,以及 (4) 它包含一个额外的公理模块,这些公理在简单措辞时并不完全正确,但经过精心设计,可以突出它们所代表的不寻常的亲缘关系,并可用于对可能的数据录入错误发出警报。我们将介绍设计时的考虑因素和遇到的挑战。
{"title":"An Extendible Realism-Based Ontology for Kinship.","authors":"Michael Rabenberg, Anuwat Pengput, Werner Ceusters","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Adequately representing kinship relations is crucial for a variety of medical and biomedical applications. Several kinship ontologies have been proposed but none of them have been designed thus far in line with the Basic Formal Ontology. In this paper, we propose a novel kinship ontology that exhibits the following characteristics: (1) it is fully axiomatized in First Order Logic following the rules governing predicate formation as proposed in BFO2020-FOL, (2) it is modularized in 6 separate files written in the Common Logic Interface Format (CLIF) each one of which can be imported based on specific needs, (3) it provides bridging axioms to and from SNOMED CT, and (4) it contains an extra module with axioms which would not be literally true when phrased naively but are crafted in such a way that they highlight the unusual kinship relations they represent and can be used to generate alerts on possible data entry mistakes. We describe design considerations and challenges encountered.</p>","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"3603 ","pages":"25-35"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11131162/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141163042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast Optimization of Weighted Sparse Decision Trees for use in Optimal Treatment Regimes and Optimal Policy Design. 快速优化加权稀疏决策树,用于优化治疗机制和优化政策设计。
Pub Date : 2022-10-01
Ali Behrouz, Mathias Lécuyer, Cynthia Rudin, Margo Seltzer

Sparse decision trees are one of the most common forms of interpretable models. While recent advances have produced algorithms that fully optimize sparse decision trees for prediction, that work does not address policy design, because the algorithms cannot handle weighted data samples. Specifically, they rely on the discreteness of the loss function, which means that real-valued weights cannot be directly used. For example, none of the existing techniques produce policies that incorporate inverse propensity weighting on individual data points. We present three algorithms for efficient sparse weighted decision tree optimization. The first approach directly optimizes the weighted loss function; however, it tends to be computationally inefficient for large datasets. Our second approach, which scales more efficiently, transforms weights to integer values and uses data duplication to transform the weighted decision tree optimization problem into an unweighted (but larger) counterpart. Our third algorithm, which scales to much larger datasets, uses a randomized procedure that samples each data point with a probability proportional to its weight. We present theoretical bounds on the error of the two fast methods and show experimentally that these methods can be two orders of magnitude faster than the direct optimization of the weighted loss, without losing significant accuracy.

稀疏决策树是最常见的可解释模型形式之一。虽然最近的进步已经产生了可以完全优化稀疏决策树预测的算法,但这项工作并没有解决策略设计的问题,因为这些算法无法处理加权数据样本。具体来说,这些算法依赖于损失函数的离散性,这意味着无法直接使用实值权重。例如,现有技术中没有一种能在单个数据点上生成包含反倾向加权的策略。我们提出了三种高效稀疏加权决策树优化算法。第一种方法直接优化加权损失函数,但对于大型数据集来说,计算效率往往较低。我们的第二种方法扩展效率更高,它将权重转换为整数值,并利用数据复制将加权决策树优化问题转换为非加权(但更大)的对应问题。我们的第三种算法可扩展到更大的数据集,它采用随机程序,以与其权重成正比的概率对每个数据点进行采样。我们提出了这两种快速方法的误差理论界限,并通过实验证明,这些方法比直接优化加权损失的方法快两个数量级,而不会损失显著的准确性。
{"title":"Fast Optimization of Weighted Sparse Decision Trees for use in Optimal Treatment Regimes and Optimal Policy Design.","authors":"Ali Behrouz, Mathias Lécuyer, Cynthia Rudin, Margo Seltzer","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Sparse decision trees are one of the most common forms of interpretable models. While recent advances have produced algorithms that fully optimize sparse decision trees for <i>prediction</i>, that work does not address <i>policy design</i>, because the algorithms cannot handle weighted data samples. Specifically, they rely on the discreteness of the loss function, which means that real-valued weights cannot be directly used. For example, none of the existing techniques produce policies that incorporate inverse propensity weighting on individual data points. We present three algorithms for efficient sparse weighted decision tree optimization. The first approach directly optimizes the weighted loss function; however, it tends to be computationally inefficient for large datasets. Our second approach, which scales more efficiently, transforms weights to integer values and uses data duplication to transform the weighted decision tree optimization problem into an unweighted (but larger) counterpart. Our third algorithm, which scales to much larger datasets, uses a randomized procedure that samples each data point with a probability proportional to its weight. We present theoretical bounds on the error of the two fast methods and show experimentally that these methods can be two orders of magnitude faster than the direct optimization of the weighted loss, without losing significant accuracy.</p>","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"3318 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10039433/pdf/nihms-1883491.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9197753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast optimization of weighted sparse decision trees for use in optimal treatment regimes and optimal policy design 加权稀疏决策树的快速优化,用于最优治疗方案和最优策略设计
Pub Date : 2022-10-01 DOI: 10.48550/arXiv.2210.06825
Ali Behrouz, Mathias Lécuyer, C. Rudin, M. Seltzer
Sparse decision trees are one of the most common forms of interpretable models. While recent advances have produced algorithms that fully optimize sparse decision trees for prediction, that work does not address policy design, because the algorithms cannot handle weighted data samples. Specifically, they rely on the discreteness of the loss function, which means that real-valued weights cannot be directly used. For example, none of the existing techniques produce policies that incorporate inverse propensity weighting on individual data points. We present three algorithms for efficient sparse weighted decision tree optimization. The first approach directly optimizes the weighted loss function; however, it tends to be computationally inefficient for large datasets. Our second approach, which scales more efficiently, transforms weights to integer values and uses data duplication to transform the weighted decision tree optimization problem into an unweighted (but larger) counterpart. Our third algorithm, which scales to much larger datasets, uses a randomized procedure that samples each data point with a probability proportional to its weight. We present theoretical bounds on the error of the two fast methods and show experimentally that these methods can be two orders of magnitude faster than the direct optimization of the weighted loss, without losing significant accuracy.
稀疏决策树是可解释模型中最常见的形式之一。虽然最近的进展已经产生了完全优化稀疏决策树进行预测的算法,但这项工作并没有涉及策略设计,因为这些算法无法处理加权数据样本。具体来说,它们依赖于损失函数的离散性,这意味着不能直接使用实值权重。例如,现有的技术都没有产生在单个数据点上包含反向倾向加权的政策。我们提出了三种有效的稀疏加权决策树优化算法。第一种方法直接优化加权损失函数;然而,对于大型数据集,它往往在计算上效率低下。我们的第二种方法更有效地扩展,将权重转换为整数值,并使用数据复制将加权决策树优化问题转换为未加权(但更大)的对应问题。我们的第三种算法可扩展到更大的数据集,它使用随机过程,以与权重成比例的概率对每个数据点进行采样。我们给出了两种快速方法误差的理论界限,并通过实验表明,这些方法可以比直接优化加权损耗快两个数量级,而不会损失显著的精度。
{"title":"Fast optimization of weighted sparse decision trees for use in optimal treatment regimes and optimal policy design","authors":"Ali Behrouz, Mathias Lécuyer, C. Rudin, M. Seltzer","doi":"10.48550/arXiv.2210.06825","DOIUrl":"https://doi.org/10.48550/arXiv.2210.06825","url":null,"abstract":"Sparse decision trees are one of the most common forms of interpretable models. While recent advances have produced algorithms that fully optimize sparse decision trees for prediction, that work does not address policy design, because the algorithms cannot handle weighted data samples. Specifically, they rely on the discreteness of the loss function, which means that real-valued weights cannot be directly used. For example, none of the existing techniques produce policies that incorporate inverse propensity weighting on individual data points. We present three algorithms for efficient sparse weighted decision tree optimization. The first approach directly optimizes the weighted loss function; however, it tends to be computationally inefficient for large datasets. Our second approach, which scales more efficiently, transforms weights to integer values and uses data duplication to transform the weighted decision tree optimization problem into an unweighted (but larger) counterpart. Our third algorithm, which scales to much larger datasets, uses a randomized procedure that samples each data point with a probability proportional to its weight. We present theoretical bounds on the error of the two fast methods and show experimentally that these methods can be two orders of magnitude faster than the direct optimization of the weighted loss, without losing significant accuracy.","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"3318 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45358116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A community effort for COVID-19 Ontology Harmonization. COVID-19 本体协调的社区努力。
Pub Date : 2022-01-01 Epub Date: 2022-01-28
Asiyah Yu Lin, Yuki Yamagata, William D Duncan, Leigh C Carmody, Tatsuya Kushida, Hiroshi Masuya, John Beverley, Biswanath Dutta, Michael DeBellis, Zoë May Pendlington, Paola Roncaglia, Yongqun He

Ontologies have emerged to become critical to support data and knowledge representation, standardization, integration, and analysis. The SARS-CoV-2 pandemic led to the rapid proliferation of COVID-19 data, as well as the development of many COVID-19 ontologies. In the interest of supporting data interoperability, we initiated a community-based effort to harmonize COVID-19 ontologies. Our effort involves the collaborative discussion among developers of seven COVID-19 related ontologies, and the merging of four ontologies. This effort demonstrates the feasibility of harmonizing these ontologies in an interoperable framework to support integrative representation and analysis of COVID-19 related data and knowledge.

本体论已成为支持数据和知识表示、标准化、集成和分析的关键。SARS-CoV-2 大流行导致 COVID-19 数据迅速扩散,同时也催生了许多 COVID-19 本体的开发。为了支持数据互操作性,我们发起了一项基于社区的工作,以协调 COVID-19 本体。我们的工作涉及七个 COVID-19 相关本体的开发人员之间的合作讨论,以及四个本体的合并。这项工作证明了在一个可互操作的框架内协调这些本体的可行性,以支持 COVID-19 相关数据和知识的综合表示和分析。
{"title":"A community effort for COVID-19 Ontology Harmonization.","authors":"Asiyah Yu Lin, Yuki Yamagata, William D Duncan, Leigh C Carmody, Tatsuya Kushida, Hiroshi Masuya, John Beverley, Biswanath Dutta, Michael DeBellis, Zoë May Pendlington, Paola Roncaglia, Yongqun He","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Ontologies have emerged to become critical to support data and knowledge representation, standardization, integration, and analysis. The SARS-CoV-2 pandemic led to the rapid proliferation of COVID-19 data, as well as the development of many COVID-19 ontologies. In the interest of supporting data interoperability, we initiated a community-based effort to harmonize COVID-19 ontologies. Our effort involves the collaborative discussion among developers of seven COVID-19 related ontologies, and the merging of four ontologies. This effort demonstrates the feasibility of harmonizing these ontologies in an interoperable framework to support integrative representation and analysis of COVID-19 related data and knowledge.</p>","PeriodicalId":72554,"journal":{"name":"CEUR workshop proceedings","volume":"3073 ","pages":"122-127"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10262777/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10024339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
CEUR workshop proceedings
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1