首页 > 最新文献

arXiv - CS - Computers and Society最新文献

英文 中文
Inside Out or Not: Privacy Implications of Emotional Disclosure 内向与否:情感披露对隐私的影响
Pub Date : 2024-09-18 DOI: arxiv-2409.11805
Elham Naghizade, Kaixin Ji, Benjamin Tag, Flora Salim
Privacy is dynamic, sensitive, and contextual, much like our emotions.Previous studies have explored the interplay between privacy and context,privacy and emotion, and emotion and context. However, there remains asignificant gap in understanding the interplay of these aspects simultaneously.In this paper, we present a preliminary study investigating the role ofemotions in driving individuals' information sharing behaviour, particularly inrelation to urban locations and social ties. We adopt a novel methodology thatintegrates context (location and time), emotion, and personal informationsharing behaviour, providing a comprehensive analysis of how contextualemotions affect privacy. The emotions are assessed with both self-reporting andelectrodermal activity (EDA). Our findings reveal that self-reported emotionsinfluence personal information-sharing behaviour with distant social groups,while neutral emotions lead individuals to share less precise information withclose social circles, a pattern is potentially detectable with wrist-worn EDA.Our study helps lay the foundation for personalised emotion-aware strategies tomitigate oversharing risks and enhance user privacy in the digital age.
隐私是动态的、敏感的、与情境相关的,就像我们的情绪一样。以前的研究已经探索了隐私与情境、隐私与情绪、情绪与情境之间的相互作用。在本文中,我们提出了一项初步研究,调查情绪在推动个人信息共享行为中的作用,尤其是与城市地点和社会关系的关系。我们采用了一种新颖的方法,将情境(地点和时间)、情绪和个人信息分享行为结合起来,对情境情绪如何影响隐私进行了全面分析。情绪通过自我报告和耳皮活动(EDA)进行评估。我们的研究结果表明,自我报告的情绪会影响与较远社交群体的个人信息分享行为,而中性情绪则会导致个人与较近社交圈的人分享较不精确的信息,这种模式有可能通过腕戴式 EDA 检测到。
{"title":"Inside Out or Not: Privacy Implications of Emotional Disclosure","authors":"Elham Naghizade, Kaixin Ji, Benjamin Tag, Flora Salim","doi":"arxiv-2409.11805","DOIUrl":"https://doi.org/arxiv-2409.11805","url":null,"abstract":"Privacy is dynamic, sensitive, and contextual, much like our emotions.\u0000Previous studies have explored the interplay between privacy and context,\u0000privacy and emotion, and emotion and context. However, there remains a\u0000significant gap in understanding the interplay of these aspects simultaneously.\u0000In this paper, we present a preliminary study investigating the role of\u0000emotions in driving individuals' information sharing behaviour, particularly in\u0000relation to urban locations and social ties. We adopt a novel methodology that\u0000integrates context (location and time), emotion, and personal information\u0000sharing behaviour, providing a comprehensive analysis of how contextual\u0000emotions affect privacy. The emotions are assessed with both self-reporting and\u0000electrodermal activity (EDA). Our findings reveal that self-reported emotions\u0000influence personal information-sharing behaviour with distant social groups,\u0000while neutral emotions lead individuals to share less precise information with\u0000close social circles, a pattern is potentially detectable with wrist-worn EDA.\u0000Our study helps lay the foundation for personalised emotion-aware strategies to\u0000mitigate oversharing risks and enhance user privacy in the digital age.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Law-based and standards-oriented approach for privacy impact assessment in medical devices: a topic for lawyers, engineers and healthcare practitioners in MedTech 以法律和标准为导向的医疗器械隐私影响评估方法:医疗技术领域律师、工程师和医疗从业人员的话题
Pub Date : 2024-09-18 DOI: arxiv-2409.11845
Yuri R. Ladeia, David M. Pereira
Background: The integration of the General Data Protection Regulation (GDPR)and the Medical Device Regulation (MDR) creates complexities in conducting DataProtection Impact Assessments (DPIAs) for medical devices. The adoption ofnon-binding standards like ISO and IEC can harmonize these processes byenhancing accountability and privacy by design. Methods: This study employs amultidisciplinary literature review, focusing on GDPR and MDR intersection inmedical devices that process personal health data. It evaluates key standards,including ISO/IEC 29134 and IEC 62304, to propose a unified approach for DPIAsthat aligns with legal and technical frameworks. Results: The analysis revealsthe benefits of integrating ISO/IEC standards into DPIAs, which providedetailed guidance on implementing privacy by design, risk assessment, andmitigation strategies specific to medical devices. The proposed frameworkensures that DPIAs are living documents, continuously updated to adapt toevolving data protection challenges. Conclusions: A unified approach combiningEuropean Union (EU) regulations and international standards offers a robustframework for conducting DPIAs in medical devices. This integration balancessecurity, innovation, and privacy, enhancing compliance and fostering trust inmedical technologies. The study advocates for leveraging both hard law andstandards to systematically address privacy and safety in the design andoperation of medical devices, thereby raising the maturity of the MedTechecosystem.
背景:一般数据保护条例》(GDPR)和《医疗器械条例》(MDR)的整合给医疗器械的数据保护影响评估(DPIA)带来了复杂性。采用 ISO 和 IEC 等非约束性标准可以通过加强责任和隐私设计来协调这些流程。方法:本研究采用多学科文献综述的方法,重点关注处理个人健康数据的医疗设备与 GDPR 和 MDR 的交叉点。它评估了包括 ISO/IEC 29134 和 IEC 62304 在内的主要标准,提出了一种与法律和技术框架相一致的 DPIA 统一方法。结果:分析揭示了将 ISO/IEC 标准整合到 DPIA 中的益处,这些标准为实施医疗设备特有的隐私设计、风险评估和缓解策略提供了详细指导。建议的框架可确保 DPIA 成为有生命力的文件,不断更新以适应不断变化的数据保护挑战。结论:结合欧盟 (EU) 法规和国际标准的统一方法为在医疗设备中开展 DPIA 提供了一个强大的框架。这种整合平衡了安全性、创新性和隐私性,提高了合规性,促进了对医疗技术的信任。该研究主张利用硬性法律和标准系统地解决医疗器械设计和操作中的隐私和安全问题,从而提高医疗技术生态系统的成熟度。
{"title":"Law-based and standards-oriented approach for privacy impact assessment in medical devices: a topic for lawyers, engineers and healthcare practitioners in MedTech","authors":"Yuri R. Ladeia, David M. Pereira","doi":"arxiv-2409.11845","DOIUrl":"https://doi.org/arxiv-2409.11845","url":null,"abstract":"Background: The integration of the General Data Protection Regulation (GDPR)\u0000and the Medical Device Regulation (MDR) creates complexities in conducting Data\u0000Protection Impact Assessments (DPIAs) for medical devices. The adoption of\u0000non-binding standards like ISO and IEC can harmonize these processes by\u0000enhancing accountability and privacy by design. Methods: This study employs a\u0000multidisciplinary literature review, focusing on GDPR and MDR intersection in\u0000medical devices that process personal health data. It evaluates key standards,\u0000including ISO/IEC 29134 and IEC 62304, to propose a unified approach for DPIAs\u0000that aligns with legal and technical frameworks. Results: The analysis reveals\u0000the benefits of integrating ISO/IEC standards into DPIAs, which provide\u0000detailed guidance on implementing privacy by design, risk assessment, and\u0000mitigation strategies specific to medical devices. The proposed framework\u0000ensures that DPIAs are living documents, continuously updated to adapt to\u0000evolving data protection challenges. Conclusions: A unified approach combining\u0000European Union (EU) regulations and international standards offers a robust\u0000framework for conducting DPIAs in medical devices. This integration balances\u0000security, innovation, and privacy, enhancing compliance and fostering trust in\u0000medical technologies. The study advocates for leveraging both hard law and\u0000standards to systematically address privacy and safety in the design and\u0000operation of medical devices, thereby raising the maturity of the MedTech\u0000ecosystem.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Idiosyncratic properties of Australian STV election counting 澳大利亚 STV 选举计票的失常特性
Pub Date : 2024-09-18 DOI: arxiv-2409.11627
Andrew Conway, Michelle Blom, Alexander Ek, Peter J. Stuckey, Vanessa J. Teague, Damjan Vukcevic
Single Transferable Vote (STV) counting, used in several jurisdictions inAustralia, is a system for choosing multiple election winners given voters'preferences over candidates. There are a variety of different versions of STVlegislated and/or applied across Australia. This paper shows some of theunintuitive properties of some of these systems.
澳大利亚多个司法管辖区采用的 "单一可转移投票"(STV)计票是一种根据选民对候选人的偏好选择多个选举获胜者的系统。澳大利亚各地立法和/或应用的可转移投票有多种不同版本。本文展示了其中一些系统的直观特性。
{"title":"Idiosyncratic properties of Australian STV election counting","authors":"Andrew Conway, Michelle Blom, Alexander Ek, Peter J. Stuckey, Vanessa J. Teague, Damjan Vukcevic","doi":"arxiv-2409.11627","DOIUrl":"https://doi.org/arxiv-2409.11627","url":null,"abstract":"Single Transferable Vote (STV) counting, used in several jurisdictions in\u0000Australia, is a system for choosing multiple election winners given voters'\u0000preferences over candidates. There are a variety of different versions of STV\u0000legislated and/or applied across Australia. This paper shows some of the\u0000unintuitive properties of some of these systems.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reporting Non-Consensual Intimate Media: An Audit Study of Deepfakes 报告未经同意的亲密媒体:深度伪造的审计研究
Pub Date : 2024-09-18 DOI: arxiv-2409.12138
Li Qiwei, Shihui Zhang, Andrew Timothy Kasper, Joshua Ashkinaze, Asia A. Eaton, Sarita Schoenebeck, Eric Gilbert
Non-consensual intimate media (NCIM) inflicts significant harm. Currently,victim-survivors can use two mechanisms to report NCIM - as a non-consensualnudity violation or as copyright infringement. We conducted an audit study oftakedown speed of NCIM reported to X (formerly Twitter) of both mechanisms. Weuploaded 50 AI-generated nude images and reported half under X's"non-consensual nudity" reporting mechanism and half under its "copyrightinfringement" mechanism. The copyright condition resulted in successful imageremoval within 25 hours for all images (100% removal rate), whilenon-consensual nudity reports resulted in no image removal for over three weeks(0% removal rate). We stress the need for targeted legislation to regulate NCIMremoval online. We also discuss ethical considerations for auditing NCIM onsocial platforms.
未经同意的亲密媒体(NCIM)会造成重大伤害。目前,受害者-幸存者可以使用两种机制报告非自愿亲密媒体--作为非自愿裸露侵犯或作为版权侵犯。我们对通过这两种机制向 X(前 Twitter)报告的 NCIM 的删除速度进行了审计研究。我们上传了 50 张人工智能生成的裸体图片,一半按照 X 的 "未经同意的裸体 "报告机制进行报告,一半按照其 "侵犯版权 "机制进行报告。版权条件导致所有图像在 25 小时内被成功删除(删除率为 100%),而非自愿裸体报告导致图像在三周内未被删除(删除率为 0%)。我们强调有必要制定有针对性的立法来规范网络上的 NCIM 删除行为。我们还讨论了审核社交平台上的非自愿裸体信息的道德考虑因素。
{"title":"Reporting Non-Consensual Intimate Media: An Audit Study of Deepfakes","authors":"Li Qiwei, Shihui Zhang, Andrew Timothy Kasper, Joshua Ashkinaze, Asia A. Eaton, Sarita Schoenebeck, Eric Gilbert","doi":"arxiv-2409.12138","DOIUrl":"https://doi.org/arxiv-2409.12138","url":null,"abstract":"Non-consensual intimate media (NCIM) inflicts significant harm. Currently,\u0000victim-survivors can use two mechanisms to report NCIM - as a non-consensual\u0000nudity violation or as copyright infringement. We conducted an audit study of\u0000takedown speed of NCIM reported to X (formerly Twitter) of both mechanisms. We\u0000uploaded 50 AI-generated nude images and reported half under X's\u0000\"non-consensual nudity\" reporting mechanism and half under its \"copyright\u0000infringement\" mechanism. The copyright condition resulted in successful image\u0000removal within 25 hours for all images (100% removal rate), while\u0000non-consensual nudity reports resulted in no image removal for over three weeks\u0000(0% removal rate). We stress the need for targeted legislation to regulate NCIM\u0000removal online. We also discuss ethical considerations for auditing NCIM on\u0000social platforms.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"18 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gender Representation and Bias in Indian Civil Service Mock Interviews 印度公务员模拟面试中的性别代表性和偏见
Pub Date : 2024-09-18 DOI: arxiv-2409.12194
Somonnoy Banerjee, Sujan Dutta, Soumyajit Datta, Ashiqur R. KhudaBukhsh
This paper makes three key contributions. First, via a substantial corpus of51,278 interview questions sourced from 888 YouTube videos of mock interviewsof Indian civil service candidates, we demonstrate stark gender bias in thebroad nature of questions asked to male and female candidates. Second, ourexperiments with large language models show a strong presence of gender bias inexplanations provided by the LLMs on the gender inference task. Finally, wepresent a novel dataset of 51,278 interview questions that can inform futuresocial science studies.
本文有三个主要贡献。首先,我们通过从 888 个 YouTube 模拟印度公务员候选人面试视频中获取的 51 278 个面试问题的大量语料库,证明了向男性和女性候选人提出的问题在广泛性上存在明显的性别偏见。其次,我们使用大型语言模型进行的实验表明,在性别推断任务中,LLMs 提供的解释存在强烈的性别偏见。最后,我们展示了一个包含 51,278 个面试问题的新数据集,该数据集可为未来的社会科学研究提供参考。
{"title":"Gender Representation and Bias in Indian Civil Service Mock Interviews","authors":"Somonnoy Banerjee, Sujan Dutta, Soumyajit Datta, Ashiqur R. KhudaBukhsh","doi":"arxiv-2409.12194","DOIUrl":"https://doi.org/arxiv-2409.12194","url":null,"abstract":"This paper makes three key contributions. First, via a substantial corpus of\u000051,278 interview questions sourced from 888 YouTube videos of mock interviews\u0000of Indian civil service candidates, we demonstrate stark gender bias in the\u0000broad nature of questions asked to male and female candidates. Second, our\u0000experiments with large language models show a strong presence of gender bias in\u0000explanations provided by the LLMs on the gender inference task. Finally, we\u0000present a novel dataset of 51,278 interview questions that can inform future\u0000social science studies.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"307 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Strategic Insights in Human and Large Language Model Tactics at Word Guessing Games 人类和大型语言模型在猜词游戏中的策略见解
Pub Date : 2024-09-17 DOI: arxiv-2409.11112
Matīss Rikters, Sanita Reinsone
At the beginning of 2022, a simplistic word-guessing game took the world bystorm and was further adapted to many languages beyond the original Englishversion. In this paper, we examine the strategies of daily word-guessing gameplayers that have evolved during a period of over two years. A survey gatheredfrom 25% of frequent players reveals their strategies and motivations forcontinuing the daily journey. We also explore the capability of several popularopen-access large language model systems and open-source models atcomprehending and playing the game in two different languages. Resultshighlight the struggles of certain models to maintain correct guess length andgenerate repetitions, as well as hallucinations of non-existent words andinflections.
2022 年初,一款简单的猜词游戏风靡全球,除了最初的英文版本外,还被进一步改编为多种语言。在本文中,我们研究了日常猜词游戏玩家在两年多时间里逐渐形成的策略。通过对 25% 的常玩游戏者进行调查,我们发现了他们的策略和继续每日游戏的动机。我们还探索了几种流行的开放式大型语言模型系统和开源模型在用两种不同语言理解和玩游戏方面的能力。研究结果突显了某些模型在保持正确的猜测长度和产生重复方面的困难,以及对不存在的单词和转折词产生幻觉的问题。
{"title":"Strategic Insights in Human and Large Language Model Tactics at Word Guessing Games","authors":"Matīss Rikters, Sanita Reinsone","doi":"arxiv-2409.11112","DOIUrl":"https://doi.org/arxiv-2409.11112","url":null,"abstract":"At the beginning of 2022, a simplistic word-guessing game took the world by\u0000storm and was further adapted to many languages beyond the original English\u0000version. In this paper, we examine the strategies of daily word-guessing game\u0000players that have evolved during a period of over two years. A survey gathered\u0000from 25% of frequent players reveals their strategies and motivations for\u0000continuing the daily journey. We also explore the capability of several popular\u0000open-access large language model systems and open-source models at\u0000comprehending and playing the game in two different languages. Results\u0000highlight the struggles of certain models to maintain correct guess length and\u0000generate repetitions, as well as hallucinations of non-existent words and\u0000inflections.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"48 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Role of AI Safety Institutes in Contributing to International Standards for Frontier AI Safety 人工智能安全机构在促进制定人工智能前沿安全国际标准方面的作用
Pub Date : 2024-09-17 DOI: arxiv-2409.11314
Kristina Fort
International standards are crucial for ensuring that frontier AI systems aredeveloped and deployed safely around the world. Since the AI Safety Institutes(AISIs) possess in-house technical expertise, mandate for internationalengagement, and convening power in the national AI ecosystem while being agovernment institution, we argue that they are particularly well-positioned tocontribute to the international standard-setting processes for AI safety. Inthis paper, we propose and evaluate three models for AISI involvement: 1. SeoulDeclaration Signatories, 2. US (and other Seoul Declaration Signatories) andChina, and 3. Globally Inclusive. Leveraging their diverse strengths, thesemodels are not mutually exclusive. Rather, they offer a multi-track systemsolution in which the central role of AISIs guarantees coherence among thedifferent tracks and consistency in their AI safety focus.
国际标准对于确保前沿人工智能系统在全球安全开发和部署至关重要。由于人工智能安全研究所(AISI)拥有内部技术专长、国际参与授权以及在国家人工智能生态系统中的号召力,同时又是政府机构,因此我们认为它们特别适合为人工智能安全的国际标准制定过程做出贡献。在本文中,我们提出并评估了 AISI 参与的三种模式:1.首尔宣言》签署国;2.美国(及其他《首尔宣言》签署国)和中国;3.全球包容性。全球包容。利用各自不同的优势,这些模式并非相互排斥。相反,它们提供了一个多轨系统解决方案,其中,机构间创新倡议的核心作用保证了不同轨道之间的一致性及其人工智能安全重点的一致性。
{"title":"The Role of AI Safety Institutes in Contributing to International Standards for Frontier AI Safety","authors":"Kristina Fort","doi":"arxiv-2409.11314","DOIUrl":"https://doi.org/arxiv-2409.11314","url":null,"abstract":"International standards are crucial for ensuring that frontier AI systems are\u0000developed and deployed safely around the world. Since the AI Safety Institutes\u0000(AISIs) possess in-house technical expertise, mandate for international\u0000engagement, and convening power in the national AI ecosystem while being a\u0000government institution, we argue that they are particularly well-positioned to\u0000contribute to the international standard-setting processes for AI safety. In\u0000this paper, we propose and evaluate three models for AISI involvement: 1. Seoul\u0000Declaration Signatories, 2. US (and other Seoul Declaration Signatories) and\u0000China, and 3. Globally Inclusive. Leveraging their diverse strengths, these\u0000models are not mutually exclusive. Rather, they offer a multi-track system\u0000solution in which the central role of AISIs guarantees coherence among the\u0000different tracks and consistency in their AI safety focus.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beyond Algorithmic Fairness: A Guide to Develop and Deploy Ethical AI-Enabled Decision-Support Tools 超越算法公平:开发和部署道德人工智能决策支持工具指南
Pub Date : 2024-09-17 DOI: arxiv-2409.11489
Rosemarie Santa Gonzalez, Ryan Piansky, Sue M Bae, Justin Biddle, Daniel Molzahn
The integration of artificial intelligence (AI) and optimization holdsubstantial promise for improving the efficiency, reliability, and resilienceof engineered systems. Due to the networked nature of many engineered systems,ethically deploying methodologies at this intersection poses challenges thatare distinct from other AI settings, thus motivating the development of ethicalguidelines tailored to AI-enabled optimization. This paper highlights the needto go beyond fairness-driven algorithms to systematically address ethicaldecisions spanning the stages of modeling, data curation, results analysis, andimplementation of optimization-based decision support tools. Accordingly, thispaper identifies ethical considerations required when deploying algorithms atthe intersection of AI and optimization via case studies in power systems aswell as supply chain and logistics. Rather than providing a prescriptive set ofrules, this paper aims to foster reflection and awareness among researchers andencourage consideration of ethical implications at every step of thedecision-making process.
人工智能(AI)与优化的结合为提高工程系统的效率、可靠性和复原力带来了巨大希望。由于许多工程系统具有网络化的特性,在这一交叉点上部署伦理方法面临着有别于其他人工智能环境的挑战,因此促使人们制定专门针对人工智能优化的伦理准则。本文强调有必要超越公平驱动的算法,系统地解决建模、数据整理、结果分析和基于优化的决策支持工具的实施等阶段的伦理决策问题。因此,本文通过对电力系统、供应链和物流的案例研究,指出了在人工智能与优化交叉领域部署算法时需要考虑的伦理问题。本文的目的不是提供一套规范性的规则,而是促进研究人员的反思和认识,并鼓励他们在决策过程的每一步都考虑伦理影响。
{"title":"Beyond Algorithmic Fairness: A Guide to Develop and Deploy Ethical AI-Enabled Decision-Support Tools","authors":"Rosemarie Santa Gonzalez, Ryan Piansky, Sue M Bae, Justin Biddle, Daniel Molzahn","doi":"arxiv-2409.11489","DOIUrl":"https://doi.org/arxiv-2409.11489","url":null,"abstract":"The integration of artificial intelligence (AI) and optimization hold\u0000substantial promise for improving the efficiency, reliability, and resilience\u0000of engineered systems. Due to the networked nature of many engineered systems,\u0000ethically deploying methodologies at this intersection poses challenges that\u0000are distinct from other AI settings, thus motivating the development of ethical\u0000guidelines tailored to AI-enabled optimization. This paper highlights the need\u0000to go beyond fairness-driven algorithms to systematically address ethical\u0000decisions spanning the stages of modeling, data curation, results analysis, and\u0000implementation of optimization-based decision support tools. Accordingly, this\u0000paper identifies ethical considerations required when deploying algorithms at\u0000the intersection of AI and optimization via case studies in power systems as\u0000well as supply chain and logistics. Rather than providing a prescriptive set of\u0000rules, this paper aims to foster reflection and awareness among researchers and\u0000encourage consideration of ethical implications at every step of the\u0000decision-making process.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"210 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142269355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gaps or Hallucinations? Gazing into Machine-Generated Legal Analysis for Fine-grained Text Evaluations 空白还是幻觉?凝视机器生成的法律分析,进行精细文本评估
Pub Date : 2024-09-16 DOI: arxiv-2409.09947
Abe Bohan Hou, William Jurayj, Nils Holzenberger, Andrew Blair-Stanek, Benjamin Van Durme
Large Language Models (LLMs) show promise as a writing aid for professionalsperforming legal analyses. However, LLMs can often hallucinate in this setting,in ways difficult to recognize by non-professionals and existing textevaluation metrics. In this work, we pose the question: when canmachine-generated legal analysis be evaluated as acceptable? We introduce theneutral notion of gaps, as opposed to hallucinations in a strict erroneoussense, to refer to the difference between human-written and machine-generatedlegal analysis. Gaps do not always equate to invalid generation. Working withlegal experts, we consider the CLERC generation task proposed in Hou et al.(2024b), leading to a taxonomy, a fine-grained detector for predicting gapcategories, and an annotated dataset for automatic evaluation. Our bestdetector achieves 67% F1 score and 80% precision on the test set. Employingthis detector as an automated metric on legal analysis generated by SOTA LLMs,we find around 80% contain hallucinations of different kinds.
大型语言模型(LLM)有望成为专业人士进行法律分析的写作辅助工具。然而,LLM 在这种情况下往往会产生幻觉,非专业人士和现有的文本评估指标很难识别。在这项工作中,我们提出了这样一个问题:什么时候机器生成的法律分析可以被评价为可接受的?我们引入了 "空白 "这一中性概念,与严格意义上的 "幻觉 "相对,指的是人工撰写的法律分析与机器生成的法律分析之间的差异。空白并不总是等同于无效生成。通过与法律专家合作,我们考虑了 Hou 等人(2024b)提出的 CLERC 生成任务,并由此产生了一个分类法、一个用于预测空白类别的细粒度检测器和一个用于自动评估的注释数据集。我们的最佳检测器在测试集上取得了 67% 的 F1 分数和 80% 的精确度。将该检测器作为自动度量标准用于 SOTA LLM 生成的法律分析,我们发现约 80% 的分析包含不同类型的幻觉。
{"title":"Gaps or Hallucinations? Gazing into Machine-Generated Legal Analysis for Fine-grained Text Evaluations","authors":"Abe Bohan Hou, William Jurayj, Nils Holzenberger, Andrew Blair-Stanek, Benjamin Van Durme","doi":"arxiv-2409.09947","DOIUrl":"https://doi.org/arxiv-2409.09947","url":null,"abstract":"Large Language Models (LLMs) show promise as a writing aid for professionals\u0000performing legal analyses. However, LLMs can often hallucinate in this setting,\u0000in ways difficult to recognize by non-professionals and existing text\u0000evaluation metrics. In this work, we pose the question: when can\u0000machine-generated legal analysis be evaluated as acceptable? We introduce the\u0000neutral notion of gaps, as opposed to hallucinations in a strict erroneous\u0000sense, to refer to the difference between human-written and machine-generated\u0000legal analysis. Gaps do not always equate to invalid generation. Working with\u0000legal experts, we consider the CLERC generation task proposed in Hou et al.\u0000(2024b), leading to a taxonomy, a fine-grained detector for predicting gap\u0000categories, and an annotated dataset for automatic evaluation. Our best\u0000detector achieves 67% F1 score and 80% precision on the test set. Employing\u0000this detector as an automated metric on legal analysis generated by SOTA LLMs,\u0000we find around 80% contain hallucinations of different kinds.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"21 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142269358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LLMs as information warriors? Auditing how LLM-powered chatbots tackle disinformation about Russia's war in Ukraine 法学硕士是信息战士?审核由 LLM 驱动的聊天机器人如何处理有关俄罗斯乌克兰战争的虚假信息
Pub Date : 2024-09-16 DOI: arxiv-2409.10697
Mykola Makhortykh, Ani Baghumyan, Victoria Vziatysheva, Maryna Sydorova, Elizaveta Kuznetsova
The rise of large language models (LLMs) has a significant impact oninformation warfare. By facilitating the production of content related todisinformation and propaganda campaigns, LLMs can amplify different types ofinformation operations and mislead online users. In our study, we empiricallyinvestigate how LLM-powered chatbots, developed by Google, Microsoft, andPerplexity, handle disinformation about Russia's war in Ukraine and whether thechatbots' ability to provide accurate information on the topic varies acrosslanguages and over time. Our findings indicate that while for some chatbots(Perplexity), there is a significant improvement in performance over time inseveral languages, for others (Gemini), the performance improves only inEnglish but deteriorates in low-resource languages.
大型语言模型(LLM)的兴起对信息战产生了重大影响。通过促进制作与虚假信息和宣传活动相关的内容,LLMs 可以放大不同类型的信息行动并误导在线用户。在我们的研究中,我们实证调查了由谷歌、微软和Perplexity开发的由LLM驱动的聊天机器人是如何处理有关俄罗斯在乌克兰战争的虚假信息的,以及这些聊天机器人提供有关该主题的准确信息的能力是否因语言和时间而异。我们的研究结果表明,对于某些聊天机器人(Perplexity)来说,随着时间的推移,其在多种语言中的表现都有显著提高,而对于其他聊天机器人(Gemini)来说,只有在英语中的表现有所提高,而在低资源语言中的表现则有所下降。
{"title":"LLMs as information warriors? Auditing how LLM-powered chatbots tackle disinformation about Russia's war in Ukraine","authors":"Mykola Makhortykh, Ani Baghumyan, Victoria Vziatysheva, Maryna Sydorova, Elizaveta Kuznetsova","doi":"arxiv-2409.10697","DOIUrl":"https://doi.org/arxiv-2409.10697","url":null,"abstract":"The rise of large language models (LLMs) has a significant impact on\u0000information warfare. By facilitating the production of content related to\u0000disinformation and propaganda campaigns, LLMs can amplify different types of\u0000information operations and mislead online users. In our study, we empirically\u0000investigate how LLM-powered chatbots, developed by Google, Microsoft, and\u0000Perplexity, handle disinformation about Russia's war in Ukraine and whether the\u0000chatbots' ability to provide accurate information on the topic varies across\u0000languages and over time. Our findings indicate that while for some chatbots\u0000(Perplexity), there is a significant improvement in performance over time in\u0000several languages, for others (Gemini), the performance improves only in\u0000English but deteriorates in low-resource languages.","PeriodicalId":501112,"journal":{"name":"arXiv - CS - Computers and Society","volume":"93 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Computers and Society
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1