首页 > 最新文献

arXiv - STAT - Other Statistics最新文献

英文 中文
FAVis: Visual Analytics of Factor Analysis for Psychological Research FAVis:心理研究因素分析的可视化分析
Pub Date : 2024-07-19 DOI: arxiv-2407.14072
Yikai Lu, Chaoli Wang
Psychological research often involves understanding psychological constructsthrough conducting factor analysis on data collected by a questionnaire, whichcan comprise hundreds of questions. Without interactive systems forinterpreting factor models, researchers are frequently exposed to subjectivity,potentially leading to misinterpretations or overlooked crucial information.This paper introduces FAVis, a novel interactive visualization tool designed toaid researchers in interpreting and evaluating factor analysis results. FAVisenhances the understanding of relationships between variables and factors bysupporting multiple views for visualizing factor loadings and correlations,allowing users to analyze information from various perspectives. The primaryfeature of FAVis is to enable users to set optimal thresholds for factorloadings to balance clarity and information retention. FAVis also allows usersto assign tags to variables, enhancing the understanding of factors by linkingthem to their associated psychological constructs. Our user study demonstratesthe utility of FAVis in various tasks.
心理学研究通常涉及通过对由数百个问题组成的问卷所收集的数据进行因子分析来理解心理建构。如果没有交互式系统来解释因子模型,研究人员就会经常受到主观因素的影响,从而可能导致误解或忽略关键信息。本文介绍的 FAVis 是一种新型交互式可视化工具,旨在帮助研究人员解释和评估因子分析结果。FAVis 支持多种视图来可视化因子载荷和相关性,允许用户从不同角度分析信息,从而加深了对变量和因子之间关系的理解。FAVis 的主要功能是让用户能够为因子载荷设置最佳阈值,从而在清晰度和信息保留之间取得平衡。FAVis 还允许用户为变量分配标签,通过将它们与相关的心理结构联系起来,加深对因素的理解。我们的用户研究证明了 FAVis 在各种任务中的实用性。
{"title":"FAVis: Visual Analytics of Factor Analysis for Psychological Research","authors":"Yikai Lu, Chaoli Wang","doi":"arxiv-2407.14072","DOIUrl":"https://doi.org/arxiv-2407.14072","url":null,"abstract":"Psychological research often involves understanding psychological constructs\u0000through conducting factor analysis on data collected by a questionnaire, which\u0000can comprise hundreds of questions. Without interactive systems for\u0000interpreting factor models, researchers are frequently exposed to subjectivity,\u0000potentially leading to misinterpretations or overlooked crucial information.\u0000This paper introduces FAVis, a novel interactive visualization tool designed to\u0000aid researchers in interpreting and evaluating factor analysis results. FAVis\u0000enhances the understanding of relationships between variables and factors by\u0000supporting multiple views for visualizing factor loadings and correlations,\u0000allowing users to analyze information from various perspectives. The primary\u0000feature of FAVis is to enable users to set optimal thresholds for factor\u0000loadings to balance clarity and information retention. FAVis also allows users\u0000to assign tags to variables, enhancing the understanding of factors by linking\u0000them to their associated psychological constructs. Our user study demonstrates\u0000the utility of FAVis in various tasks.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141744941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying Research Hotspots and Future Development Trends in Current Psychology: A Bibliometric Analysis of the Past Decade's Publications 识别当前心理学的研究热点和未来发展趋势:过去十年出版物的文献计量分析
Pub Date : 2024-07-18 DOI: arxiv-2407.13495
Shen Liu, Yan Yang
By conducting a bibliometric analysis on 4,869 publications in CurrentPsychology from 2013 to 2022, this paper examined the annual publications andannual citations, as well as the leading institutions, countries, and keywords.CiteSpace, VOSviewer and SCImago Graphica were utilized for visualizationanalysis. On one hand, this paper analyzed the academic influence of CurrentPsychology over the past decade. On the other hand, it explored the researchhotspots and future development trends within the field of internationalpsychology. The results revealed that the three main research areas covered inthe publications of Current Psychology were: the psychological well-being ofyoung people, the negative emotions of adults, and self-awareness andmanagement. The latest research hotspots highlighted in the journal includenegative emotions, personality, and mental health. The three main developmenttrends of Current Psychology are: 1) exploring the personality psychology ofboth adolescents and adults, 2) promoting the interdisciplinary research tostudy social psychological issues through the use of diversified researchmethods, and 3) emphasizing the emotional psychology of individuals and theirinteraction with social reality, from a people-oriented perspective.
本文通过对2013年至2022年CurrentPsychology上发表的4869篇论文进行文献计量分析,考察了论文的年发表量和年被引频次,以及主要机构、国家和关键词等,并利用CiteSpace、VOSviewer和SCImago Graphica进行了可视化分析。一方面,本文分析了《当代心理学》在过去十年中的学术影响力。另一方面,探讨了国际心理学领域的研究热点和未来发展趋势。结果显示,《当代心理学》刊物涉及的三大研究领域分别是:年轻人的心理健康、成年人的负面情绪以及自我意识与管理。最新的研究热点包括负面情绪、人格和心理健康。当前心理学》的三大发展趋势是1)探索青少年和成人的人格心理;2)促进跨学科研究,通过使用多样化的研究方法来研究社会心理问题;3)从以人为本的角度出发,强调个人的情感心理及其与社会现实的互动。
{"title":"Identifying Research Hotspots and Future Development Trends in Current Psychology: A Bibliometric Analysis of the Past Decade's Publications","authors":"Shen Liu, Yan Yang","doi":"arxiv-2407.13495","DOIUrl":"https://doi.org/arxiv-2407.13495","url":null,"abstract":"By conducting a bibliometric analysis on 4,869 publications in Current\u0000Psychology from 2013 to 2022, this paper examined the annual publications and\u0000annual citations, as well as the leading institutions, countries, and keywords.\u0000CiteSpace, VOSviewer and SCImago Graphica were utilized for visualization\u0000analysis. On one hand, this paper analyzed the academic influence of Current\u0000Psychology over the past decade. On the other hand, it explored the research\u0000hotspots and future development trends within the field of international\u0000psychology. The results revealed that the three main research areas covered in\u0000the publications of Current Psychology were: the psychological well-being of\u0000young people, the negative emotions of adults, and self-awareness and\u0000management. The latest research hotspots highlighted in the journal include\u0000negative emotions, personality, and mental health. The three main development\u0000trends of Current Psychology are: 1) exploring the personality psychology of\u0000both adolescents and adults, 2) promoting the interdisciplinary research to\u0000study social psychological issues through the use of diversified research\u0000methods, and 3) emphasizing the emotional psychology of individuals and their\u0000interaction with social reality, from a people-oriented perspective.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141744934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Future of Data Science Education 数据科学教育的未来
Pub Date : 2024-07-16 DOI: arxiv-2407.11824
Brian Wright, Peter Alonzi, Ali Riveria
The definition of Data Science is a hotly debated topic. For many, thedefinition is a simple shortcut to Artificial Intelligence or Machine Learning.However, there is far more depth and nuance to the field of Data Science than asimple shortcut can provide. The School of Data Science at the University ofVirginia has developed a novel model for the definition of Data Science. Thismodel is based on identifying a unified understanding of the data work doneacross all areas of Data Science. It represents a generational leap forward inhow we understand and teach Data Science. In this paper we will present thecore features of the model and explain how it unifies various concepts goingfar beyond the analytics component of AI. From this foundation we will presentour Undergraduate Major curriculum in Data Science and demonstrate how itprepares students to be well-rounded Data Science team members and leaders. Thepaper will conclude with an in-depth overview of the Foundations of DataScience course designed to introduce students to the field while alsoimplementing proven STEM oriented pedagogical methods. These include, forexample, specifications grading, active learning lectures, guest lectures fromindustry experts and weekly gamification labs.
数据科学的定义是一个备受争议的话题。然而,数据科学领域的深度和细微差别远非简单的捷径所能比拟。弗吉尼亚大学数据科学学院为数据科学的定义开发了一个新颖的模型。该模型基于对数据科学所有领域数据工作的统一理解。它代表了我们在如何理解和教授数据科学方面的一次飞跃。在本文中,我们将介绍该模型的核心特征,并解释它是如何统一各种概念,远远超出人工智能的分析部分。在此基础上,我们将介绍我们的数据科学本科专业课程,并展示它是如何培养学生成为全面的数据科学团队成员和领导者的。本文最后将深入概述数据科学基础课程,该课程旨在向学生介绍该领域,同时还采用了经过验证的、以 STEM 为导向的教学方法。例如,这些方法包括规范评分、主动学习讲座、行业专家客座讲座和每周游戏化实验。
{"title":"The Future of Data Science Education","authors":"Brian Wright, Peter Alonzi, Ali Riveria","doi":"arxiv-2407.11824","DOIUrl":"https://doi.org/arxiv-2407.11824","url":null,"abstract":"The definition of Data Science is a hotly debated topic. For many, the\u0000definition is a simple shortcut to Artificial Intelligence or Machine Learning.\u0000However, there is far more depth and nuance to the field of Data Science than a\u0000simple shortcut can provide. The School of Data Science at the University of\u0000Virginia has developed a novel model for the definition of Data Science. This\u0000model is based on identifying a unified understanding of the data work done\u0000across all areas of Data Science. It represents a generational leap forward in\u0000how we understand and teach Data Science. In this paper we will present the\u0000core features of the model and explain how it unifies various concepts going\u0000far beyond the analytics component of AI. From this foundation we will present\u0000our Undergraduate Major curriculum in Data Science and demonstrate how it\u0000prepares students to be well-rounded Data Science team members and leaders. The\u0000paper will conclude with an in-depth overview of the Foundations of Data\u0000Science course designed to introduce students to the field while also\u0000implementing proven STEM oriented pedagogical methods. These include, for\u0000example, specifications grading, active learning lectures, guest lectures from\u0000industry experts and weekly gamification labs.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141721117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensemble Transport Filter via Optimized Maximum Mean Discrepancy 通过优化最大均值差异实现集合传输滤波器
Pub Date : 2024-07-16 DOI: arxiv-2407.11518
Dengfei Zeng, Lijian Jiang
In this paper, we present a new ensemble-based filter method byreconstructing the analysis step of the particle filter through a transportmap, which directly transports prior particles to posterior particles. Thetransport map is constructed through an optimization problem described by theMaximum Mean Discrepancy loss function, which matches the expectationinformation of the approximated posterior and reference posterior. The proposedmethod inherits the accurate estimation of the posterior distribution fromparticle filtering. To improve the robustness of Maximum Mean Discrepancy, avariance penalty term is used to guide the optimization. It prioritizesminimizing the discrepancy between the expectations of highly informativestatistics for the approximated and reference posteriors. The penalty termsignificantly enhances the robustness of the proposed method and leads to abetter approximation of the posterior. A few numerical examples are presentedto illustrate the advantage of the proposed method over the ensemble Kalmanfilter.
在本文中,我们提出了一种新的基于集合的滤波方法,它通过一个传输图(transportmap)来重新构建粒子滤波的分析步骤,直接将先验粒子传输到后验粒子。传输图是通过最大均差损失函数(Maximum Mean Discrepancy loss function)描述的优化问题构建的,它匹配了近似后验和参考后验的期望信息。所提出的方法继承了粒子滤波法对后验分布的精确估计。为了提高最大均差法的鲁棒性,使用了方差惩罚项来指导优化。它优先最小化近似后验和参考后验的高信息量统计期望之间的差异。惩罚项显著增强了所提方法的鲁棒性,并使后验的近似度更高。本文列举了几个数值示例来说明所提方法相对于集合卡尔曼滤波器的优势。
{"title":"Ensemble Transport Filter via Optimized Maximum Mean Discrepancy","authors":"Dengfei Zeng, Lijian Jiang","doi":"arxiv-2407.11518","DOIUrl":"https://doi.org/arxiv-2407.11518","url":null,"abstract":"In this paper, we present a new ensemble-based filter method by\u0000reconstructing the analysis step of the particle filter through a transport\u0000map, which directly transports prior particles to posterior particles. The\u0000transport map is constructed through an optimization problem described by the\u0000Maximum Mean Discrepancy loss function, which matches the expectation\u0000information of the approximated posterior and reference posterior. The proposed\u0000method inherits the accurate estimation of the posterior distribution from\u0000particle filtering. To improve the robustness of Maximum Mean Discrepancy, a\u0000variance penalty term is used to guide the optimization. It prioritizes\u0000minimizing the discrepancy between the expectations of highly informative\u0000statistics for the approximated and reference posteriors. The penalty term\u0000significantly enhances the robustness of the proposed method and leads to a\u0000better approximation of the posterior. A few numerical examples are presented\u0000to illustrate the advantage of the proposed method over the ensemble Kalman\u0000filter.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141721121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alternative proof for the bias of the hot hand statistic of streak length one 关于连胜长度为 1 的热门统计偏差的其他证明
Pub Date : 2024-07-15 DOI: arxiv-2407.10577
Maximilian Janisch
For a sequence of $n$ random variables taking values $0$ or $1$, the hot handstatistic of streak length $k$ counts what fraction of the streaks of length$k$, that is, $k$ consecutive variables taking the value $1$, among the $n$variables are followed by another $1$. Since this statistic does not use theexpected value of how many streaks of length $k$ are observed, but instead usesthe realization of the number of streaks present in the data, it may be abiased estimator of the conditional probability of a fixed random variabletaking value $1$ if it is preceded by a streak of length $k$, as was firststudied and observed explicitly in [Miller and Sanjurjo, 2018]. In this shortnote, we suggest an alternative proof for an explicit formula of theexpectation of the hot hand statistic for the case of streak length one. Thisformula was obtained through a different argument in [Miller and Sanjurjo,2018] and [Rinott and Bar-Hillel, 2015].
对于取值为 $0$ 或 $1$的 $n$ 随机变量序列,长度为 $k$ 的条纹长度热手统计量(hot handstatistic of streak length $k$)计算的是在 $n$ 变量中,长度为 $k$ 的条纹(即取值为 $1$的 $k$ 连续变量)中,有多少个是在另一个 $1$ 变量之后出现的。由于该统计量并不使用观察到的长度为$k$的条纹数量的预期值,而是使用数据中存在的条纹数量的实现值,因此它可能是固定随机变量取值$1$的条件概率的无偏估计值,如果它前面有长度为$k$的条纹,这在[Miller and Sanjurjo, 2018]中得到了首次研究和明确观察。在本短文中,我们提出了另一种证明方法,即在条纹长度为 1 的情况下,热手统计量期望值的明确公式。这个公式在 [Miller and Sanjurjo, 2018] 和 [Rinott and Bar-Hillel, 2015] 中通过不同的论证得到。
{"title":"Alternative proof for the bias of the hot hand statistic of streak length one","authors":"Maximilian Janisch","doi":"arxiv-2407.10577","DOIUrl":"https://doi.org/arxiv-2407.10577","url":null,"abstract":"For a sequence of $n$ random variables taking values $0$ or $1$, the hot hand\u0000statistic of streak length $k$ counts what fraction of the streaks of length\u0000$k$, that is, $k$ consecutive variables taking the value $1$, among the $n$\u0000variables are followed by another $1$. Since this statistic does not use the\u0000expected value of how many streaks of length $k$ are observed, but instead uses\u0000the realization of the number of streaks present in the data, it may be a\u0000biased estimator of the conditional probability of a fixed random variable\u0000taking value $1$ if it is preceded by a streak of length $k$, as was first\u0000studied and observed explicitly in [Miller and Sanjurjo, 2018]. In this short\u0000note, we suggest an alternative proof for an explicit formula of the\u0000expectation of the hot hand statistic for the case of streak length one. This\u0000formula was obtained through a different argument in [Miller and Sanjurjo,\u00002018] and [Rinott and Bar-Hillel, 2015].","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141721118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward a Complete Criterion for Value of Information in Insoluble Decision Problems 为不可解决策问题中的信息价值制定完整标准
Pub Date : 2024-07-13 DOI: arxiv-2407.09883
Ryan Carey, Sanghack Lee, Robin J. Evans
In a decision problem, observations are said to be material if they must betaken into account to perform optimally. Decision problems have an underlying(graphical) causal structure, which may sometimes be used to evaluate certainobservations as immaterial. For soluble graphs - ones where important pastobservations are remembered - there is a complete graphical criterion; one thatrules out materiality whenever this can be done on the basis of the graphicalstructure alone. In this work, we analyse a proposed criterion for insolublegraphs. In particular, we prove that some of the conditions used to proveimmateriality are necessary; when they are not satisfied, materiality ispossible. We discuss possible avenues and obstacles to proving necessity of theremaining conditions.
在决策问题中,如果必须将观测结果考虑在内才能达到最佳效果,那么这些观测结果就是重要的。决策问题有一个基本的(图形)因果结构,有时可用于评估某些观测值是否重要。对于可溶性图形--重要的过去观察结果被记住的图形--来说,有一个完整的图形标准;只要仅根据图形结构就能排除重要性,那么这个标准就是可溶性图形标准。在这项工作中,我们分析了所提出的关于非实质性电报的标准。特别是,我们证明了用于证明非实质性的一些条件是必要的;当这些条件不满足时,实质性是可能的。我们讨论了证明剩余条件必要性的可能途径和障碍。
{"title":"Toward a Complete Criterion for Value of Information in Insoluble Decision Problems","authors":"Ryan Carey, Sanghack Lee, Robin J. Evans","doi":"arxiv-2407.09883","DOIUrl":"https://doi.org/arxiv-2407.09883","url":null,"abstract":"In a decision problem, observations are said to be material if they must be\u0000taken into account to perform optimally. Decision problems have an underlying\u0000(graphical) causal structure, which may sometimes be used to evaluate certain\u0000observations as immaterial. For soluble graphs - ones where important past\u0000observations are remembered - there is a complete graphical criterion; one that\u0000rules out materiality whenever this can be done on the basis of the graphical\u0000structure alone. In this work, we analyse a proposed criterion for insoluble\u0000graphs. In particular, we prove that some of the conditions used to prove\u0000immateriality are necessary; when they are not satisfied, materiality is\u0000possible. We discuss possible avenues and obstacles to proving necessity of the\u0000remaining conditions.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141721173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
More than Formulas -- Integrity, Communication, Computing and Reproducibility in Statistics Education 不仅仅是公式 -- 统计教育中的诚信、交流、计算和可复制性
Pub Date : 2024-07-11 DOI: arxiv-2407.08835
Eva Furrer, Annina Cincera, Reinhard Furrer
This paper introduces a novel course design in the Master Program inBiostatistics at the University of Zurich that integrates computing skills,effective communication, reproducibility, and scientific integrity within onecourse. Utilizing a flipped classroom model, the course aims to equip studentswith the necessary competencies to handle real-world data analysis challengesand effective statistical practice in general. The curriculum includespractical tools such as version control with Git, dynamic reporting, unittesting and containerization to foster reproducibility, and integrity instatistical practice. Feedback gathered from both staff and studentspost-implementation indicates that the course significantly enhances studentreadiness for professional and academic environments, demonstrating theeffectiveness of this educational approach.
本文介绍了苏黎世大学生物统计学硕士课程中的一种新颖课程设计,它将计算技能、有效沟通、可重复性和科学诚信整合在一门课程中。该课程采用翻转课堂模式,旨在使学生具备必要的能力,以应对现实世界中的数据分析挑战和有效的统计实践。课程包括实用工具,如使用 Git 进行版本控制、动态报告、统一测试和容器化,以促进统计实践的可重复性和完整性。实施后从教职员工和学生那里收集到的反馈表明,该课程大大提高了学生在专业和学术环境中的准备程度,证明了这种教育方法的有效性。
{"title":"More than Formulas -- Integrity, Communication, Computing and Reproducibility in Statistics Education","authors":"Eva Furrer, Annina Cincera, Reinhard Furrer","doi":"arxiv-2407.08835","DOIUrl":"https://doi.org/arxiv-2407.08835","url":null,"abstract":"This paper introduces a novel course design in the Master Program in\u0000Biostatistics at the University of Zurich that integrates computing skills,\u0000effective communication, reproducibility, and scientific integrity within one\u0000course. Utilizing a flipped classroom model, the course aims to equip students\u0000with the necessary competencies to handle real-world data analysis challenges\u0000and effective statistical practice in general. The curriculum includes\u0000practical tools such as version control with Git, dynamic reporting, unit\u0000testing and containerization to foster reproducibility, and integrity in\u0000statistical practice. Feedback gathered from both staff and students\u0000post-implementation indicates that the course significantly enhances student\u0000readiness for professional and academic environments, demonstrating the\u0000effectiveness of this educational approach.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141721119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Need for a Recurring Large-Scale Benchmarking Survey to Continually Evaluate Sampling Methods and Administration Modes: Lessons from the 2022 Collaborative Midterm Survey 需要定期开展大规模基准调查,以持续评估抽样方法和管理模式:2022 年合作中期调查的经验教训
Pub Date : 2024-07-08 DOI: arxiv-2407.06090
Peter K. Enns, Colleen L. Barry, James N. Druckman, Sergio Garcia-Rios, David C. Wilson, Jonathon P. Schuldt
As survey methods adapt to technological and societal changes, a growing bodyof research seeks to understand the tradeoffs associated with various samplingmethods and administration modes. We show how the NSF-funded 2022 CollaborativeMidterm Survey (CMS) can be used as a dynamic and transparent framework forevaluating which sampling approaches - or combination of approaches - are bestsuited for various research goals. The CMS is ideally suited for this purposebecause it includes almost 20,000 respondents interviewed using twoadministration modes (phone and online) and data drawn from random digitdialing, random address-based sampling, a probability-based panel, twononprobability panels, and two nonprobability marketplaces. The analysisconsiders three types of population benchmarks (election data, administrativerecords, and large government surveys) and focuses on the national-levelestimates as well as oversamples in three states (California, Florida, andWisconsin). In addition to documenting how each of the survey strategiesperformed, we develop a strategy to assess how different combinations ofapproaches compare to different population benchmarks in order to guideresearchers combining sampling methods and sources. We conclude by providingspecific recommendations to public opinion and election survey researchers anddemonstrating how our approach could be applied to a large government surveyconducted at regular intervals to provide ongoing guidance to researchers,government, businesses, and nonprofits regarding the most appropriate surveysampling and administration methods.
随着调查方法适应技术和社会的变化,越来越多的研究试图了解与各种抽样方法和管理模式相关的权衡。我们展示了如何将国家科学基金会资助的 2022 年中期合作调查 (CMS) 作为一个动态、透明的框架,评估哪种抽样方法或方法组合最适合各种研究目标。CMS 非常适合这一目的,因为它包括使用两种管理模式(电话和在线)访问的近 20,000 名受访者,以及从随机数字拨号、基于地址的随机抽样、基于概率的面板、两个概率面板和两个非概率市场抽取的数据。分析考虑了三种类型的人口基准(选举数据、行政记录和大型政府调查),重点关注全国范围内的估计值以及三个州(加利福尼亚州、佛罗里达州和威斯康星州)的超样本。除了记录每种调查策略的表现外,我们还制定了一种策略来评估不同方法组合与不同人口基准的比较情况,以便为研究人员结合抽样方法和来源提供指导。最后,我们向民意调查和选举调查研究人员提出了具体建议,并展示了如何将我们的方法应用于定期进行的大型政府调查,从而为研究人员、政府、企业和非营利组织提供有关最合适的调查抽样和管理方法的持续指导。
{"title":"The Need for a Recurring Large-Scale Benchmarking Survey to Continually Evaluate Sampling Methods and Administration Modes: Lessons from the 2022 Collaborative Midterm Survey","authors":"Peter K. Enns, Colleen L. Barry, James N. Druckman, Sergio Garcia-Rios, David C. Wilson, Jonathon P. Schuldt","doi":"arxiv-2407.06090","DOIUrl":"https://doi.org/arxiv-2407.06090","url":null,"abstract":"As survey methods adapt to technological and societal changes, a growing body\u0000of research seeks to understand the tradeoffs associated with various sampling\u0000methods and administration modes. We show how the NSF-funded 2022 Collaborative\u0000Midterm Survey (CMS) can be used as a dynamic and transparent framework for\u0000evaluating which sampling approaches - or combination of approaches - are best\u0000suited for various research goals. The CMS is ideally suited for this purpose\u0000because it includes almost 20,000 respondents interviewed using two\u0000administration modes (phone and online) and data drawn from random digit\u0000dialing, random address-based sampling, a probability-based panel, two\u0000nonprobability panels, and two nonprobability marketplaces. The analysis\u0000considers three types of population benchmarks (election data, administrative\u0000records, and large government surveys) and focuses on the national-level\u0000estimates as well as oversamples in three states (California, Florida, and\u0000Wisconsin). In addition to documenting how each of the survey strategies\u0000performed, we develop a strategy to assess how different combinations of\u0000approaches compare to different population benchmarks in order to guide\u0000researchers combining sampling methods and sources. We conclude by providing\u0000specific recommendations to public opinion and election survey researchers and\u0000demonstrating how our approach could be applied to a large government survey\u0000conducted at regular intervals to provide ongoing guidance to researchers,\u0000government, businesses, and nonprofits regarding the most appropriate survey\u0000sampling and administration methods.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141577853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reducing Total Trip Time and Vehicle Emission through Park-and-Ride -- methods and case-study 通过停车换乘减少总行程时间和车辆排放--方法和案例研究
Pub Date : 2024-07-08 DOI: arxiv-2407.05572
Ayane Nakamura, Fabiana Ferracina, Naoki Sakata, Takahiro Noguchi, Hiroyasu Ando
Serious traffic congestion and emission by excessive usage of private carsare crucial issues in our modern society. As one solution for these, a conceptof Park-and-Ride (PnR) where people stop their private cars (i.e.single-occupancy vehicles) at stations and ride on public vehicles (i.e. masstransportation) are receiving wide attention recently. In this paper, wepropose a comprehensive mathematical model which can evaluate waiting times andtraveling times of customers, and the total emission of vehicles for varioususage ratio of PnR and operation policies of public transportation. Using asystem of queues integrated with an emissions model we perform a case-study ofTsukuba city, in Japan. We indicate an intriguing trade-off between the waitingtime of customers for the PnR and the long traveling time due to the trafficcongestion (leading to high emissions) caused by private cars depending on theusage ratio through some numerical experiments. Moreover, we study the totalcost to society caused by total trip times and pollution, in which the decisionvariables are the capacities and frequencies of the public transportation forthe PnR system. Our numerical results showed a significant reduction in thetotal social cost under the optimal transit policy for the current high usagerate of single-occupancy vehicles. Furthermore, we show that further reductionin the total social cost can be revealed by considering the reduction on theuse of private cars compared to the current state implying the socialimportance of promoting car-free movement.
过度使用私家车造成的严重交通拥堵和废气排放是现代社会的关键问题。作为解决这些问题的方法之一,人们把私家车(即单人车)停在车站,乘坐公共车辆(即公共交通)的停车换乘(PnR)概念近来受到广泛关注。本文提出了一个综合数学模型,该模型可以评估不同 PnR 使用比例和公共交通运营政策下乘客的等待时间和旅行时间,以及车辆的总排放量。我们利用队列系统和排放模型对日本筑波市进行了案例研究。通过一些数值实验,我们发现在乘客等待 PnR 的时间和因私家车造成的交通拥堵(导致高排放)而导致的长时间行驶之间,存在着一种有趣的权衡。此外,我们还研究了总行程时间和污染造成的社会总成本,其中的决策变量是 PnR 系统的公共交通容量和频率。我们的数值结果表明,在目前单人汽车使用率较高的情况下,最优公交政策能显著降低社会总成本。此外,我们还发现,与目前的状况相比,如果考虑减少私家车的使用,社会总成本还能进一步降低,这意味着促进无车出行具有重要的社会意义。
{"title":"Reducing Total Trip Time and Vehicle Emission through Park-and-Ride -- methods and case-study","authors":"Ayane Nakamura, Fabiana Ferracina, Naoki Sakata, Takahiro Noguchi, Hiroyasu Ando","doi":"arxiv-2407.05572","DOIUrl":"https://doi.org/arxiv-2407.05572","url":null,"abstract":"Serious traffic congestion and emission by excessive usage of private cars\u0000are crucial issues in our modern society. As one solution for these, a concept\u0000of Park-and-Ride (PnR) where people stop their private cars (i.e.\u0000single-occupancy vehicles) at stations and ride on public vehicles (i.e. mass\u0000transportation) are receiving wide attention recently. In this paper, we\u0000propose a comprehensive mathematical model which can evaluate waiting times and\u0000traveling times of customers, and the total emission of vehicles for various\u0000usage ratio of PnR and operation policies of public transportation. Using a\u0000system of queues integrated with an emissions model we perform a case-study of\u0000Tsukuba city, in Japan. We indicate an intriguing trade-off between the waiting\u0000time of customers for the PnR and the long traveling time due to the traffic\u0000congestion (leading to high emissions) caused by private cars depending on the\u0000usage ratio through some numerical experiments. Moreover, we study the total\u0000cost to society caused by total trip times and pollution, in which the decision\u0000variables are the capacities and frequencies of the public transportation for\u0000the PnR system. Our numerical results showed a significant reduction in the\u0000total social cost under the optimal transit policy for the current high usage\u0000rate of single-occupancy vehicles. Furthermore, we show that further reduction\u0000in the total social cost can be revealed by considering the reduction on the\u0000use of private cars compared to the current state implying the social\u0000importance of promoting car-free movement.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141575205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy Social Network Analysis: Theory and Application in a University Department's Collaboration Network 模糊社会网络分析:大学院系合作网络的理论与应用
Pub Date : 2024-07-02 DOI: arxiv-2407.02401
Annamaria Porreca, Fabrizio Maturo, Viviana Ventre
Social network analysis (SNA) helps us understand the relationships andinteractions between individuals, groups, organisations, or other socialentities. In SNA, ties are generally binary or weighted based on theirstrength. Nonetheless, when actors are individuals, the relationships betweenactors are often imprecise and identifying them with simple scalars leads toinformation loss. Social relationships are often vague in real life. Despitemany classical social network techniques contemplate the use of weighted links,these approaches do not align with the original philosophy of fuzzy logic,which instead aims to preserve the vagueness inherent in human language andreal life. Dealing with imprecise ties and introducing fuzziness in thedefinition of relationships requires an extension of social network analysis tofuzzy numbers instead of crisp values. The mathematical formalisation for thisgeneralisation needs to extend classical centrality indices and operations tofuzzy numbers. For this reason, this paper proposes a generalisation of theso-called Fuzzy Social Network Analysis (FSNA) to the context of impreciserelationships among actors. The article shows the theory and application ofreal data collected through a fascinating mouse tracking technique to study thefuzzy relationships in a collaboration network among the members of aUniversity department.
社会网络分析(SNA)有助于我们了解个人、团体、组织或其他社会实体之间的关系和互动。在 SNA 中,联系通常是二元的,或根据其强度加权。然而,当行动者是个人时,行动者之间的关系往往是不精确的,用简单的标量来识别会导致信息丢失。在现实生活中,社会关系往往是模糊的。尽管许多经典的社会网络技术都考虑使用加权链接,但这些方法并不符合模糊逻辑的最初理念,而模糊逻辑的目标是保留人类语言和现实生活中固有的模糊性。要处理不精确的联系并在关系定义中引入模糊性,就需要将社会网络分析扩展到模糊数而不是清晰值。这种扩展的数学形式化需要将经典的中心度指数和运算扩展到模糊数。为此,本文提出了将所谓的模糊社会网络分析(FSNA)推广到行动者之间不精确关系的环境中。文章展示了通过引人入胜的鼠标跟踪技术收集到的真实数据的理论和应用,以研究大学某系成员之间合作网络中的模糊关系。
{"title":"Fuzzy Social Network Analysis: Theory and Application in a University Department's Collaboration Network","authors":"Annamaria Porreca, Fabrizio Maturo, Viviana Ventre","doi":"arxiv-2407.02401","DOIUrl":"https://doi.org/arxiv-2407.02401","url":null,"abstract":"Social network analysis (SNA) helps us understand the relationships and\u0000interactions between individuals, groups, organisations, or other social\u0000entities. In SNA, ties are generally binary or weighted based on their\u0000strength. Nonetheless, when actors are individuals, the relationships between\u0000actors are often imprecise and identifying them with simple scalars leads to\u0000information loss. Social relationships are often vague in real life. Despite\u0000many classical social network techniques contemplate the use of weighted links,\u0000these approaches do not align with the original philosophy of fuzzy logic,\u0000which instead aims to preserve the vagueness inherent in human language and\u0000real life. Dealing with imprecise ties and introducing fuzziness in the\u0000definition of relationships requires an extension of social network analysis to\u0000fuzzy numbers instead of crisp values. The mathematical formalisation for this\u0000generalisation needs to extend classical centrality indices and operations to\u0000fuzzy numbers. For this reason, this paper proposes a generalisation of the\u0000so-called Fuzzy Social Network Analysis (FSNA) to the context of imprecise\u0000relationships among actors. The article shows the theory and application of\u0000real data collected through a fascinating mouse tracking technique to study the\u0000fuzzy relationships in a collaboration network among the members of a\u0000University department.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141513511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - STAT - Other Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1