The following conversation is partly based on an interview that took place in the Hong Kong University of Science and Technology in July 2013.
以下对话部分基于2013年7月在香港科技大学进行的一次采访。
{"title":"A Conversation with Howell Tong","authors":"Kung-Sik Chan, Q. Yao","doi":"10.1214/13-STS464","DOIUrl":"https://doi.org/10.1214/13-STS464","url":null,"abstract":"The following conversation is partly based on an interview that took place in the Hong Kong University of Science and Technology in July 2013.","PeriodicalId":413623,"journal":{"name":"arXiv: Other Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130354776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this paper, we deal with the problem of marginalization over and conditioning on two disjoint subsets of the node set of chain graphs (CGs) with the LWF Markov property. For this purpose, we define the class of chain mixed graphs (CMGs) with three types of edges and, for this class, provide a separation criterion under which the class of CMGs is stable under marginalization and conditioning and contains the class of LWF CGs as its subclass. We provide a method for generating such graphs after marginalization and conditioning for a given CMG or a given LWF CG. We then define and study the class of anterial graphs, which is also stable under marginalization and conditioning and contains LWF CGs, but has a simpler structure than CMGs.
{"title":"Marginalization and Conditioning for LWF Chain Graphs","authors":"Kayvan Sadeghi","doi":"10.1214/16-AOS1451","DOIUrl":"https://doi.org/10.1214/16-AOS1451","url":null,"abstract":"In this paper, we deal with the problem of marginalization over and conditioning on two disjoint subsets of the node set of chain graphs (CGs) with the LWF Markov property. For this purpose, we define the class of chain mixed graphs (CMGs) with three types of edges and, for this class, provide a separation criterion under which the class of CMGs is stable under marginalization and conditioning and contains the class of LWF CGs as its subclass. We provide a method for generating such graphs after marginalization and conditioning for a given CMG or a given LWF CG. We then define and study the class of anterial graphs, which is also stable under marginalization and conditioning and contains LWF CGs, but has a simpler structure than CMGs.","PeriodicalId":413623,"journal":{"name":"arXiv: Other Statistics","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115065650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Benjamin S. Baumer, Mine Çetinkaya-Rundel, Andrew Bray, Linda Loi, N. Horton
Nolan and Temple Lang argue that "the ability to express statistical computations is an essential skill." A key related capacity is the ability to conduct and present data analysis in a way that another person can understand and replicate. The copy-and-paste workflow that is an artifact of antiquated user-interface design makes reproducibility of statistical analysis more difficult, especially as data become increasingly complex and statistical methods become increasingly sophisticated. R Markdown is a new technology that makes creating fully-reproducible statistical analysis simple and painless. It provides a solution suitable not only for cutting edge research, but also for use in an introductory statistics course. We present evidence that R Markdown can be used effectively in introductory statistics courses, and discuss its role in the rapidly-changing world of statistical computation.
{"title":"R Markdown: Integrating A Reproducible Analysis Tool into Introductory Statistics","authors":"Benjamin S. Baumer, Mine Çetinkaya-Rundel, Andrew Bray, Linda Loi, N. Horton","doi":"10.5070/T581020118","DOIUrl":"https://doi.org/10.5070/T581020118","url":null,"abstract":"Nolan and Temple Lang argue that \"the ability to express statistical computations is an essential skill.\" A key related capacity is the ability to conduct and present data analysis in a way that another person can understand and replicate. The copy-and-paste workflow that is an artifact of antiquated user-interface design makes reproducibility of statistical analysis more difficult, especially as data become increasingly complex and statistical methods become increasingly sophisticated. R Markdown is a new technology that makes creating fully-reproducible statistical analysis simple and painless. It provides a solution suitable not only for cutting edge research, but also for use in an introductory statistics course. We present evidence that R Markdown can be used effectively in introductory statistics courses, and discuss its role in the rapidly-changing world of statistical computation.","PeriodicalId":413623,"journal":{"name":"arXiv: Other Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124006384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2012-01-31DOI: 10.1007/978-1-4614-3649-2_23
T. Speed, J. Pitman, J. Rice
{"title":"A Brief History of the Statistics Department of the University of California at Berkeley","authors":"T. Speed, J. Pitman, J. Rice","doi":"10.1007/978-1-4614-3649-2_23","DOIUrl":"https://doi.org/10.1007/978-1-4614-3649-2_23","url":null,"abstract":"","PeriodicalId":413623,"journal":{"name":"arXiv: Other Statistics","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125988223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In a wide class of paired comparisons, especially in the sports games, in which all subjects are divided into several groups, the intragroup comparisons are dense and the intergroup comparisons are sparse. Typical examples include the NFL regular season. Motivated by these situations, we propose group sparsity for paired comparisons and show the consistency and asymptotical normality of the maximum likelihood estimate in the Bradley-Terry model when the number of parameters goes to infinity in this paper. Simulations are carried out to illustrate the group sparsity and asymptotical results.
{"title":"GROUPED SPARSE PAIRED COMPARISONS IN THE BRADLEY-TERRY MODEL","authors":"T. Yan, Jinfeng Xu, Yaning Yang","doi":"10.5705/SS.2010.299","DOIUrl":"https://doi.org/10.5705/SS.2010.299","url":null,"abstract":"In a wide class of paired comparisons, especially in the sports games, in which all subjects are divided into several groups, the intragroup comparisons are dense and the intergroup comparisons are sparse. Typical examples include the NFL regular season. Motivated by these situations, we propose group sparsity for paired comparisons and show the consistency and asymptotical normality of the maximum likelihood estimate in the Bradley-Terry model when the number of parameters goes to infinity in this paper. Simulations are carried out to illustrate the group sparsity and asymptotical results.","PeriodicalId":413623,"journal":{"name":"arXiv: Other Statistics","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125569777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We present scientometric results about world-wide centers of excellence in psychology. Based on Web of Science data, domain-specific excellence can be identified for cities where highly cited papers are published. Data refer to all psychology articles published in 2007 which are documented in the Social Science Citation Index and to their citation frequencies from 2007 to May 2011. Visualized are 214 cities with an article output of at least 50 in 2007. Statistical z tests are used for the evaluation of the degree to which an observed number of top-cited papers (top-10%) for a city differs from the number expected on the basis of randomness in the selection of papers. Map visualizing city ratios on significant differences between observed and expected numbers of highly-cited papers point at excellence centers in cities at the East and West Coast of the United States as well as in Great Britain, Germany, the Netherlands, Ireland, Belgium, Sweden, Finland, Australia, and Taiwan. Furthermore, positive but non-significant differences in favor of high citation rates are documented for some cities in the United States, Great Britain, the Netherlands, the Scandinavian and the German-speaking countries, Belgium, France, Spain, Israel, South Korea, and China. Scientometric results show convincingly that highly-cited psychological research articles come from the Anglo-American countries and some of the non-English European countries in which the number of English-language publications has increased during the last decades.
我们提出了关于世界范围内心理学卓越中心的科学计量结果。基于Web of Science数据,可以确定发表高被引论文的城市的特定领域卓越性。数据指的是社会科学引文索引中收录的2007年发表的所有心理学论文,以及2007年至2011年5月的被引频次。可视化的是214个城市,2007年的文章产出至少为50篇。统计检验用于评估一个城市观察到的最高被引论文数量(前10%)与基于论文选择随机性的预期数量的差异程度。观察到的高被引论文数量和预期高被引论文数量之间显著差异的城市比例可视化地图指向美国东海岸和西海岸以及英国、德国、荷兰、爱尔兰、比利时、瑞典、芬兰、澳大利亚和台湾城市的卓越中心。此外,在美国、英国、荷兰、斯堪的纳维亚和德语国家、比利时、法国、西班牙、以色列、韩国和中国的一些城市中,存在有利于高被引率的积极但不显著的差异。科学计量学的结果令人信服地表明,高引用率的心理学研究文章来自英美国家和一些非英语的欧洲国家,在过去的几十年里,这些国家的英语出版物数量有所增加。
{"title":"Which are the best cities for psychology research worldwide? A map visualizing city ratios of observed and expected numbers of highly-cited papers","authors":"L. Bornmann, L. Leydesdorff, Gunter Krampen","doi":"10.5964/EJOP.V8I4.482","DOIUrl":"https://doi.org/10.5964/EJOP.V8I4.482","url":null,"abstract":"We present scientometric results about world-wide centers of excellence in psychology. Based on Web of Science data, domain-specific excellence can be identified for cities where highly cited papers are published. Data refer to all psychology articles published in 2007 which are documented in the Social Science Citation Index and to their citation frequencies from 2007 to May 2011. Visualized are 214 cities with an article output of at least 50 in 2007. Statistical z tests are used for the evaluation of the degree to which an observed number of top-cited papers (top-10%) for a city differs from the number expected on the basis of randomness in the selection of papers. Map visualizing city ratios on significant differences between observed and expected numbers of highly-cited papers point at excellence centers in cities at the East and West Coast of the United States as well as in Great Britain, Germany, the Netherlands, Ireland, Belgium, Sweden, Finland, Australia, and Taiwan. Furthermore, positive but non-significant differences in favor of high citation rates are documented for some cities in the United States, Great Britain, the Netherlands, the Scandinavian and the German-speaking countries, Belgium, France, Spain, Israel, South Korea, and China. Scientometric results show convincingly that highly-cited psychological research articles come from the Anglo-American countries and some of the non-English European countries in which the number of English-language publications has increased during the last decades.","PeriodicalId":413623,"journal":{"name":"arXiv: Other Statistics","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134555476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Martin Bradbury Wilk was born on December 18, 1922, in Montr'{e}al, Qu'{e}bec, Canada. He completed a B.Eng. degree in Chemical Engineering in 1945 at McGill University and worked as a Research Engineer on the Atomic Energy Project for the National Research Council of Canada from 1945 to 1950. He then went to Iowa State College, where he completed a M.Sc. and a Ph.D. degree in Statistics in 1953 and 1955, respectively. After a one-year post-doc with John Tukey, he became Assistant Director of the Statistical Techniques Research Group at Princeton University in 1956--1957, and then served as Professor and Director of Research in Statistics at Rutgers University from 1959 to 1963. In parallel, he also had a 14-year career at Bell Laboratories, Murray Hill, New Jersey. From 1956 to 1969, he was in turn Member of Technical Staff, Head of the Statistical Models and Methods Research Department, and Statistical Director in Management Sciences Research. He wrote a number of influential papers in statistical methodology during that period, notably testing procedures for normality (the Shapiro--Wilk statistic) and probability plotting techniques for multivariate data. In 1970, Martin moved into higher management levels of the American Telephone and Telegraph (AT&T) Company. He occupied various positions culminating as Assistant Vice-President and Director of Corporate Planning. In 1980, he returned to Canada and became the first professional statistician to serve as Chief Statistician. His accomplishments at Statistics Canada were numerous and contributed to a resurgence of the institution's international standing. He played a crucial role in the reinstatement of the Cabinet-cancelled 1986 Census.
马丁·布拉德伯里·威尔克于1922年12月18日出生在加拿大魁北克省的蒙特利尔。1945年,他在麦吉尔大学(McGill University)获得化学工程学士学位。1945年至1950年,他在加拿大国家研究委员会(National Research Council of Canada)的原子能项目中担任研究工程师。然后他去了爱荷华州立大学,在那里他分别在1953年和1955年获得了统计学硕士学位和博士学位。1956年至1957年,他成为普林斯顿大学统计技术研究小组助理主任,1959年至1963年,他担任罗格斯大学统计学教授兼研究主任。与此同时,他还在新泽西州默里山的贝尔实验室工作了14年。1956年至1969年,他先后担任技术人员、统计模型和方法研究部主任、管理科学研究统计主任。在此期间,他在统计方法学方面写了许多有影响力的论文,特别是正态性的测试程序(夏皮罗-威尔克统计)和多变量数据的概率绘图技术。1970年,马丁进入美国电话电报公司(AT&T)的高级管理层。他曾担任多个职位,最终担任副总裁助理和企业规划总监。1980年,他回到加拿大,成为第一位担任首席统计师的专业统计师。他在加拿大统计局取得了许多成就,并为该机构的国际地位的复苏做出了贡献。他在恢复被内阁取消的1986年人口普查中发挥了关键作用。
{"title":"A Conversation with Martin Bradbury Wilk","authors":"C. Genest, G. Brackstone","doi":"10.1214/08-STS272","DOIUrl":"https://doi.org/10.1214/08-STS272","url":null,"abstract":"Martin Bradbury Wilk was born on December 18, 1922, in Montr'{e}al, Qu'{e}bec, Canada. He completed a B.Eng. degree in Chemical Engineering in 1945 at McGill University and worked as a Research Engineer on the Atomic Energy Project for the National Research Council of Canada from 1945 to 1950. He then went to Iowa State College, where he completed a M.Sc. and a Ph.D. degree in Statistics in 1953 and 1955, respectively. After a one-year post-doc with John Tukey, he became Assistant Director of the Statistical Techniques Research Group at Princeton University in 1956--1957, and then served as Professor and Director of Research in Statistics at Rutgers University from 1959 to 1963. In parallel, he also had a 14-year career at Bell Laboratories, Murray Hill, New Jersey. From 1956 to 1969, he was in turn Member of Technical Staff, Head of the Statistical Models and Methods Research Department, and Statistical Director in Management Sciences Research. He wrote a number of influential papers in statistical methodology during that period, notably testing procedures for normality (the Shapiro--Wilk statistic) and probability plotting techniques for multivariate data. In 1970, Martin moved into higher management levels of the American Telephone and Telegraph (AT&T) Company. He occupied various positions culminating as Assistant Vice-President and Director of Corporate Planning. In 1980, he returned to Canada and became the first professional statistician to serve as Chief Statistician. His accomplishments at Statistics Canada were numerous and contributed to a resurgence of the institution's international standing. He played a crucial role in the reinstatement of the Cabinet-cancelled 1986 Census.","PeriodicalId":413623,"journal":{"name":"arXiv: Other Statistics","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132811528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1900-01-01DOI: 10.1007/978-3-030-43408-3_9
Chad Giusti, Darrick Lee
{"title":"Iterated Integrals and Population Time Series Analysis","authors":"Chad Giusti, Darrick Lee","doi":"10.1007/978-3-030-43408-3_9","DOIUrl":"https://doi.org/10.1007/978-3-030-43408-3_9","url":null,"abstract":"","PeriodicalId":413623,"journal":{"name":"arXiv: Other Statistics","volume":"108 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124295942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}