首页 > 最新文献

AStA Wirtschafts- und Sozialstatistisches Archiv最新文献

英文 中文
Data Observer—a guide to data that can help to inform evidence-based policymaking 数据观察员--有助于循证决策的数据指南
Pub Date : 2024-06-24 DOI: 10.1007/s11943-024-00341-5
Joachim Wagner

For many attempts to inform evidence-based policymaking (or policy-makers in general) researchers have to rely on already available (instead of newly collected) data. These data have to be reliable, accessible (at best, without high hurdles, and with low or no fees to be paid) and findable. One way that helps to find suitable data that are easily accessible (and hopefully reliable) is to look at the contributions published in the Data Observer series described in this paper.

在为循证决策(或一般决策者)提供信息的许多尝试中,研究人员必须依靠已有的(而不是新收集的)数据。这些数据必须可靠、可获取(最多是没有高门槛、低费用或无费用)、可查找。找到易于获取(希望可靠)的合适数据的一个方法是查看本文所述的《数据观察家》系列所发表的文章。
{"title":"Data Observer—a guide to data that can help to inform evidence-based policymaking","authors":"Joachim Wagner","doi":"10.1007/s11943-024-00341-5","DOIUrl":"10.1007/s11943-024-00341-5","url":null,"abstract":"<div><p>For many attempts to inform evidence-based policymaking (or policy-makers in general) researchers have to rely on already available (instead of newly collected) data. These data have to be reliable, accessible (at best, without high hurdles, and with low or no fees to be paid) and findable. One way that helps to find suitable data that are easily accessible (and hopefully reliable) is to look at the contributions published in the <i>Data Observer</i> series described in this paper.</p></div>","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"18 2","pages":"279 - 287"},"PeriodicalIF":0.0,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142451138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flat rent price prediction in Berlin with web scraping 利用网络搜索预测柏林公寓租金价格
Pub Date : 2024-06-24 DOI: 10.1007/s11943-024-00340-6
Camilo Meyberg, Ulrich Rendtel, Holger Leerhoff

Internet data pose a challenge to the traditional system of official statistics, which relies on more conventional sources such as surveys and registers, not readily adaptable to rapid changes. Expanding this system to include internet data is currently at an experimental stage, exploring these sources’ potentials and benefits. This paper describes a project conducted within the ESSnet Trusted Smart Statistics – Web Intelligence Network framework. It investigates the use of online apartment listings to analyze the rental market. We used web scraping to extract information from two online real estate portals for flats in the city of Berlin. Using this data, we developed a model to predict rental prices per square meter based on the accommodation’s features and location within the city. We detected offers which appear in both portals by means of statistical matching and removed duplicate offers. Missing values were treated by multiple imputation. The prediction model is a semi-parametric approach where the postal districts are used to describe the location effect. Comparisons with microcensus results and the local rent index reveal significant differences between the market of online flat offers and the stock of existing flat contracts. Interested readers will find the commented programming code in the internet supplement.

互联网数据对传统的官方统计系统提出了挑战,因为传统的官方统计系统依赖于调查和登记等较传统的来源,不易适应快速的变化。将这一系统扩展到互联网数据目前正处于试验阶段,探索这些来源的潜力和益处。本文介绍了在 ESSnet 可信智能统计--网络智能网络框架内开展的一个项目。该项目研究了如何利用在线公寓列表来分析租赁市场。我们使用网络搜刮技术从柏林市的两个在线房地产门户网站中提取公寓信息。利用这些数据,我们建立了一个模型,根据住房的特点和在城市中的位置来预测每平方米的租金价格。我们通过统计匹配方法检测了两个门户网站中出现的报价,并删除了重复报价。缺失值通过多重估算进行处理。预测模型是一种半参数方法,使用邮区来描述位置效应。通过与微观人口普查结果和当地租金指数进行比较,发现在线公寓报价市场与现有公寓合同存量之间存在显著差异。感兴趣的读者可在互联网增刊中找到注释编程代码。
{"title":"Flat rent price prediction in Berlin with web scraping","authors":"Camilo Meyberg,&nbsp;Ulrich Rendtel,&nbsp;Holger Leerhoff","doi":"10.1007/s11943-024-00340-6","DOIUrl":"10.1007/s11943-024-00340-6","url":null,"abstract":"<div><p>Internet data pose a challenge to the traditional system of official statistics, which relies on more conventional sources such as surveys and registers, not readily adaptable to rapid changes. Expanding this system to include internet data is currently at an experimental stage, exploring these sources’ potentials and benefits. This paper describes a project conducted within the ESSnet <i>Trusted Smart Statistics – Web Intelligence Network</i> framework. It investigates the use of online apartment listings to analyze the rental market. We used web scraping to extract information from two online real estate portals for flats in the city of Berlin. Using this data, we developed a model to predict rental prices per square meter based on the accommodation’s features and location within the city. We detected offers which appear in both portals by means of statistical matching and removed duplicate offers. Missing values were treated by multiple imputation. The prediction model is a semi-parametric approach where the postal districts are used to describe the location effect. Comparisons with microcensus results and the local rent index reveal significant differences between the market of online flat offers and the stock of existing flat contracts. Interested readers will find the commented programming code in the internet supplement.</p></div>","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"18 2","pages":"245 - 278"},"PeriodicalIF":0.0,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11943-024-00340-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142451139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vorwort der Herausgeber 编辑前言
Pub Date : 2024-04-17 DOI: 10.1007/s11943-024-00339-z
Markus Zwick, Jan Pablo Burgard
{"title":"Vorwort der Herausgeber","authors":"Markus Zwick,&nbsp;Jan Pablo Burgard","doi":"10.1007/s11943-024-00339-z","DOIUrl":"10.1007/s11943-024-00339-z","url":null,"abstract":"","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"18 1","pages":"1 - 4"},"PeriodicalIF":0.0,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11943-024-00339-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142412142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interview mit Ralf Münnich 采访拉尔夫-明尼希
Pub Date : 2024-03-11 DOI: 10.1007/s11943-024-00337-1
Walter Krämer
{"title":"Interview mit Ralf Münnich","authors":"Walter Krämer","doi":"10.1007/s11943-024-00337-1","DOIUrl":"10.1007/s11943-024-00337-1","url":null,"abstract":"","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"18 1","pages":"117 - 125"},"PeriodicalIF":0.0,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140251306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bürgerbeteiligung in Deutschland – Wer beteiligt sich wofür mit welchen Auswirkungen? 德国的公民参与--谁参与了什么,产生了什么影响?
Pub Date : 2024-03-08 DOI: 10.1007/s11943-024-00336-2
Olaf Hübler

Bürgerbeteiligungen finden sich in nahezu allen Bereichen des öffentlichen Lebens. Häufig sind Unzufriedenheit mit öffentlichen Entscheidungen und Politikverdrossenheit dafür ausschlaggebend, dass es zu einem Engagement der Bürger außerhalb des Berufslebens kommt. Über Auswirkungen und Struktur von Bürgerinitiativen ist wenig bekannt. Empirische Untersuchungen beschränken sich häufig auf Einzelfallanalysen. Eine breitere Datenbasis unter Verwendung von statistisch-ökonometrischen Verfahren ist notwendig, um zu verallgemeinerbaren Aussagen zu gelangen. Welcher Typ Mensch ist bei über das Private hinausgehenden Angelegenheiten aktiv und beteiligt sich an diesen? Inwiefern wird er davon in seiner Einstellung und seinen Verhaltensweisen beeinflusst. Bürgerräte sind ein vergleichsweise neu entwickeltes Instrument zur Bürgerbeteiligung, zu dem aus statistischer Sicht noch eine Reihe an Informationen fehlt. Zufallsgesteuerte Auswahlverfahren sollen dazu beitragen, dass sich Politikempfehlungen und Politikentscheidungen stärker am Bevölkerungswillen orientieren. Welche persönlichen Merkmale sind für Bürgerratsmitglieder typisch? Entspricht die Verteilung dieser Merkmale der in der Gesamtbevölkerung?

Die empirische Untersuchung zeigt, dass übliche demographische Merkmale nur beschränkt die Teilnahme an Bürgerinitiativen erklären können und dass eine wechselseitige Beziehung zur Beteiligung an Bürgerinitiativen besteht. Von zusätzlicher Bedeutung sind Big 5 Charakteristika und Beurteilungen, was als gerecht empfunden wird. Lebenszufriedenheit und Vertrauen in Politiker offenbaren sich bei Personen mit und ohne Erfahrung im Bereich der Bürgerinitiativen unterschiedlich. Insgesamt ist die Bedeutung von Bürgerbeteiligung geringer einzuschätzen als die anderer altruistisch orientierter Aktivitäten.

公民参与几乎存在于公共生活的所有领域。对公共决策的不满和对政治的失望往往是公民在职业生活之外参与其中的主要原因。人们对公民倡议的效果和结构知之甚少。实证研究往往局限于个案分析。有必要利用统计-计量经济学方法,建立更广泛的数据基础,以得出具有普遍意义的结论。什么类型的人积极参与私人领域以外的事务?这在多大程度上影响了他们的态度和行为?公民委员会是一种相对较新的公民参与工具,从统计角度看,这方面的信息还很缺乏。随机遴选程序应有助于确保政策建议和决策更加符合民众的意愿。公民大会成员具有哪些典型的个人特征?这些特征的分布是否与整个人口的分布相一致? 实证研究表明,通常的人口特征只能在一定程度上解释公民倡议的参与情况,而公民倡议的参与情况与人口特征之间存在相互关系。更重要的是五大特征和对公平的判断。生活满意度和对政治家的信任度在有公民倡议经验和没有公民倡议经验的人身上有不同的表现。总体而言,公民参与的重要性低于其他利他主义导向的活动。
{"title":"Bürgerbeteiligung in Deutschland – Wer beteiligt sich wofür mit welchen Auswirkungen?","authors":"Olaf Hübler","doi":"10.1007/s11943-024-00336-2","DOIUrl":"10.1007/s11943-024-00336-2","url":null,"abstract":"<p>Bürgerbeteiligungen finden sich in nahezu allen Bereichen des öffentlichen Lebens. Häufig sind Unzufriedenheit mit öffentlichen Entscheidungen und Politikverdrossenheit dafür ausschlaggebend, dass es zu einem Engagement der Bürger außerhalb des Berufslebens kommt. Über Auswirkungen und Struktur von Bürgerinitiativen ist wenig bekannt. Empirische Untersuchungen beschränken sich häufig auf Einzelfallanalysen. Eine breitere Datenbasis unter Verwendung von statistisch-ökonometrischen Verfahren ist notwendig, um zu verallgemeinerbaren Aussagen zu gelangen. Welcher Typ Mensch ist bei über das Private hinausgehenden Angelegenheiten aktiv und beteiligt sich an diesen? Inwiefern wird er davon in seiner Einstellung und seinen Verhaltensweisen beeinflusst. Bürgerräte sind ein vergleichsweise neu entwickeltes Instrument zur Bürgerbeteiligung, zu dem aus statistischer Sicht noch eine Reihe an Informationen fehlt. Zufallsgesteuerte Auswahlverfahren sollen dazu beitragen, dass sich Politikempfehlungen und Politikentscheidungen stärker am Bevölkerungswillen orientieren. Welche persönlichen Merkmale sind für Bürgerratsmitglieder typisch? Entspricht die Verteilung dieser Merkmale der in der Gesamtbevölkerung?</p><p>Die empirische Untersuchung zeigt, dass übliche demographische Merkmale nur beschränkt die Teilnahme an Bürgerinitiativen erklären können und dass eine wechselseitige Beziehung zur Beteiligung an Bürgerinitiativen besteht. Von zusätzlicher Bedeutung sind Big 5 Charakteristika und Beurteilungen, was als gerecht empfunden wird. Lebenszufriedenheit und Vertrauen in Politiker offenbaren sich bei Personen mit und ohne Erfahrung im Bereich der Bürgerinitiativen unterschiedlich. Insgesamt ist die Bedeutung von Bürgerbeteiligung geringer einzuschätzen als die anderer altruistisch orientierter Aktivitäten.</p>","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"18 1","pages":"99 - 116"},"PeriodicalIF":0.0,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11943-024-00336-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142410616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Subventionen für „kleine Jobs“: 为 "小工作 "提供补贴:
Pub Date : 2024-03-04 DOI: 10.1007/s11943-024-00335-3
Regina T. Riphahn

Die Grohmann-Vorlesung des Jahres 2023 beschäftigt sich mit dem Phänomen der „kleinen Jobs“ in Deutschland. Zunächst wird der institutionelle und historische Hintergrund von Minijobs erläutert und die Intensität ihrer Nutzung beschrieben. Anschließend fasst der Text die Inhalte von drei empirischen Studien zusammen. Diese setzen sich mit der Frage auseinander ob (i) Arbeitgeber reguläre Beschäftigung durch Minijobs ersetzen, (ii) Minijobs zur „motherhood penalty“ in Deutschland beitragen und (iii) ob Midijobs Übergänge aus Minijobs in reguläre sozialversicherungspflichtige Beschäftigung erleichtert haben. Die Vorlesung schließt mit einer Betrachtung möglicher Regelungsalternativen für „kleine Jobs“ in Deutschland.

2023 年格罗曼讲座的主题是德国的 "小工作 "现象。首先解释了小型工作的制度和历史背景,并介绍了小型工作的使用强度。然后,文章总结了三项实证研究的内容。这些研究涉及以下问题:(i) 雇主是否以小型工作取代正规就业;(ii) 小型工作是否助长了德国的 "母亲惩罚";(iii) 中型工作是否促进了从小型工作向正规就业的过渡,而正规就业需要缴纳社会保险金。讲座最后探讨了德国对 "小型工作 "可能采取的替代监管措施。
{"title":"Subventionen für „kleine Jobs“:","authors":"Regina T. Riphahn","doi":"10.1007/s11943-024-00335-3","DOIUrl":"10.1007/s11943-024-00335-3","url":null,"abstract":"<p>Die Grohmann-Vorlesung des Jahres 2023 beschäftigt sich mit dem Phänomen der „kleinen Jobs“ in Deutschland. Zunächst wird der institutionelle und historische Hintergrund von Minijobs erläutert und die Intensität ihrer Nutzung beschrieben. Anschließend fasst der Text die Inhalte von drei empirischen Studien zusammen. Diese setzen sich mit der Frage auseinander ob (i) Arbeitgeber reguläre Beschäftigung durch Minijobs ersetzen, (ii) Minijobs zur „motherhood penalty“ in Deutschland beitragen und (iii) ob Midijobs Übergänge aus Minijobs in reguläre sozialversicherungspflichtige Beschäftigung erleichtert haben. Die Vorlesung schließt mit einer Betrachtung möglicher Regelungsalternativen für „kleine Jobs“ in Deutschland.</p>","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"18 1","pages":"5 - 14"},"PeriodicalIF":0.0,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11943-024-00335-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140079905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Establishing a probability sample in a crisis context: the example of Ukrainian refugees in Germany in 2022 在危机背景下建立概率样本:以 2022 年德国境内的乌克兰难民为例
Pub Date : 2024-03-04 DOI: 10.1007/s11943-024-00338-0
Hans Walter Steinhauer, Jean Philippe Décieux, Manuel Siegert, Andreas Ette, Sabine Zinn

Following Russia’s invasion of Ukraine in early 2022, more than one million refugees have arrived in Germany. These Ukrainian refugees differ in many aspects from Germany’s past forced migration experiences and there exists an urgent need for sound data and information for politics, practitioners, and academics. In response, the IAB-BiB/FReDA-BAMF-SOEP study was established to provide high-quality longitudinal data following a register-based probability sample. We detail on an approach for sampling refugees in brief time, making use of two different registers—the German population register and the central register of foreigners—and discuss the quality of the final sample with respect to potential selectivity of participation in the panel. Overall, we demonstrate the benefits and feasibility of establishing register-based samples even in the context of a geopolitical crisis and the necessity of sound data within brief time horizons. We provide guidance that can be followed for similar events in the future.

2022 年初俄罗斯入侵乌克兰之后,已有 100 多万难民抵达德国。这些乌克兰难民在许多方面都不同于德国过去的被迫移民经历,因此迫切需要为政界、从业人员和学术界提供可靠的数据和信息。为此,IAB-BiB/FReDA-BAMF-SOEP 研究应运而生,通过基于登记的概率抽样,提供高质量的纵向数据。我们详细介绍了利用两种不同的登记册--德国人口登记册和外国人中央登记册--对难民进行短时间抽样的方法,并讨论了最终样本的质量以及参与小组的潜在选择性。总之,我们证明了在地缘政治危机的背景下建立基于登记册的样本的好处和可行性,以及在短时间内获得可靠数据的必要性。我们为今后类似的事件提供了可遵循的指导。
{"title":"Establishing a probability sample in a crisis context: the example of Ukrainian refugees in Germany in 2022","authors":"Hans Walter Steinhauer,&nbsp;Jean Philippe Décieux,&nbsp;Manuel Siegert,&nbsp;Andreas Ette,&nbsp;Sabine Zinn","doi":"10.1007/s11943-024-00338-0","DOIUrl":"10.1007/s11943-024-00338-0","url":null,"abstract":"<div><p>Following Russia’s invasion of Ukraine in early 2022, more than one million refugees have arrived in Germany. These Ukrainian refugees differ in many aspects from Germany’s past forced migration experiences and there exists an urgent need for sound data and information for politics, practitioners, and academics. In response, the IAB-BiB/FReDA-BAMF-SOEP study was established to provide high-quality longitudinal data following a register-based probability sample. We detail on an approach for sampling refugees in brief time, making use of two different registers—the German population register and the central register of foreigners—and discuss the quality of the final sample with respect to potential selectivity of participation in the panel. Overall, we demonstrate the benefits and feasibility of establishing register-based samples even in the context of a geopolitical crisis and the necessity of sound data within brief time horizons. We provide guidance that can be followed for similar events in the future.</p></div>","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"18 1","pages":"77 - 97"},"PeriodicalIF":0.0,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11943-024-00338-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142409775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Can machine learning algorithms deliver superior models for rental guides? 机器学习算法能否为租赁指南提供卓越的模型?
Pub Date : 2023-12-12 DOI: 10.1007/s11943-023-00333-x
Oliver Trinkaus, Göran Kauermann

In this paper we discuss the use and potential advantages and disadvantages of machine learning driven models in rental guides. Rental guides are a formal legal instrument in Germany for surveying rents of flats in cities and municipalities, which are today based on regression models or simple contingency tables. We discuss if and how modern and timely methods of machine learning outperform existing and established routines. We make use of data from the Munich rental guide and mainly focus on the predictive power of these models. We discuss the “black-box” character making some of these models difficult to interpret and hence challenging for applications in the rental guide context. Still, it is of interest to see how “black-box” models perform with respect to prediction error. Moreover, we study adversarial effects, i.e. we investigate robustness in the sense how corrupted data influence the performance of the prediction models. With the data at hand we show that models with promising predictive performance suffer from being more vulnerable to corruptions than classic linear models including Ridge or Lasso regularization.

本文将讨论机器学习驱动模型在租金指南中的应用和潜在优缺点。租金指南是德国调查城市和市政单位租金的正式法律文书,目前基于回归模型或简单的或然率表。我们将讨论现代和及时的机器学习方法是否以及如何优于现有的常规方法。我们利用慕尼黑租金指南中的数据,主要关注这些模型的预测能力。我们讨论了 "黑箱 "特性,这种特性使得其中一些模型难以解释,因此在租赁指南中的应用具有挑战性。不过,我们还是有兴趣了解 "黑箱 "模型在预测误差方面的表现。此外,我们还研究了对抗效应,即从损坏数据如何影响预测模型性能的角度来研究鲁棒性。我们利用手头的数据表明,与包括 Ridge 或 Lasso 正则化在内的经典线性模型相比,具有良好预测性能的模型更容易受到干扰的影响。
{"title":"Can machine learning algorithms deliver superior models for rental guides?","authors":"Oliver Trinkaus,&nbsp;Göran Kauermann","doi":"10.1007/s11943-023-00333-x","DOIUrl":"10.1007/s11943-023-00333-x","url":null,"abstract":"<div><p>In this paper we discuss the use and potential advantages and disadvantages of machine learning driven models in rental guides. Rental guides are a formal legal instrument in Germany for surveying rents of flats in cities and municipalities, which are today based on regression models or simple contingency tables. We discuss if and how modern and timely methods of machine learning outperform existing and established routines. We make use of data from the Munich rental guide and mainly focus on the predictive power of these models. We discuss the “black-box” character making some of these models difficult to interpret and hence challenging for applications in the rental guide context. Still, it is of interest to see how “black-box” models perform with respect to prediction error. Moreover, we study adversarial effects, i.e. we investigate robustness in the sense how corrupted data influence the performance of the prediction models. With the data at hand we show that models with promising predictive performance suffer from being more vulnerable to corruptions than classic linear models including Ridge or Lasso regularization.</p></div>","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"17 3-4","pages":"305 - 330"},"PeriodicalIF":0.0,"publicationDate":"2023-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11943-023-00333-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138987242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial issue 3 + 4, 2023 2023 年第 3 期和第 4 期社论
Pub Date : 2023-12-07 DOI: 10.1007/s11943-023-00334-w
Florian Dumpert, Sebastian Wichert, Thomas Augustin, Nina Storfinger
{"title":"Editorial issue 3 + 4, 2023","authors":"Florian Dumpert,&nbsp;Sebastian Wichert,&nbsp;Thomas Augustin,&nbsp;Nina Storfinger","doi":"10.1007/s11943-023-00334-w","DOIUrl":"10.1007/s11943-023-00334-w","url":null,"abstract":"","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"17 3-4","pages":"191 - 194"},"PeriodicalIF":0.0,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11943-023-00334-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138591428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ten propositions on machine learning in official statistics 关于官方统计中机器学习的十项主张
Pub Date : 2023-12-07 DOI: 10.1007/s11943-023-00330-0
Arnout van Delden, Joep Burger, Marco Puts

Machine learning (ML) is increasingly being used in official statistics with a range of different applications. The main focus of ML models is to accurately predict attributes of new, unlabeled cases whereas the focus of classical statistical models is to describe the relations between independent and dependent variables. There is already a lot of experience in the sound use of classical statistical models in official statistics, but for ML models this is still under development. Recent discussions concerning the quality aspects of using ML in official statistics have concentrated on its implications for existing quality frameworks. We are in favor of the use of ML in official statistics, but the main question remains as to what factors need to be considered when using ML models in official statistics. As a means of raising awareness regarding these factors, we pose ten propositions regarding the (sensible) use of ML in official statistics.

机器学习(ML)正越来越多地应用于官方统计中的一系列不同领域。ML 模型的主要重点是准确预测未标记的新案例的属性,而经典统计模型的重点是描述自变量和因变量之间的关系。在官方统计中合理使用经典统计模型方面已经有了很多经验,但对于 ML 模型来说,这仍处于发展阶段。最近有关在官方统计中使用 ML 的质量问题的讨论主要集中在其对现有质量框架的影响上。我们赞成在官方统计中使用 ML,但主要问题仍然是在官方统计中使用 ML 模型时需要考虑哪些因素。为了提高对这些因素的认识,我们提出了关于在官方统计中(合理)使用 ML 的十项主张。
{"title":"Ten propositions on machine learning in official statistics","authors":"Arnout van Delden,&nbsp;Joep Burger,&nbsp;Marco Puts","doi":"10.1007/s11943-023-00330-0","DOIUrl":"10.1007/s11943-023-00330-0","url":null,"abstract":"<div><p>Machine learning (ML) is increasingly being used in official statistics with a range of different applications. The main focus of ML models is to accurately predict attributes of new, unlabeled cases whereas the focus of classical statistical models is to describe the relations between independent and dependent variables. There is already a lot of experience in the sound use of classical statistical models in official statistics, but for ML models this is still under development. Recent discussions concerning the quality aspects of using ML in official statistics have concentrated on its implications for existing quality frameworks. We are in favor of the use of ML in official statistics, but the main question remains as to what factors need to be considered when using ML models in official statistics. As a means of raising awareness regarding these factors, we pose ten propositions regarding the (sensible) use of ML in official statistics.</p></div>","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"17 3-4","pages":"195 - 221"},"PeriodicalIF":0.0,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138590780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AStA Wirtschafts- und Sozialstatistisches Archiv
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1