首页 > 最新文献

Findings (Sydney (N.S.W.)最新文献

英文 中文
Evacuation Decisions during the Great East Japan Earthquake 东日本大地震期间的疏散决策
Pub Date : 2023-05-30 DOI: 10.32866/001c.77365
Jan Dirk Schmöcker, Jun Ji, Fajar Prawira Belgiawan, Nobuhiro Uno
We analyse evacuation decisions with data from a survey among 10,384 survivers of the 2011 Great East Japan earthquake. The decisions of individuals and families to evacuate or stay are influenced by the Tsunami warning system as well as the behaviour of the surrounding population which is modelled as the percentage of persons evacuating from a city. We formulate binary choice models with “field effects” where we try to control for the endogeneity with a 2-stage model approach. Our results quantify the field effect and suggest that with each minute the Tsunami warning arrives later, on average 3% less of the population are evacuating and surviving. We also show the importance of other variables, in particular the preparedness measures such as signage and evacuation drills.
我们用对2011年东日本大地震的10384名幸存者的调查数据来分析疏散决策。个人和家庭撤离或留下的决定受到海啸预警系统以及周围人口行为的影响,周围人口的行为以从一个城市撤离的人口百分比为模型。我们用“场效应”制定二元选择模型,其中我们试图用两阶段模型方法控制内生性。我们的研究结果量化了磁场效应,并表明海啸警报每延迟一分钟,疏散和幸存的人口平均减少3%。我们还展示了其他变量的重要性,特别是准备措施,如标志和疏散演习。
{"title":"Evacuation Decisions during the Great East Japan Earthquake","authors":"Jan Dirk Schmöcker, Jun Ji, Fajar Prawira Belgiawan, Nobuhiro Uno","doi":"10.32866/001c.77365","DOIUrl":"https://doi.org/10.32866/001c.77365","url":null,"abstract":"We analyse evacuation decisions with data from a survey among 10,384 survivers of the 2011 Great East Japan earthquake. The decisions of individuals and families to evacuate or stay are influenced by the Tsunami warning system as well as the behaviour of the surrounding population which is modelled as the percentage of persons evacuating from a city. We formulate binary choice models with “field effects” where we try to control for the endogeneity with a 2-stage model approach. Our results quantify the field effect and suggest that with each minute the Tsunami warning arrives later, on average 3% less of the population are evacuating and surviving. We also show the importance of other variables, in particular the preparedness measures such as signage and evacuation drills.","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":"177 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135643604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Energy Transitions in the Food Sector: The Economic Viability of Low-carbon Technologies in the Swiss Dairy Industry 食品行业的能源转型:瑞士乳制品行业低碳技术的经济可行性
Pub Date : 2023-05-23 DOI: 10.32866/001c.75416
Paul Tautorat, Taha Ramazanoğlu, T. Schmidt, B. Steffen
Swiss dairy products are globally sought after but their production requires relatively large amounts of process heat, often generated from oil and gas. Low-carbon electricity- and biomass-based solutions exist but were often regarded as economically not viable in the past. Therefore, we evaluate the economic viability of low-carbon technologies for the Swiss dairy industry for scenarios of low and high fossil fuel prices in Europe, and its sensitivity to emission cost pathways. Results show a clear cost advantage of heat pumps and biomass boilers going forward, driven particularly by expected future gas prices.
瑞士乳制品在全球范围内备受追捧,但其生产需要相对大量的工艺热量,通常来自石油和天然气。基于低碳电力和生物质的解决方案是存在的,但在过去常常被认为在经济上不可行。因此,我们评估了瑞士乳制品行业低碳技术在欧洲化石燃料价格低和高的情况下的经济可行性,以及其对排放成本途径的敏感性。结果表明,热泵和生物质锅炉在未来具有明显的成本优势,尤其是受未来天然气价格预期的推动。
{"title":"Energy Transitions in the Food Sector: The Economic Viability of Low-carbon Technologies in the Swiss Dairy Industry","authors":"Paul Tautorat, Taha Ramazanoğlu, T. Schmidt, B. Steffen","doi":"10.32866/001c.75416","DOIUrl":"https://doi.org/10.32866/001c.75416","url":null,"abstract":"Swiss dairy products are globally sought after but their production requires relatively large amounts of process heat, often generated from oil and gas. Low-carbon electricity- and biomass-based solutions exist but were often regarded as economically not viable in the past. Therefore, we evaluate the economic viability of low-carbon technologies for the Swiss dairy industry for scenarios of low and high fossil fuel prices in Europe, and its sensitivity to emission cost pathways. Results show a clear cost advantage of heat pumps and biomass boilers going forward, driven particularly by expected future gas prices.","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43648356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What do People want to do instead of Commuting to Work? 人们想做什么而不是通勤上班?
Pub Date : 2023-05-23 DOI: 10.32866/001c.75441
R. Noland, H. Younes, Wenwen Zhang
The COVID-19 pandemic resulted in a sudden shift to working at home. People stopped commuting to their jobs. We fielded two surveys in New Jersey during the pandemic and included questions on what respondents did with time saved from not commuting as well as which activities they wished to see continue after the pandemic subsides. Key results include that a majority of respondents reported spending more time with their family, almost half spent time watching TV or were on the internet, a large share slept later, and many walked more for exercise. We also queried respondents on activities they would like to continue after the pandemic is over, with nearly half desiring to work at home at least some of the time and about a third desiring to commute less. We also present results by gender, finding some differences in time use and preferences.
新冠肺炎大流行导致人们突然转向在家工作。人们不再通勤上班。疫情期间,我们在新泽西州进行了两项调查,其中包括受访者在不通勤的情况下做了什么,以及他们希望在疫情消退后继续进行哪些活动。关键结果包括,大多数受访者报告说,他们花了更多的时间与家人在一起,几乎一半的时间看电视或上网,很大一部分人睡得晚,许多人走路更多锻炼。我们还询问了受访者在疫情结束后希望继续进行的活动,近一半的人希望至少有一段时间在家工作,约三分之一的人希望减少通勤。我们还按性别列出了结果,发现在时间使用和偏好方面存在一些差异。
{"title":"What do People want to do instead of Commuting to Work?","authors":"R. Noland, H. Younes, Wenwen Zhang","doi":"10.32866/001c.75441","DOIUrl":"https://doi.org/10.32866/001c.75441","url":null,"abstract":"The COVID-19 pandemic resulted in a sudden shift to working at home. People stopped commuting to their jobs. We fielded two surveys in New Jersey during the pandemic and included questions on what respondents did with time saved from not commuting as well as which activities they wished to see continue after the pandemic subsides. Key results include that a majority of respondents reported spending more time with their family, almost half spent time watching TV or were on the internet, a large share slept later, and many walked more for exercise. We also queried respondents on activities they would like to continue after the pandemic is over, with nearly half desiring to work at home at least some of the time and about a third desiring to commute less. We also present results by gender, finding some differences in time use and preferences.","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43001294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Can Mobility of Care Be Identified From Transit Fare Card Data? A Case Study In Washington D.C. 能否从公交票价卡数据中识别护理的流动性?华盛顿特区案例研究。
Pub Date : 2023-05-18 DOI: 10.32866/001c.75352
D. Shuman, Awad Abdelhalim, Anson F. Stewart, Kayleigh B Campbell, Mira Patel, Inés Sánchez de Madariaga, Jinhua Zhao
Studies in the literature have found significant differences in travel behavior by gender on public transit that are largely attributable to household and care responsibilities falling disproportionately on women. While the majority of studies have relied on survey and qualitative data to assess “mobility of care”, we propose a novel data-driven workflow utilizing transit fare card transactions, name-based gender inference, and geospatial analysis to identify mobility of care trip making. We find that the share of women travelers trip-chaining in the direct vicinity of mobility of care places of interest is 10% - 15% higher than men.
文献研究发现,在公共交通工具上,不同性别的出行行为存在显著差异,这在很大程度上归因于家庭和护理责任不成比例地落在了女性身上。虽然大多数研究都依赖于调查和定性数据来评估“护理的流动性”,但我们提出了一种新的数据驱动工作流程,利用公交票价卡交易、基于姓名的性别推断和地理空间分析来识别护理出行的流动性。我们发现,女性旅行者在感兴趣的护理场所流动性直接附近的连锁旅行比例比男性高10%-15%。
{"title":"Can Mobility of Care Be Identified From Transit Fare Card Data? A Case Study In Washington D.C.","authors":"D. Shuman, Awad Abdelhalim, Anson F. Stewart, Kayleigh B Campbell, Mira Patel, Inés Sánchez de Madariaga, Jinhua Zhao","doi":"10.32866/001c.75352","DOIUrl":"https://doi.org/10.32866/001c.75352","url":null,"abstract":"Studies in the literature have found significant differences in travel behavior by gender on public transit that are largely attributable to household and care responsibilities falling disproportionately on women. While the majority of studies have relied on survey and qualitative data to assess “mobility of care”, we propose a novel data-driven workflow utilizing transit fare card transactions, name-based gender inference, and geospatial analysis to identify mobility of care trip making. We find that the share of women travelers trip-chaining in the direct vicinity of mobility of care places of interest is 10% - 15% higher than men.","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45528384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How Differential Privacy Will Affect Estimates of Air Pollution Exposure and Disparities in the United States 差异隐私将如何影响对美国空气污染暴露和差异的估计
Pub Date : 2023-05-16 DOI: 10.32866/001c.74975
Madalsa Singh
Census data is crucial to understand energy and environmental justice outcomes such as poor air quality which disproportionately impact people of color in the U.S. Wwith the advent of sophisticated personal datasets and analysis, Census Bureau is considering adding top-down noise (differential privacy) and post-processing 2020 census data to reduce the risk of identification of individual respondents. Using 2010 demonstration census and pollution data, I find that compared to the original census, differentially private (DP) census significantly changes ambient pollution exposure in areas with sparse populations. White Americans have lowest variability, followed by Latinos, Asian, and Black Americans. DP underestimates pollution disparities for SO2 and PM2.5 while overestimates the pollution disparities for PM10.
人口普查数据对于理解能源和环境正义的结果至关重要,例如空气质量差,这对美国有色人种的影响尤为严重。随着复杂的个人数据集和分析的出现,人口普查局正在考虑增加自上而下的噪音(差异隐私),并对2020年人口普查数据进行后处理,以降低识别个人受访者的风险。使用2010年的示范人口普查和污染数据,我发现与最初的人口普查相比,差异私人(DP)人口普查显著改变了人口稀少地区的环境污染暴露。美国白人的变异性最低,其次是拉丁裔、亚裔和黑人。DP低估了SO2和PM2.5的污染差异,而高估了PM10的污染差异。
{"title":"How Differential Privacy Will Affect Estimates of Air Pollution Exposure and Disparities in the United States","authors":"Madalsa Singh","doi":"10.32866/001c.74975","DOIUrl":"https://doi.org/10.32866/001c.74975","url":null,"abstract":"Census data is crucial to understand energy and environmental justice outcomes such as poor air quality which disproportionately impact people of color in the U.S. Wwith the advent of sophisticated personal datasets and analysis, Census Bureau is considering adding top-down noise (differential privacy) and post-processing 2020 census data to reduce the risk of identification of individual respondents. Using 2010 demonstration census and pollution data, I find that compared to the original census, differentially private (DP) census significantly changes ambient pollution exposure in areas with sparse populations. White Americans have lowest variability, followed by Latinos, Asian, and Black Americans. DP underestimates pollution disparities for SO2 and PM2.5 while overestimates the pollution disparities for PM10.","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44087525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How is Intraday Metro Ridership related to Station Centrality in Athens, Greece? 希腊雅典每日地铁客流量与车站中心性的关系如何?
Pub Date : 2023-05-12 DOI: 10.32866/001c.75171
Athanasios Kopsidas, K. Kepaptsoglou
In this study, intraday correlations between station centralities and ridership at stations of the Athens metro system in Greece are explored. An unweighted L-space representation of the physical metro network is developed, and degree, closeness and betweenness are selected as station centrality measures. Hourly smart-card data are used for representing passenger flows. For station classification, principal component analysis and k-means clustering are utilized. The findings suggest that centrality and ridership usually move in opposite directions, morning peak-hour boardings are completely uncorrelated with station centrality, and metro stations can be classified as ‘central destinations’, ‘averagely central origins’, and ‘underutilized peripheral stations’.
在本研究中,探讨了希腊雅典地铁系统车站中心性与车站客流量之间的日内相关性。建立了物理地铁网络的非加权l空间表示,并选择程度、紧密度和间隔度作为车站中心性度量。每小时的智能卡数据被用来表示客流。对于站点分类,使用主成分分析和k-means聚类。研究结果表明,中心性和客流量通常是相反的方向,早高峰上车人数与车站中心性完全不相关,地铁站可以分为“中心目的地”、“平均中心起点”和“未充分利用的外围站”。
{"title":"How is Intraday Metro Ridership related to Station Centrality in Athens, Greece?","authors":"Athanasios Kopsidas, K. Kepaptsoglou","doi":"10.32866/001c.75171","DOIUrl":"https://doi.org/10.32866/001c.75171","url":null,"abstract":"In this study, intraday correlations between station centralities and ridership at stations of the Athens metro system in Greece are explored. An unweighted L-space representation of the physical metro network is developed, and degree, closeness and betweenness are selected as station centrality measures. Hourly smart-card data are used for representing passenger flows. For station classification, principal component analysis and k-means clustering are utilized. The findings suggest that centrality and ridership usually move in opposite directions, morning peak-hour boardings are completely uncorrelated with station centrality, and metro stations can be classified as ‘central destinations’, ‘averagely central origins’, and ‘underutilized peripheral stations’.","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70181640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SMATCH++: Standardized and Extended Evaluation of Semantic Graphs SMATCH++:语义图的标准化扩展评价
Pub Date : 2023-05-11 DOI: 10.48550/arXiv.2305.06993
J. Opitz
The Smatch metric is a popular method for evaluating graph distances, as is necessary, for instance, to assess the performance of semantic graph parsing systems. However, we observe some issues in the metric that jeopardize meaningful evaluation. E.g., opaque pre-processing choices can affect results, and current graph-alignment solvers do not provide us with upper-bounds. Without upper-bounds, however, fair evaluation is not guaranteed. Furthermore, adaptions of Smatch for extended tasks (e.g., fine-grained semantic similarity) are spread out, and lack a unifying framework. For better inspection, we divide the metric into three modules: pre-processing, alignment, and scoring. Examining each module, we specify its goals and diagnose potential issues, for which we discuss and test mitigation strategies. For pre-processing, we show how to fully conform to annotation guidelines that allow structurally deviating but valid graphs. For safer and enhanced alignment, we show the feasibility of optimal alignment in a standard evaluation setup, and develop a lossless graph compression method that shrinks the search space and significantly increases efficiency. For improved scoring, we propose standardized and extended metric calculation of fine-grained sub-graph meaning aspects. Our code is available at https://github.com/flipz357/smatchpp
Smatch度量是一种常用的评估图距离的方法,例如,评估语义图解析系统的性能是必要的。然而,我们注意到指标中的一些问题危及有意义的评价。例如,不透明的预处理选择可能会影响结果,而当前的图形对齐解算器无法为我们提供上限。然而,如果没有上限,就不能保证公平的评价。此外,Smatch对扩展任务的适应(例如,细粒度语义相似性)分散,缺乏统一的框架。为了更好地检查,我们将度量划分为三个模块:预处理、对齐和评分。检查每个模块,我们指定其目标并诊断潜在问题,为此我们讨论并测试缓解策略。对于预处理,我们展示了如何完全符合注释准则,这些准则允许结构上有偏差但有效的图。为了更安全和增强对齐,我们展示了在标准评估设置中进行最佳对齐的可行性,并开发了一种无损图压缩方法,该方法缩小了搜索空间并显著提高了效率。为了改进评分,我们提出了细粒度子图意义方面的标准化和扩展度量计算。我们的代码可在https://github.com/flipz357/smatchpp
{"title":"SMATCH++: Standardized and Extended Evaluation of Semantic Graphs","authors":"J. Opitz","doi":"10.48550/arXiv.2305.06993","DOIUrl":"https://doi.org/10.48550/arXiv.2305.06993","url":null,"abstract":"The Smatch metric is a popular method for evaluating graph distances, as is necessary, for instance, to assess the performance of semantic graph parsing systems. However, we observe some issues in the metric that jeopardize meaningful evaluation. E.g., opaque pre-processing choices can affect results, and current graph-alignment solvers do not provide us with upper-bounds. Without upper-bounds, however, fair evaluation is not guaranteed. Furthermore, adaptions of Smatch for extended tasks (e.g., fine-grained semantic similarity) are spread out, and lack a unifying framework. For better inspection, we divide the metric into three modules: pre-processing, alignment, and scoring. Examining each module, we specify its goals and diagnose potential issues, for which we discuss and test mitigation strategies. For pre-processing, we show how to fully conform to annotation guidelines that allow structurally deviating but valid graphs. For safer and enhanced alignment, we show the feasibility of optimal alignment in a standard evaluation setup, and develop a lossless graph compression method that shrinks the search space and significantly increases efficiency. For improved scoring, we propose standardized and extended metric calculation of fine-grained sub-graph meaning aspects. Our code is available at https://github.com/flipz357/smatchpp","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":"1 1","pages":"1550-1562"},"PeriodicalIF":0.0,"publicationDate":"2023-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46859672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Emissions Reductions from Electrifying High-Mileage Vehicles 高里程车辆电气化减排
Pub Date : 2023-05-10 DOI: 10.32866/001c.75133
Zack Aemmer, Daniel Malarkey, D. MacKenzie
This paper evaluates a strategy that would target sales of internal combustion vehicles driven at high annual mileage for displacement by electric vehicles at the time of initial sale. Using the 2017 National Household Travel Survey data, we observe that the top 20% of light duty vehicles by kilometers traveled generate 46% of the annual greenhouse gas emissions. Displacing the sale of a combustion engine vehicle in the top mileage quintile with an electric vehicle would reduce annual greenhouse gas emissions and certain criteria pollutants by more than 15 times as much as displacing a vehicle in the bottom mileage quintile.
本文评估了一种策略,该策略将以内燃机汽车的高年行驶里程为目标,在初始销售时由电动汽车替代。根据2017年全国家庭出行调查数据,我们观察到按行驶公里数计算排名前20%的轻型车辆产生了46%的年度温室气体排放。用电动汽车取代油耗最高的五分之一的内燃机汽车,每年减少的温室气体排放和某些标准污染物的排放量,是油耗最低的五分之一的汽车的15倍以上。
{"title":"Emissions Reductions from Electrifying High-Mileage Vehicles","authors":"Zack Aemmer, Daniel Malarkey, D. MacKenzie","doi":"10.32866/001c.75133","DOIUrl":"https://doi.org/10.32866/001c.75133","url":null,"abstract":"This paper evaluates a strategy that would target sales of internal combustion vehicles driven at high annual mileage for displacement by electric vehicles at the time of initial sale. Using the 2017 National Household Travel Survey data, we observe that the top 20% of light duty vehicles by kilometers traveled generate 46% of the annual greenhouse gas emissions. Displacing the sale of a combustion engine vehicle in the top mileage quintile with an electric vehicle would reduce annual greenhouse gas emissions and certain criteria pollutants by more than 15 times as much as displacing a vehicle in the bottom mileage quintile.","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41486509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Going beyond research datasets: Novel intent discovery in the industry setting 超越研究数据集:行业环境中的新意图发现
Pub Date : 2023-05-09 DOI: 10.48550/arXiv.2305.05474
Aleksandra Chrabrowa, Tsimur Hadeliya, D. Kajtoch, Robert Mroczkowski, Piotr Rybak
Novel intent discovery automates the process of grouping similar messages (questions) to identify previously unknown intents. However, current research focuses on publicly available datasets which have only the question field and significantly differ from real-life datasets. This paper proposes methods to improve the intent discovery pipeline deployed in a large e-commerce platform. We show the benefit of pre-training language models on in-domain data: both self-supervised and with weak supervision. We also devise the best method to utilize the conversational structure (i.e., question and answer) of real-life datasets during fine-tuning for clustering tasks, which we call Conv. All our methods combined to fully utilize real-life datasets give up to 33pp performance boost over state-of-the-art Constrained Deep Adaptive Clustering (CDAC) model for question only. By comparison CDAC model for the question data only gives only up to 13pp performance boost over the naive baseline.
新颖的意图发现自动化了对类似消息(问题)进行分组以识别先前未知意图的过程。然而,目前的研究集中在公开可用的数据集上,这些数据集只有问题领域,与现实生活中的数据集有很大不同。本文提出了改进部署在大型电子商务平台中的意图发现管道的方法。我们展示了在域内数据上预训练语言模型的好处:既有自我监督的,也有弱监督的。我们还设计了在聚类任务的微调过程中利用真实数据集的会话结构(即问答)的最佳方法,我们称之为Conv。与最先进的仅用于问题的约束深度自适应聚类(CDAC)模型相比,我们所有的方法结合起来,充分利用真实数据集中的性能提高了33pp。相比之下,问题数据的CDAC模型只比原始基线提供了高达13pp的性能提升。
{"title":"Going beyond research datasets: Novel intent discovery in the industry setting","authors":"Aleksandra Chrabrowa, Tsimur Hadeliya, D. Kajtoch, Robert Mroczkowski, Piotr Rybak","doi":"10.48550/arXiv.2305.05474","DOIUrl":"https://doi.org/10.48550/arXiv.2305.05474","url":null,"abstract":"Novel intent discovery automates the process of grouping similar messages (questions) to identify previously unknown intents. However, current research focuses on publicly available datasets which have only the question field and significantly differ from real-life datasets. This paper proposes methods to improve the intent discovery pipeline deployed in a large e-commerce platform. We show the benefit of pre-training language models on in-domain data: both self-supervised and with weak supervision. We also devise the best method to utilize the conversational structure (i.e., question and answer) of real-life datasets during fine-tuning for clustering tasks, which we call Conv. All our methods combined to fully utilize real-life datasets give up to 33pp performance boost over state-of-the-art Constrained Deep Adaptive Clustering (CDAC) model for question only. By comparison CDAC model for the question data only gives only up to 13pp performance boost over the naive baseline.","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":"1 1","pages":"895-911"},"PeriodicalIF":0.0,"publicationDate":"2023-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45021328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Large-Scale Taxi Assignment: Optimization and Learning 网上大型出租车作业的优化与学习
Pub Date : 2023-05-03 DOI: 10.32866/001c.74765
Omar Rifki, Thierry Garaix
We propose a solution method for online vehicle routing, which integrates a machine learning routine to improve tours’ quality. Our optimization model is based on the Bertsimas et al. (2019) re-optimization approach. Two separate routines are developed. The first one uses a neural network to produce realistic pick-up times for the customers to serve. The second one relies on Q-learning in addition to random walks for the construction of the backbone graph corresponding to the instance problem of each time step. The second routine gives improved results compared to the original approach.
我们提出了一种在线车辆路线的解决方法,该方法集成了机器学习例程来提高旅行质量。我们的优化模型基于Bertsimas等人(2019)的重新优化方法。开发了两个独立的例程。第一种是使用神经网络为客户提供真实的取货时间。第二种方法除了随机行走之外,还依赖于Q学习来构建与每个时间步长的实例问题相对应的主干图。与原始方法相比,第二个例程提供了改进的结果。
{"title":"Online Large-Scale Taxi Assignment: Optimization and Learning","authors":"Omar Rifki, Thierry Garaix","doi":"10.32866/001c.74765","DOIUrl":"https://doi.org/10.32866/001c.74765","url":null,"abstract":"We propose a solution method for online vehicle routing, which integrates a machine learning routine to improve tours’ quality. Our optimization model is based on the Bertsimas et al. (2019) re-optimization approach. Two separate routines are developed. The first one uses a neural network to produce realistic pick-up times for the customers to serve. The second one relies on Q-learning in addition to random walks for the construction of the backbone graph corresponding to the instance problem of each time step. The second routine gives improved results compared to the original approach.","PeriodicalId":73025,"journal":{"name":"Findings (Sydney (N.S.W.)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48844192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Findings (Sydney (N.S.W.)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1