Journal of data science, statistics, and visualisation最新文献

RoA: visual analytics support for deconfounded causal inference in observational studies RoA：可视化分析支持观察研究中的非混淆因果推理

Journal of data science, statistics, and visualisation

Pub Date : 2024-06-04 DOI: 10.52933/jdssv.v4i3.72

Dennis Dingen, Marcel Van 't Veer, T. Bakkes, Erik Korsten, Arthur Bouwman, J. van Wijk

The gold standard in medical research to estimate the causal effect of a treatment is the Randomized Controlled Trial (RCT), but in many cases these are not feasible due to ethical, financial or practical issues. Observational studies are an alternative, but can easily lead to doubtful results, because of unbalanced selection bias and confounding. Moreover, RCTs often only apply to a specific subgroup and cannot readily be extrapolated. In response, we present Rod of Asclepius (RoA), a novel visual analytics method that integrates modern techniques designed for identification of causal effects and effect size estimation with subgroup analysis. The result is an interactive display designed to combine exploratory analysis with a robust set of techniques, including causal do-calculus, propensity score weighting, and effect estimation. It enables analysts to conduct observational studies in an exploratory, yet robust way. This is demonstrated by means of a use case involving patients undergoing surgery, for which we collaborated closely with clinical researchers.

在医学研究中，估算治疗效果的黄金标准是随机对照试验（RCT），但在很多情况下，由于伦理、经济或实际问题，随机对照试验并不可行。观察研究是一种替代方法，但由于不平衡的选择偏差和混杂因素，很容易导致可疑的结果。此外，RCT 通常只适用于特定的亚组，不能轻易推断。为此，我们提出了Rod of Asclepius (RoA)，这是一种新颖的可视化分析方法，它将用于识别因果效应和效应大小估计的现代技术与亚组分析相结合。它是一种交互式显示，旨在将探索性分析与一套强大的技术（包括因果计算、倾向得分加权和效应估计）结合起来。它使分析人员能够以探索性但稳健的方式开展观察研究。我们与临床研究人员密切合作，通过一个涉及手术患者的使用案例来证明这一点。

引用次数: 0

An Efficient Way to Find Optimal Crossover Designs Using CVX for Precision Medicine 利用 CVX 为精准医疗寻找最佳交叉设计的有效方法

Journal of data science, statistics, and visualisation

Pub Date : 2024-06-04 DOI: 10.52933/jdssv.v4i3.83

Yin Li, Weng Kee Wong, Hua Zhou, Keumhee Chough Carriere

Crossover designs play an increasingly important role in precision medicine. We show the search of an optimal crossover design can be formulated as a convex optimization problem and convex optimization tools, such as CVX, can be directly used to search for an optimal crossover design. We first demonstrate how to transform crossover design problems into convex optimization problems and show CVX can effortlessly find optimal crossover designs that coincide with a few theoretical crossover optimal designs in the literature. The proposed approach is especially useful when it becomes problematic to construct optimal designs analytically for complicated models. We then apply CVX to find crossover designs for models with auto-correlated error structures or when the information matrices may be singular and analytical answers are unavailable. We also construct N-of-1 trials frequently used in precision medicine to estimate treatment effects on the individuals or to estimate average treatment effects, including finding dual-objective optimal crossover designs.

交叉设计在精准医疗中发挥着越来越重要的作用。我们展示了最优交叉设计的搜索可以表述为一个凸优化问题，并且凸优化工具（如 CVX）可以直接用于搜索最优交叉设计。我们首先演示了如何将交叉设计问题转化为凸优化问题，并表明 CVX 可以毫不费力地找到与文献中一些理论交叉最优设计相吻合的最优交叉设计。当复杂模型的最优设计难以通过分析构建时，所提出的方法尤其有用。然后，我们将 CVX 应用于具有自相关误差结构的模型，或者当信息矩阵可能是奇异的且无法获得分析答案时的交叉设计。我们还构建了精准医疗中常用的 N-of-1 试验，以估计对个体的治疗效果或平均治疗效果，包括找到双目标最优交叉设计。

引用次数: 0

Visual Interactive Parameter Selection for Temporal Blind Source Separation 时空盲源分离的视觉交互式参数选择

Journal of data science, statistics, and visualisation

Pub Date : 2024-06-04 DOI: 10.52933/jdssv.v4i3.82

C. Cappello, Nikolaus Piccolotto, C. Muehlmann, M. Bögl, Peter Filzmoser, Silvia Miksch, K. Nordhausen

Many fields of science and industry collect and analyze multivariate time-varying measurements, e.g., healthcare, geophysics, or finance. Such data is often high-dimensional, correlated, and noisy. Experts are interested in latent components of the dataset, but due to the aforementioned properties these are difficult to obtain. Temporal Blind Source Separation (TBSS) is a suitable and well-established framework for these data. However, the large choice of methods and their tuning parameters impedes the effective use of TBSS in practice. The goal of Visual Analytics (VA) is to create powerful analytic tools by combining the strengths of humans and computers. We designed, developed, and evaluated VA contributions in previous work to support TBSS-related analysis tasks. In this paper, we highlight the benefits and opportunities of VA concepts for statistic-oriented problems using a real-world TBSS application example with a dataset of climate and meteorological measurements in Italy.

许多科学和工业领域都在收集和分析多变量时变测量数据，例如医疗保健、地球物理或金融领域。这些数据通常具有高维、相关和噪声等特点。专家们对数据集的潜在成分很感兴趣，但由于上述特性，很难获得这些成分。时空盲源分离（TBSS）是一种适用于这些数据的成熟框架。然而，大量的方法及其调整参数阻碍了 TBSS 在实践中的有效应用。可视分析（VA）的目标是结合人类和计算机的优势，创造出强大的分析工具。我们在以前的工作中设计、开发并评估了可视化分析的贡献，以支持与 TBSS 相关的分析任务。在本文中，我们以意大利的气候和气象测量数据集为例，通过一个真实的 TBSS 应用实例，强调了可视化分析概念在面向统计问题方面的优势和机遇。

引用次数: 0

An edge preserving median filter for images based on level-sets 基于水平集的图像边缘保护中值滤波器

Journal of data science, statistics, and visualisation

Pub Date : 2024-06-04 DOI: 10.52933/jdssv.v4i3.74

Jean-Pierre Stander

We propose an edge preserving median filter, called the level-set adaptive median filter, for noise removal in images. This filter uses connected sets of pixels with the same value, namely level-sets, as flexible regions which contour to edges in the image. The filter determines whether a set is noise or signal and smooths the noise. These set regions are flexible in terms of shape since they are created based on their values, and being data-driven therefore provide the mechanism for the filter to preserve edges in the image. We used metrics such as Pratt's Figure of Merit and Peak-Signal-to-Noise Ratio on the labelled faces in the wild data set. We concluded that the proposed level-set adaptive median filter does remove noise while preserving the edges in the image better than the traditional adaptive median filter.

我们提出了一种边缘保护中值滤波器，称为水平集自适应中值滤波器，用于去除图像中的噪声。该滤波器使用具有相同值的相连像素集（即电平集）作为灵活区域，这些区域与图像中的边缘轮廓一致。该滤波器能确定一个集合是噪声还是信号，并对噪声进行平滑处理。这些集合区域的形状非常灵活，因为它们是根据其值创建的，因此数据驱动为滤波器提供了保留图像边缘的机制。我们在野生数据集中使用了普拉特功绩图和峰值信噪比等指标来衡量已标记的人脸。我们得出的结论是，与传统的自适应中值滤波器相比，所提出的电平集自适应中值滤波器在去除噪声的同时，还能更好地保留图像中的边缘。

引用次数: 0

Visualisations for Bayesian Additive Regression Trees 贝叶斯加性回归树的可视化方法

Journal of data science, statistics, and visualisation

Pub Date : 2024-02-07 DOI: 10.52933/jdssv.v4i1.79

Alan N. Inglis, Andrew Parnell, Catherine Hurley

Tree-based regression and classification has become a standard tool in modern data science. Bayesian Additive Regression Trees (BART) has in particular gained wide popularity due its flexibility in dealing with interactions and non-linear effects. BART is a Bayesian tree-based machine learning method that can be applied to both regression and classification problems and yields competitive or superior results when compared to other predictive models. As a Bayesian model, BART allows the practitioner to explore the uncertainty around predictions through the posterior distribution. In this paper, we present new Visualisation techniques for exploring BART models. We construct conventional plots to analyse a model’s performance and stability as well as create new tree-based plots to analyse variable importance, interaction, and tree structure. We employ Value Suppressing Uncertainty Palettes (VSUP) to construct heatmaps that display variable importance and interactions jointly using colour scale to represent posterior uncertainty. Our new Visualisations are designed to work with the most popular BART R packages available, namely BART, dbarts, and bartMachine. Our approach is implemented in the R package bartMan (BART Model ANalysis).

基于树的回归和分类已成为现代数据科学的标准工具。贝叶斯加性回归树（BART）因其在处理交互和非线性效应方面的灵活性，尤其受到广泛欢迎。BART 是一种基于贝叶斯树的机器学习方法，既可应用于回归问题，也可应用于分类问题，与其他预测模型相比，它能产生有竞争力或更优越的结果。作为一种贝叶斯模型，BART 允许实践者通过后验分布来探索预测的不确定性。在本文中，我们介绍了探索 BART 模型的新可视化技术。我们构建了常规图来分析模型的性能和稳定性，并创建了新的树状图来分析变量的重要性、交互作用和树状结构。我们采用价值抑制不确定性调色板（VSUP）来构建热图，利用色标共同显示变量重要性和交互作用，以表示后验不确定性。我们的新可视化设计可与现有最流行的 BART R 软件包（即 BART、dbarts 和 bartMachine）配合使用。我们的方法是在 R 软件包 bartMan（BART 模型分析）中实现的。

{"title":"Visualisations for Bayesian Additive Regression Trees","authors":"Alan N. Inglis, Andrew Parnell, Catherine Hurley","doi":"10.52933/jdssv.v4i1.79","DOIUrl":"https://doi.org/10.52933/jdssv.v4i1.79","url":null,"abstract":"Tree-based regression and classification has become a standard tool in modern data science. Bayesian Additive Regression Trees (BART) has in particular gained wide popularity due its flexibility in dealing with interactions and non-linear effects. BART is a Bayesian tree-based machine learning method that can be applied to both regression and classification problems and yields competitive or superior results when compared to other predictive models. As a Bayesian model, BART allows the practitioner to explore the uncertainty around predictions through the posterior distribution. In this paper, we present new Visualisation techniques for exploring BART models. We construct conventional plots to analyse a model’s performance and stability as well as create new tree-based plots to analyse variable importance, interaction, and tree structure. We employ Value Suppressing Uncertainty Palettes (VSUP) to construct heatmaps that display variable importance and interactions jointly using colour scale to represent posterior uncertainty. Our new Visualisations are designed to work with the most popular BART R packages available, namely BART, dbarts, and bartMachine. Our approach is implemented in the R package bartMan (BART Model ANalysis).","PeriodicalId":93459,"journal":{"name":"Journal of data science, statistics, and visualisation","volume":"18 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139796981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Visualisations for Bayesian Additive Regression Trees 贝叶斯加性回归树的可视化方法

Journal of data science, statistics, and visualisation

Pub Date : 2024-02-07 DOI: 10.52933/jdssv.v4i1.79

Alan N. Inglis, Andrew Parnell, Catherine Hurley

Tree-based regression and classification has become a standard tool in modern data science. Bayesian Additive Regression Trees (BART) has in particular gained wide popularity due its flexibility in dealing with interactions and non-linear effects. BART is a Bayesian tree-based machine learning method that can be applied to both regression and classification problems and yields competitive or superior results when compared to other predictive models. As a Bayesian model, BART allows the practitioner to explore the uncertainty around predictions through the posterior distribution. In this paper, we present new Visualisation techniques for exploring BART models. We construct conventional plots to analyse a model’s performance and stability as well as create new tree-based plots to analyse variable importance, interaction, and tree structure. We employ Value Suppressing Uncertainty Palettes (VSUP) to construct heatmaps that display variable importance and interactions jointly using colour scale to represent posterior uncertainty. Our new Visualisations are designed to work with the most popular BART R packages available, namely BART, dbarts, and bartMachine. Our approach is implemented in the R package bartMan (BART Model ANalysis).

基于树的回归和分类已成为现代数据科学的标准工具。贝叶斯加性回归树（BART）因其在处理交互和非线性效应方面的灵活性，尤其受到广泛欢迎。BART 是一种基于贝叶斯树的机器学习方法，既可应用于回归问题，也可应用于分类问题，与其他预测模型相比，它能产生有竞争力或更优越的结果。作为一种贝叶斯模型，BART 允许实践者通过后验分布来探索预测的不确定性。在本文中，我们介绍了探索 BART 模型的新可视化技术。我们构建了常规图来分析模型的性能和稳定性，并创建了新的树状图来分析变量的重要性、交互作用和树状结构。我们采用价值抑制不确定性调色板（VSUP）来构建热图，利用色标共同显示变量重要性和交互作用，以表示后验不确定性。我们的新可视化设计可与现有最流行的 BART R 软件包（即 BART、dbarts 和 bartMachine）配合使用。我们的方法是在 R 软件包 bartMan（BART 模型分析）中实现的。

{"title":"Visualisations for Bayesian Additive Regression Trees","authors":"Alan N. Inglis, Andrew Parnell, Catherine Hurley","doi":"10.52933/jdssv.v4i1.79","DOIUrl":"https://doi.org/10.52933/jdssv.v4i1.79","url":null,"abstract":"Tree-based regression and classification has become a standard tool in modern data science. Bayesian Additive Regression Trees (BART) has in particular gained wide popularity due its flexibility in dealing with interactions and non-linear effects. BART is a Bayesian tree-based machine learning method that can be applied to both regression and classification problems and yields competitive or superior results when compared to other predictive models. As a Bayesian model, BART allows the practitioner to explore the uncertainty around predictions through the posterior distribution. In this paper, we present new Visualisation techniques for exploring BART models. We construct conventional plots to analyse a model’s performance and stability as well as create new tree-based plots to analyse variable importance, interaction, and tree structure. We employ Value Suppressing Uncertainty Palettes (VSUP) to construct heatmaps that display variable importance and interactions jointly using colour scale to represent posterior uncertainty. Our new Visualisations are designed to work with the most popular BART R packages available, namely BART, dbarts, and bartMachine. Our approach is implemented in the R package bartMan (BART Model ANalysis).","PeriodicalId":93459,"journal":{"name":"Journal of data science, statistics, and visualisation","volume":"46 7-8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139857031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using hospital data for monitoring the dynamics of COVID-19 in France 利用医院数据监测法国COVID-19的动态

Journal of data science, statistics, and visualisation

Pub Date : 2022-11-28 DOI: 10.52933/jdssv.v2i7.48

M. Lavielle

The objective of this article is to show how daily hospital data can be used to monitor the evolution of the COVID-19 epidemic in France. A piecewise defined dynamic model allows to fit very well the available hospital admission, death and discharge data. The change-points detected correspond to moments when the dynamics of the epidemic changed abruptly. It is therefore a surveillance tool, not a forecasting tool. In other words, it can be used effectively to warn of a restart of epidemic activity, but it is not designed to assess the impact of a new lockdown or the emergence of a new variant.The model, data and fits are implemented in an interactive web application.

本文的目的是展示如何使用每日医院数据来监测法国COVID-19流行病的演变。分段定义的动态模型可以很好地拟合现有的住院、死亡和出院数据。检测到的变化点对应于流行病动态突然变化的时刻。因此，它是一种监测工具，而不是预测工具。换句话说，它可以有效地用于警告流行病活动的重新开始，但它不是用来评估新封锁或新变种出现的影响。模型、数据和拟合在交互式web应用程序中实现。

引用次数: 0

Implementation of an Adaptable COVID-19 Utilization and Resource Visualization Engine (CURVE) to Depict In-Hospital Resource Forecasts Over Time 实施适应性强的COVID-19利用和资源可视化引擎(CURVE)，以描述医院内资源随时间的预测

Journal of data science, statistics, and visualisation

Pub Date : 2022-11-28 DOI: 10.52933/jdssv.v2i7.19

Shih-Hsiung Chou, P. Turk, M. Kowalkowski, J. Kearns, J. Roberge, J. Priem, Y. Taylor, R. Burns, P. Palmer, A. McWilliams

We developed an interactive web-based, decision support application that can adapt to the rapid pace of change in region-specific pandemic related variables and knowledge, thereby providing timely, accurate insights to inform a large healthcare system’s proactive response to COVID-19 hospital resource planning. We designed the COVID-19 Utilization and Resource Visualization Engine (CURVE) app to be adaptable to real-time changes as the pandemic evolved, enabling decisions to be supported by contemporary local data and accurate predictive models. To demonstrate this flexibility, we sequentially implemented a Susceptible-Infected-Removed (SIR) model that incorporates social-distancing and imperfect detection (SIR-D2), an extended-state-space Bayesian SIR model (eSIR), and a time-series model (ARIMA). CURVE improves upon other pandemic forecasting solutions by providing adaptable decision support that generates locally calibrated forecasts aligned to health system specific data to guide COVID-19 pandemic planning. The app additionally enables systematic monitoring of forecast model performance and realignment that keeps pace with the pandemic’s volatile spread and behavior. CURVE provides a flexible pandemic decision support framework that places the most accurate, locally relevant information in front of decision makers to enable health systems to be proactive and prepared.

我们开发了一个基于网络的交互式决策支持应用程序，该应用程序可以适应特定地区大流行相关变量和知识的快速变化，从而提供及时、准确的见解，为大型医疗保健系统主动应对COVID-19医院资源规划提供信息。我们设计了COVID-19利用和资源可视化引擎(CURVE)应用程序，以适应大流行演变的实时变化，使决策能够得到当代本地数据和准确预测模型的支持。为了证明这种灵活性，我们依次实现了一个包含社交距离和不完美检测(SIR- d2)的易感-感染-移除(SIR)模型、一个扩展状态空间贝叶斯SIR模型(eSIR)和一个时间序列模型(ARIMA)。CURVE在其他大流行预测解决方案的基础上进行了改进，提供适应性决策支持，生成与卫生系统特定数据一致的本地校准预测，以指导COVID-19大流行规划。该应用程序还可以系统地监测预测模型的性能和调整，以跟上大流行的不稳定传播和行为。CURVE提供了一个灵活的大流行决策支持框架，将最准确、与当地相关的信息摆在决策者面前，使卫生系统能够积极主动、做好准备。

{"title":"Implementation of an Adaptable COVID-19 Utilization and Resource Visualization Engine (CURVE) to Depict In-Hospital Resource Forecasts Over Time","authors":"Shih-Hsiung Chou, P. Turk, M. Kowalkowski, J. Kearns, J. Roberge, J. Priem, Y. Taylor, R. Burns, P. Palmer, A. McWilliams","doi":"10.52933/jdssv.v2i7.19","DOIUrl":"https://doi.org/10.52933/jdssv.v2i7.19","url":null,"abstract":"We developed an interactive web-based, decision support application that can adapt to the rapid pace of change in region-specific pandemic related variables and knowledge, thereby providing timely, accurate insights to inform a large healthcare system’s proactive response to COVID-19 hospital resource planning. We designed the COVID-19 Utilization and Resource Visualization Engine (CURVE) app to be adaptable to real-time changes as the pandemic evolved, enabling decisions to be supported by contemporary local data and accurate predictive models. To demonstrate this flexibility, we sequentially implemented a Susceptible-Infected-Removed (SIR) model that incorporates social-distancing and imperfect detection (SIR-D2), an extended-state-space Bayesian SIR model (eSIR), and a time-series model (ARIMA). CURVE improves upon other pandemic forecasting solutions by providing adaptable decision support that generates locally calibrated forecasts aligned to health system specific data to guide COVID-19 pandemic planning. The app additionally enables systematic monitoring of forecast model performance and realignment that keeps pace with the pandemic’s volatile spread and behavior. CURVE provides a flexible pandemic decision support framework that places the most accurate, locally relevant information in front of decision makers to enable health systems to be proactive and prepared.","PeriodicalId":93459,"journal":{"name":"Journal of data science, statistics, and visualisation","volume":"5 1","pages":"1-13"},"PeriodicalIF":0.0,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86846708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multiple Changepoint Analysis of COVID-19 Infection Progression and Related Deaths in the Small Island State of Malta 小岛屿国家马耳他COVID-19感染进展和相关死亡的多变化点分析

Journal of data science, statistics, and visualisation

Pub Date : 2022-11-28 DOI: 10.52933/jdssv.v2i7.50

D. Suda, M. Inguanez, Gianluca Ursino

In December 2019, in the city of Wuhan (China), Severe Acute Respiratory Syndrome Coronavirus - 2 (SARS-CoV−2), a virus that causes what is known as Coronavirus Disease 2019 (better known as COVID-19), emerged. In a few months the virus spread around the world becoming a global pandemic that has shaken the world. On Malta (a nation consisting of an archipelago of islands of approximately 500000 people), which is the case study of this analysis, the first case was identified on 7/3/2020. In this paper, we shall fit a piecewise linear trend model to the log-scale of cumulative cases and deaths due to COVID-19 in Malta by implementing the SN-NOT changepoint model. This model combines the self-normalisation (SN) technique, which is used to test whether there is a single change-point in the linear trend of a time series, with the Narrowest Over Threshold algorithm (NOT) to achieve multiple change-point in the linear trend. Through analysis of news reports and other sources of information, estimated change-points are then compared to potential factors such as health restrictions, mass events, government policy and population behaviour that have affected these changes, in order to determine the efffect of these factors on the spread of the disease.

2019年12月，在中国武汉市出现了严重急性呼吸综合征冠状病毒- 2 (SARS-CoV - 2)，这是一种导致2019年冠状病毒病(更广为人知的是COVID-19)的病毒。几个月后，这种病毒在世界各地传播，成为一场震撼世界的全球大流行。在本分析的案例研究马耳他(一个由岛屿群岛组成的国家，人口约50万)，第一例病例于2020年7月3日被发现。本文将采用SN-NOT变点模型对马耳他COVID-19累计病例和死亡人数的对数尺度拟合分段线性趋势模型。该模型结合了用于检验时间序列线性趋势中是否存在单个变化点的自归一化(SN)技术和用于检验线性趋势中是否存在多个变化点的最窄阈值算法(NOT)。通过分析新闻报道和其他信息来源，然后将估计的变化点与影响这些变化的潜在因素(如卫生限制、大规模事件、政府政策和人口行为)进行比较，以确定这些因素对疾病传播的影响。

{"title":"Multiple Changepoint Analysis of COVID-19 Infection Progression and Related Deaths in the Small Island State of Malta","authors":"D. Suda, M. Inguanez, Gianluca Ursino","doi":"10.52933/jdssv.v2i7.50","DOIUrl":"https://doi.org/10.52933/jdssv.v2i7.50","url":null,"abstract":"In December 2019, in the city of Wuhan (China), Severe Acute Respiratory Syndrome Coronavirus - 2 (SARS-CoV−2), a virus that causes what is known as Coronavirus Disease 2019 (better known as COVID-19), emerged. In a few months the virus spread around the world becoming a global pandemic that has shaken the world. On Malta (a nation consisting of an archipelago of islands of approximately 500000 people), which is the case study of this analysis, the first case was identified on 7/3/2020. In this paper, we shall fit a piecewise linear trend model to the log-scale of cumulative cases and deaths due to COVID-19 in Malta by implementing the SN-NOT changepoint model. This model combines the self-normalisation (SN) technique, which is used to test whether there is a single change-point in the linear trend of a time series, with the Narrowest Over Threshold algorithm (NOT) to achieve multiple change-point in the linear trend. Through analysis of news reports and other sources of information, estimated change-points are then compared to potential factors such as health restrictions, mass events, government policy and population behaviour that have affected these changes, in order to determine the efffect of these factors on the spread of the disease.","PeriodicalId":93459,"journal":{"name":"Journal of data science, statistics, and visualisation","volume":"14 1","pages":"63-83"},"PeriodicalIF":0.0,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84968521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Visual Narratives of the Covid-19 pandemic Covid-19大流行的视觉叙事

Journal of data science, statistics, and visualisation

Pub Date : 2022-11-28 DOI: 10.52933/jdssv.v2i7.64

A. Wilhelm, Susan Vanderplas

Covid-19 has sparked a worldwide interest in understanding the dynamic evo- lution of a pandemic and tracking the effectiveness of preventive measures and rules. For this reason, numerous media and research groups have produced com- prehensive data visualisations to illustrate the relevant trends and figures. In this paper, we will look at a selection of Covid 19 data visualisations to evaluate and discuss the currently established visualisation tools in terms of their ability to provide a communication channel both within the data science team and between data analysts, domain experts and a general interested audience. Although there is no set catalogue of evaluation criteria for data visualisations, we will try to give an overview of the different core aspects of visualisation evaluation and their competing principles.

Covid-19引发了全世界对了解大流行动态演变和跟踪预防措施和规则有效性的兴趣。出于这个原因，许多媒体和研究小组已经制作了全面的数据可视化来说明相关的趋势和数字。在本文中，我们将研究Covid - 19数据可视化的选择，以评估和讨论当前建立的可视化工具在数据科学团队内部以及数据分析师、领域专家和一般感兴趣的受众之间提供沟通渠道的能力。虽然没有数据可视化评估标准的固定目录，我们将尝试给出可视化评估的不同核心方面及其相互竞争的原则的概述。

引用次数: 0