首页 > 最新文献

2019 Systems and Information Engineering Design Symposium (SIEDS)最新文献

英文 中文
Optimization of Production and Packaging Schedules in a Mixed Discrete/Continuous Manufacturing Environment 离散/连续混合制造环境下生产和包装计划的优化
Pub Date : 2019-04-01 DOI: 10.1109/SIEDS.2019.8735607
Jarett Cestaro, David Conklin, Douglas Ziman, Edmund Pan, Grant Anhorn, M. Cunningham, Nevan Schulte, Faraz Dadgostari, P. Beling
This research was driven by the need for a more efficient production scheduling system in a consumable liquid product division of a large consumer products company. The manufacturing process under inspection consists of continuous and discrete elements, on both production and packaging lines. The production lines are split into continuous production lines and batch production lines which produce the product in fixed batch amounts. Then there are several bottling lines, some of which package a particular bottle size and others that can package multiple bottle sizes. The main objective of this research was to reduce the amount of time it takes for the client to create production and bottling schedules. An optimization model was developed to automate this process and provide the client with the best possible schedule. The objective of the model is to minimize cost by minimizing the number of switches across the production and bottling lines, as well as minimizing the amount of overproduction. Inputs into the model include model parameters, like the number of shifts to schedule, and monthly demand numbers for each stock keeping unit (SKU). The variables being solved for are the amount of each flavor to be produced across the production lines during each shift, and the number of bottles of each SKU to be bottled across the bottling lines during each shift. Due to the unique constraints and resources of the client, a custom formulation using mixed integer programming was necessary to achieve these objectives. Overall, our model fell short in some areas but succeeded in others. Our analysis showed that the model had a 13% average decrease in production switches but an 87% average increase in bottling switches compared to the current manual scheduling system. However, the ability of our system to create a good enough initial schedule reduces the time it takes expert human schedulers to develop a final schedule by up to 85%. Runtime and computational constraints barred us from creating an optimal, cost-minimized solution for our client, and future work can be directed toward solving these issues.
本研究是由需要一个更有效的生产调度系统在一个大型消费品公司的消耗性液体产品部门。被检查的制造过程包括连续和离散的元素,在生产和包装线上。生产线分为连续生产线和批量生产线,以固定的批量生产产品。然后有几条装瓶生产线,其中一些包装特定的瓶子大小,而另一些可以包装多种瓶子大小。这项研究的主要目的是减少客户创建生产和装瓶时间表所需的时间。开发了一个优化模型来自动化这个过程,并为客户提供最佳的时间表。该模型的目标是通过最小化生产和装瓶线上的开关数量,以及最小化生产过剩的数量来最小化成本。模型的输入包括模型参数,比如要安排的班次数量,以及每个库存单位(SKU)的月需求数量。要解决的变量是在每班期间在生产线上生产的每种风味的数量,以及在每班期间在装瓶生产线上装瓶的每种SKU的瓶数。由于客户的独特约束和资源,需要使用混合整数规划的自定义公式来实现这些目标。总的来说,我们的模式在某些方面有所欠缺,但在其他方面取得了成功。我们的分析表明,与目前的手动调度系统相比,该模型的生产开关平均减少了13%,但装瓶开关平均增加了87%。然而,我们的系统创建一个足够好的初始时间表的能力减少了专家调度人员开发最终时间表所需的时间,最多可减少85%。运行时和计算限制使我们无法为客户创建最优的、成本最低的解决方案,未来的工作可以针对解决这些问题。
{"title":"Optimization of Production and Packaging Schedules in a Mixed Discrete/Continuous Manufacturing Environment","authors":"Jarett Cestaro, David Conklin, Douglas Ziman, Edmund Pan, Grant Anhorn, M. Cunningham, Nevan Schulte, Faraz Dadgostari, P. Beling","doi":"10.1109/SIEDS.2019.8735607","DOIUrl":"https://doi.org/10.1109/SIEDS.2019.8735607","url":null,"abstract":"This research was driven by the need for a more efficient production scheduling system in a consumable liquid product division of a large consumer products company. The manufacturing process under inspection consists of continuous and discrete elements, on both production and packaging lines. The production lines are split into continuous production lines and batch production lines which produce the product in fixed batch amounts. Then there are several bottling lines, some of which package a particular bottle size and others that can package multiple bottle sizes. The main objective of this research was to reduce the amount of time it takes for the client to create production and bottling schedules. An optimization model was developed to automate this process and provide the client with the best possible schedule. The objective of the model is to minimize cost by minimizing the number of switches across the production and bottling lines, as well as minimizing the amount of overproduction. Inputs into the model include model parameters, like the number of shifts to schedule, and monthly demand numbers for each stock keeping unit (SKU). The variables being solved for are the amount of each flavor to be produced across the production lines during each shift, and the number of bottles of each SKU to be bottled across the bottling lines during each shift. Due to the unique constraints and resources of the client, a custom formulation using mixed integer programming was necessary to achieve these objectives. Overall, our model fell short in some areas but succeeded in others. Our analysis showed that the model had a 13% average decrease in production switches but an 87% average increase in bottling switches compared to the current manual scheduling system. However, the ability of our system to create a good enough initial schedule reduces the time it takes expert human schedulers to develop a final schedule by up to 85%. Runtime and computational constraints barred us from creating an optimal, cost-minimized solution for our client, and future work can be directed toward solving these issues.","PeriodicalId":265421,"journal":{"name":"2019 Systems and Information Engineering Design Symposium (SIEDS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125122758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lost in Space: A Case Study on Optimizing Student Spaces at the University of Virginia 迷失在空间:优化弗吉尼亚大学学生空间的案例研究
Pub Date : 2019-04-01 DOI: 10.1109/SIEDS.2019.8735600
Hayley Waleska, Caroline McNichols, Stefan Zachar, Torian Wright, Joshua Cauthen, Seshi Konu, M. DeDomenico, R. Bailey
Reservable student space is an essential resource for student organizations at universities. The ability to provide equitable access to spaces is a key role of the administration. The focus of this paper is a case study exploring mechanisms to improve both spaces and how students access them at the University of Virginia, taking a human-centered design approach to 1) analyzing the current system, 2) identifying, evaluating, and evolving recommendations to improve system performance, and 3) assessing impact of recommendations. Based on prior studies, usage data, surveys, interviews, and focus groups, we identified two driving questions: 1) Does the university have the needed spaces? 2) Are these spaces accessible by student groups? In response, the team developed a dual focus: Space Design and Utilization (SDU) to address the idea of “right spaces” and Reservation System Design (RSD) to address issues related to ease of access. SDU revealed issues with overly-strict policies and space design, a disparity in the spread of spaces across campus, and a shortage of spaces equipped to diverse student activity. To address these issues, we recommend the university audit its policies, focus future construction on creating hubs of space near student housing, and emphasize the ideal of multi-use space during new construction and renovations. RSD revealed a lack of procedural transparency leads to feelings of inequity between student groups, incorrect assumptions regarding users lead to dysfunctional interactions, and the system was not making optimal use of the limited spatial resources. To address issues, we recommend the university increase transparency through clear and consistent communication, refrain from making unjustified assumptions of users, and allocate spaces to proper events while allowing flexibility within spaces.
预留学生空间是大学学生组织的重要资源。提供公平使用空间的能力是行政部门的关键作用。本文的重点是一个案例研究,探索弗吉尼亚大学改善这两个空间的机制以及学生如何访问它们,采用以人为中心的设计方法来1)分析当前系统,2)识别、评估和发展建议以提高系统性能,3)评估建议的影响。基于之前的研究、使用数据、调查、访谈和焦点小组,我们确定了两个驱动问题:1)大学有所需的空间吗?2)这些空间是否可供学生团体使用?作为回应,该团队开发了双重重点:空间设计和利用(SDU)解决“合适的空间”的想法,预留系统设计(RSD)解决与方便访问相关的问题。SDU透露了过于严格的政策和空间设计问题,校园空间分布的差异,以及缺乏用于多样化学生活动的空间。为了解决这些问题,我们建议大学审查其政策,将未来的建设重点放在创造学生宿舍附近的空间中心上,并在新建和翻新时强调多功能空间的理想。RSD显示,程序透明度的缺乏导致学生群体之间的不平等感,对用户的错误假设导致交互功能失调,系统没有充分利用有限的空间资源。为了解决这些问题,我们建议大学通过清晰和一致的沟通来提高透明度,避免对用户做出不合理的假设,并为适当的活动分配空间,同时允许空间内的灵活性。
{"title":"Lost in Space: A Case Study on Optimizing Student Spaces at the University of Virginia","authors":"Hayley Waleska, Caroline McNichols, Stefan Zachar, Torian Wright, Joshua Cauthen, Seshi Konu, M. DeDomenico, R. Bailey","doi":"10.1109/SIEDS.2019.8735600","DOIUrl":"https://doi.org/10.1109/SIEDS.2019.8735600","url":null,"abstract":"Reservable student space is an essential resource for student organizations at universities. The ability to provide equitable access to spaces is a key role of the administration. The focus of this paper is a case study exploring mechanisms to improve both spaces and how students access them at the University of Virginia, taking a human-centered design approach to 1) analyzing the current system, 2) identifying, evaluating, and evolving recommendations to improve system performance, and 3) assessing impact of recommendations. Based on prior studies, usage data, surveys, interviews, and focus groups, we identified two driving questions: 1) Does the university have the needed spaces? 2) Are these spaces accessible by student groups? In response, the team developed a dual focus: Space Design and Utilization (SDU) to address the idea of “right spaces” and Reservation System Design (RSD) to address issues related to ease of access. SDU revealed issues with overly-strict policies and space design, a disparity in the spread of spaces across campus, and a shortage of spaces equipped to diverse student activity. To address these issues, we recommend the university audit its policies, focus future construction on creating hubs of space near student housing, and emphasize the ideal of multi-use space during new construction and renovations. RSD revealed a lack of procedural transparency leads to feelings of inequity between student groups, incorrect assumptions regarding users lead to dysfunctional interactions, and the system was not making optimal use of the limited spatial resources. To address issues, we recommend the university increase transparency through clear and consistent communication, refrain from making unjustified assumptions of users, and allocate spaces to proper events while allowing flexibility within spaces.","PeriodicalId":265421,"journal":{"name":"2019 Systems and Information Engineering Design Symposium (SIEDS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132258146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing the Viability of a Fold-Out Hydroponic Farm for Humanitarian Relief Efforts in Dominica 评估在多米尼加开展人道主义救援工作的折叠水培农场的可行性
Pub Date : 2019-04-01 DOI: 10.1109/SIEDS.2019.8735624
Annie Hatcher, Stephen Jung, Holden Keegan, Todd Le, Henry C Quach, C. Ward, Justin L. Weisberg, G. Louis
Hurricane Maria destroyed most of the agricultural sector on the island of Dominica and left the inhabitants with no locally grown produce for months. As climate change brings more frequent and extreme weather events, the Caribbean region becomes increasingly vulnerable to their adverse effects. Agriculture is a large component of Dominica's economy, and the country seeks to adapt its agricultural practices to accommodate the changing environmental conditions. A Charlottesville, VA startup company, Babylon Micro-Farms (BMF), received a grant to adapt their current hydroponic crop cultivation system to create a prototype for humanitarian food assistance. This prototype “fold-out farm” (FoF) would be solar-powered and transportable to Dominica for testing as a reliable source of fresh produce for families. The produce grown using the FoF could also provide extra income for these families. In order to assess the feasibility of implementing the FoF in Dominica, our project team is developing a Capacity Factor Analysis (CFA). CFA assesses a community's capability to successfully acquire and independently sustain a technology using the following factors: Institutional, Human Resources, Technical, Economic, Environmental, Energy, Socio-Cultural, and Service. By analyzing these individual factors, we hope to identify issues likely to affect the successful deployment of the FoF in Dominica. The results of this CFA lead to suggestions for improvements to the FoF technology and its implementation in Dominica. The development of this CFA will continue to be useful to BMF in the future while implementing their technology in other communities.
飓风玛丽亚摧毁了多米尼加岛上的大部分农业部门,使居民几个月没有当地种植的农产品。随着气候变化带来更加频繁和极端的天气事件,加勒比地区越来越容易受到其不利影响。农业是多米尼加经济的一个重要组成部分,该国力求调整其农业做法,以适应不断变化的环境条件。弗吉尼亚州夏洛茨维尔的一家初创公司巴比伦微型农场(BMF)获得了一笔赠款,用于改造他们目前的水培作物种植系统,以创建人道主义粮食援助的原型。这个原型“折叠农场”(FoF)将由太阳能供电,并可运输到多米尼加进行测试,作为家庭新鲜农产品的可靠来源。使用FoF种植的农产品也可以为这些家庭提供额外的收入。为了评估在多米尼加实施FoF的可行性,我们的项目团队正在开发一种能力因素分析(CFA)。CFA通过以下因素评估一个社区成功获取和独立维持一项技术的能力:制度、人力资源、技术、经济、环境、能源、社会文化和服务。通过分析这些个别因素,我们希望找出可能影响在多米尼加成功部署FoF的问题。CFA的结果为改进FoF技术及其在多米尼加的实施提出了建议。该CFA的开发将在未来继续对BMF有用,同时在其他社区实施他们的技术。
{"title":"Assessing the Viability of a Fold-Out Hydroponic Farm for Humanitarian Relief Efforts in Dominica","authors":"Annie Hatcher, Stephen Jung, Holden Keegan, Todd Le, Henry C Quach, C. Ward, Justin L. Weisberg, G. Louis","doi":"10.1109/SIEDS.2019.8735624","DOIUrl":"https://doi.org/10.1109/SIEDS.2019.8735624","url":null,"abstract":"Hurricane Maria destroyed most of the agricultural sector on the island of Dominica and left the inhabitants with no locally grown produce for months. As climate change brings more frequent and extreme weather events, the Caribbean region becomes increasingly vulnerable to their adverse effects. Agriculture is a large component of Dominica's economy, and the country seeks to adapt its agricultural practices to accommodate the changing environmental conditions. A Charlottesville, VA startup company, Babylon Micro-Farms (BMF), received a grant to adapt their current hydroponic crop cultivation system to create a prototype for humanitarian food assistance. This prototype “fold-out farm” (FoF) would be solar-powered and transportable to Dominica for testing as a reliable source of fresh produce for families. The produce grown using the FoF could also provide extra income for these families. In order to assess the feasibility of implementing the FoF in Dominica, our project team is developing a Capacity Factor Analysis (CFA). CFA assesses a community's capability to successfully acquire and independently sustain a technology using the following factors: Institutional, Human Resources, Technical, Economic, Environmental, Energy, Socio-Cultural, and Service. By analyzing these individual factors, we hope to identify issues likely to affect the successful deployment of the FoF in Dominica. The results of this CFA lead to suggestions for improvements to the FoF technology and its implementation in Dominica. The development of this CFA will continue to be useful to BMF in the future while implementing their technology in other communities.","PeriodicalId":265421,"journal":{"name":"2019 Systems and Information Engineering Design Symposium (SIEDS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116327112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Collection Methods for Building a Free Response Training Simulation 建立自由反应训练模拟的数据收集方法
Pub Date : 2019-04-01 DOI: 10.1109/SIEDS.2019.8735621
Vaibhav Sharma, Benjamin Shpringer, S. Yang, M. Bolger, Sodiq Adewole, D. Brown, Erfaneh Gharavi
Most past research in the area of serious games for simulation has focused on games with constrained multiple-choice based dialogue systems. Recent advancements in natural language processing research make free-input text classification-based dialogue systems more feasible, but an effective framework for collecting training data for such systems has not yet been developed. This paper presents methods for collecting and generating data for training a free-input classification-based system. Various data crowdsourcing prompt types are presented. A binary category system, which increases the fidelity of the labeling to make free-input classification more effective, is presented. Finally, a data generation algorithm based on the binary data labeling system is presented. Future work will use the data crowdsourcing and generation methods presented here to implement a free-input dialogue system in a virtual reality (VR) simulation designed for cultural competency training.
过去在模拟严肃游戏领域的大多数研究都集中在带有限制性多选对话系统的游戏上。自然语言处理研究的最新进展使得基于自由输入文本分类的对话系统更加可行,但尚未开发出用于收集此类系统训练数据的有效框架。本文提出了收集和生成用于训练自由输入分类系统的数据的方法。介绍了各种数据众包提示类型。提出了一种二元分类系统,提高了标注的保真度,使自由输入分类更加有效。最后,提出了一种基于二进制数据标注系统的数据生成算法。未来的工作将使用这里提出的数据众包和生成方法,在为文化能力培训设计的虚拟现实(VR)模拟中实现自由输入对话系统。
{"title":"Data Collection Methods for Building a Free Response Training Simulation","authors":"Vaibhav Sharma, Benjamin Shpringer, S. Yang, M. Bolger, Sodiq Adewole, D. Brown, Erfaneh Gharavi","doi":"10.1109/SIEDS.2019.8735621","DOIUrl":"https://doi.org/10.1109/SIEDS.2019.8735621","url":null,"abstract":"Most past research in the area of serious games for simulation has focused on games with constrained multiple-choice based dialogue systems. Recent advancements in natural language processing research make free-input text classification-based dialogue systems more feasible, but an effective framework for collecting training data for such systems has not yet been developed. This paper presents methods for collecting and generating data for training a free-input classification-based system. Various data crowdsourcing prompt types are presented. A binary category system, which increases the fidelity of the labeling to make free-input classification more effective, is presented. Finally, a data generation algorithm based on the binary data labeling system is presented. Future work will use the data crowdsourcing and generation methods presented here to implement a free-input dialogue system in a virtual reality (VR) simulation designed for cultural competency training.","PeriodicalId":265421,"journal":{"name":"2019 Systems and Information Engineering Design Symposium (SIEDS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126448807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Ideal Warrior and Robot Relations: Stress and Empathy's Role in Human-Robot Teaming 理想战士与机器人的关系:压力与共情在人-机器人团队中的作用
Pub Date : 2019-04-01 DOI: 10.1109/SIEDS.2019.8735613
J. Peterson, Chase Cohen, P. Harrison, Jonathan Novak, Chad C. Tossell, Elizabeth Phillips
The battlefield of the future will look very different than the battlefields of the past. Automated technologies are finding themselves more and more integrated into every aspect of the fight. As technology continues to advance, the United States Military must consider what a human-machine team will look like and how an optimal relationship between the two assets can be formed, especially under the stressful conditions that often characterize military contexts. For a human-machine team in a military context to work at maximum efficiency, an ideal level of empathy towards an automated teammate must be obtained. The goal of this study is to determine the effect stress can have on an individual's empathetic reaction toward a Pepper robot. Twenty-eight participants interacted with a Pepper robot either under stress or not. Empathy toward the robot was measured through subjective assessments as well as by participant decisions to continue interacting with Pepper even though doing so would harm the robot. Although not conclusive, the results suggest an interaction between participant gender and stress on empathy toward the Pepper robot. Women showed more empathy toward Pepper under higher levels of stress than lower levels of stress. However, the opposite was true for men. Men showed less empathy toward Pepper under higher levels of stress. The results of this study could help to inform military training and robot design.
未来的战场将与过去的战场大不相同。自动化技术发现自己越来越多地融入到战斗的各个方面。随着技术的不断进步,美国军方必须考虑人机团队将是什么样子,以及如何在这两种资产之间形成最佳关系,特别是在军事环境中经常出现的压力条件下。为了使军事环境中的人机团队以最高效率工作,必须获得对自动化队友的理想移情水平。这项研究的目的是确定压力对个人对Pepper机器人的移情反应的影响。28名参与者在压力下或没有压力的情况下与Pepper机器人互动。对机器人的同理心是通过主观评估和参与者决定继续与Pepper互动来衡量的,即使这样做会伤害机器人。虽然不是决定性的,但结果表明,参与者的性别和对Pepper机器人的同理心压力之间存在相互作用。女性在高压力下比低压力下对Pepper表现出更多的同理心。然而,男性的情况正好相反。压力越大,男性对佩珀的同情就越少。这项研究的结果可能有助于为军事训练和机器人设计提供信息。
{"title":"Ideal Warrior and Robot Relations: Stress and Empathy's Role in Human-Robot Teaming","authors":"J. Peterson, Chase Cohen, P. Harrison, Jonathan Novak, Chad C. Tossell, Elizabeth Phillips","doi":"10.1109/SIEDS.2019.8735613","DOIUrl":"https://doi.org/10.1109/SIEDS.2019.8735613","url":null,"abstract":"The battlefield of the future will look very different than the battlefields of the past. Automated technologies are finding themselves more and more integrated into every aspect of the fight. As technology continues to advance, the United States Military must consider what a human-machine team will look like and how an optimal relationship between the two assets can be formed, especially under the stressful conditions that often characterize military contexts. For a human-machine team in a military context to work at maximum efficiency, an ideal level of empathy towards an automated teammate must be obtained. The goal of this study is to determine the effect stress can have on an individual's empathetic reaction toward a Pepper robot. Twenty-eight participants interacted with a Pepper robot either under stress or not. Empathy toward the robot was measured through subjective assessments as well as by participant decisions to continue interacting with Pepper even though doing so would harm the robot. Although not conclusive, the results suggest an interaction between participant gender and stress on empathy toward the Pepper robot. Women showed more empathy toward Pepper under higher levels of stress than lower levels of stress. However, the opposite was true for men. Men showed less empathy toward Pepper under higher levels of stress. The results of this study could help to inform military training and robot design.","PeriodicalId":265421,"journal":{"name":"2019 Systems and Information Engineering Design Symposium (SIEDS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115066108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Exploratory Data Analysis of a Unified Host and Network Dataset 统一主机与网络数据集的探索性数据分析
Pub Date : 2019-04-01 DOI: 10.1109/SIEDS.2019.8735640
Catherine Beazley, Karan Gadiya, Ravi K U Rakesh, D. Roden, Boda Ye, Brendan Abraham, Donald E. Brown, M. Veeraraghavan
Exploratory data analysis is invaluable for understanding data, choosing correct models, and interpreting, validating, and applying results. It often leads to the discovery of patterns that can answer a number of research questions. In this paper, we perform exploratory data analysis on cybersecurity data in the NetFlow Dataset from “The Unified Host and Network Dataset”. “The Unified Host and Network Dataset” is a large, open source dataset collected on the Los Alamos National Laboratory (LANL) enterprise network that was published to encourage new research in cybersecurity. The NetFlow Dataset is a compilation of flow logs from routers within the LANL network that are aggregated to a relational format using network stitching. Our exploratory data analysis shows distinct patterns and clusters within a day of data. Specifically, scatter plots of the number of packets sent by the destination device versus the number of packets sent by the source device show three distinct, no-intercept linear relationships between the variables. The relationships suggest three common patterns for how the source device and destination device interactively send packets to each other. Our analysis also shows that byte and packet distributions of connections on rare ports and connections on common ports are statistically different, suggesting these differences can be used to discriminate between normal and abnormal network behavior. Our findings may be useful for research into classification problems with a Unified Host and Network Dataset and for furthering cluster analysis in cybersecurity research.
探索性数据分析对于理解数据、选择正确的模型以及解释、验证和应用结果是非常宝贵的。它通常会导致发现可以回答许多研究问题的模式。本文对来自“统一主机与网络数据集”的NetFlow数据集中的网络安全数据进行了探索性数据分析。“统一主机和网络数据集”是在洛斯阿拉莫斯国家实验室(LANL)企业网络上收集的大型开源数据集,旨在鼓励网络安全方面的新研究。NetFlow数据集是来自LANL网络内路由器的流量日志的汇编,这些日志使用网络拼接聚合为关系格式。我们的探索性数据分析在一天的数据中显示出不同的模式和集群。具体来说,目标设备发送的数据包数量与源设备发送的数据包数量的散点图显示了变量之间三个不同的、无截距的线性关系。这些关系为源设备和目的设备如何相互交互发送数据包提供了三种常见模式。我们的分析还表明,罕见端口上的连接和普通端口上的连接的字节和数据包分布在统计上是不同的,这表明这些差异可以用来区分正常和异常的网络行为。我们的发现可能有助于研究统一主机和网络数据集的分类问题,并进一步促进网络安全研究中的聚类分析。
{"title":"Exploratory Data Analysis of a Unified Host and Network Dataset","authors":"Catherine Beazley, Karan Gadiya, Ravi K U Rakesh, D. Roden, Boda Ye, Brendan Abraham, Donald E. Brown, M. Veeraraghavan","doi":"10.1109/SIEDS.2019.8735640","DOIUrl":"https://doi.org/10.1109/SIEDS.2019.8735640","url":null,"abstract":"Exploratory data analysis is invaluable for understanding data, choosing correct models, and interpreting, validating, and applying results. It often leads to the discovery of patterns that can answer a number of research questions. In this paper, we perform exploratory data analysis on cybersecurity data in the NetFlow Dataset from “The Unified Host and Network Dataset”. “The Unified Host and Network Dataset” is a large, open source dataset collected on the Los Alamos National Laboratory (LANL) enterprise network that was published to encourage new research in cybersecurity. The NetFlow Dataset is a compilation of flow logs from routers within the LANL network that are aggregated to a relational format using network stitching. Our exploratory data analysis shows distinct patterns and clusters within a day of data. Specifically, scatter plots of the number of packets sent by the destination device versus the number of packets sent by the source device show three distinct, no-intercept linear relationships between the variables. The relationships suggest three common patterns for how the source device and destination device interactively send packets to each other. Our analysis also shows that byte and packet distributions of connections on rare ports and connections on common ports are statistically different, suggesting these differences can be used to discriminate between normal and abnormal network behavior. Our findings may be useful for research into classification problems with a Unified Host and Network Dataset and for furthering cluster analysis in cybersecurity research.","PeriodicalId":265421,"journal":{"name":"2019 Systems and Information Engineering Design Symposium (SIEDS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132950327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2019 Systems and Information Engineering Design Symposium (SIEDS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1