Data mining ocean model output at the Naval Oceanographic Office Shared Resource Center

OCEANS '02 MTS/IEEE Pub Date : 2002-10-29 DOI:10.1109/OCEANS.2002.1192064

P. Gruzinskas, A. Haas, L. Goon

{"title":"Data mining ocean model output at the Naval Oceanographic Office Shared Resource Center","authors":"P. Gruzinskas, A. Haas, L. Goon","doi":"10.1109/OCEANS.2002.1192064","DOIUrl":null,"url":null,"abstract":"One of the computational technology areas supported by the High Performance Computing Modernization Program is Climate, Weather, and Ocean (CWO) modeling. To this end, state-of-art computing architectures are leveraged against the extremely difficult problem of mathematically modeling and predicting the behavior of a variety of ocean climatological parameters. The problem at hand is the technology to store, retrieve, manipulate, and display these data has not kept pace with the computational technology. During the last five years, we have seen significant cost reductions associated with applying the status quo in visualization techniques to scientific data sets. This is due in large part to the computer gaming industry, driven by the huge profit margins associated with that market. The scientific community has benefited by these advances in low-cost architectures, but only as a by-product of its original intent, which is entertainment. Even so, these low-cost architectures are not designed to handle the scale of data sizes presented by the scientific community and serve only to make inadequate techniques cheaper to field and use. The Naval Oceanographic Office Major Shared Resource Center (NAVO MSRC) Visualization Center is challenged with providing its users state-of-the-art analysis environments for the interrogation of their increasingly large data sets. This paper deals with the data generated by the CWO community, all of whom work with large domains and high resolutions (either vertically, horizontally, or both) that all vary over time. This leads to very large data sets (rows columns layers attribute per cell) for each time step and can challenge even the most powerful architectures when trying to extract or \"mine\" information from the raw data. As in most visualization applications, the model output deals with physical parameters that are invisible to the naked eye. This means effective methods of display are required for ocean circulation or currents, sea surface height, temperature, salinity, and so on. One analogy, which no doubt started the concept of \"data mining\", is that the raw data represent a huge block of ore from which gold nuggets of valuable information (features) must be extracted or mined. This paper concerns the technical solutions that were built to solve the challenges described above, including algorithms, data descriptions, and formats.","PeriodicalId":431594,"journal":{"name":"OCEANS '02 MTS/IEEE","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"OCEANS '02 MTS/IEEE","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/OCEANS.2002.1192064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

One of the computational technology areas supported by the High Performance Computing Modernization Program is Climate, Weather, and Ocean (CWO) modeling. To this end, state-of-art computing architectures are leveraged against the extremely difficult problem of mathematically modeling and predicting the behavior of a variety of ocean climatological parameters. The problem at hand is the technology to store, retrieve, manipulate, and display these data has not kept pace with the computational technology. During the last five years, we have seen significant cost reductions associated with applying the status quo in visualization techniques to scientific data sets. This is due in large part to the computer gaming industry, driven by the huge profit margins associated with that market. The scientific community has benefited by these advances in low-cost architectures, but only as a by-product of its original intent, which is entertainment. Even so, these low-cost architectures are not designed to handle the scale of data sizes presented by the scientific community and serve only to make inadequate techniques cheaper to field and use. The Naval Oceanographic Office Major Shared Resource Center (NAVO MSRC) Visualization Center is challenged with providing its users state-of-the-art analysis environments for the interrogation of their increasingly large data sets. This paper deals with the data generated by the CWO community, all of whom work with large domains and high resolutions (either vertically, horizontally, or both) that all vary over time. This leads to very large data sets (rows columns layers attribute per cell) for each time step and can challenge even the most powerful architectures when trying to extract or "mine" information from the raw data. As in most visualization applications, the model output deals with physical parameters that are invisible to the naked eye. This means effective methods of display are required for ocean circulation or currents, sea surface height, temperature, salinity, and so on. One analogy, which no doubt started the concept of "data mining", is that the raw data represent a huge block of ore from which gold nuggets of valuable information (features) must be extracted or mined. This paper concerns the technical solutions that were built to solve the challenges described above, including algorithms, data descriptions, and formats.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

海军海洋学办公室共享资源中心的数据挖掘海洋模型输出

高性能计算现代化计划支持的计算技术领域之一是气候、天气和海洋(CWO)建模。为此，利用最先进的计算架构来解决数学建模和预测各种海洋气候参数的行为这一极其困难的问题。当前的问题是存储、检索、操作和显示这些数据的技术没有跟上计算技术的步伐。在过去的五年中，我们看到将可视化技术应用于科学数据集的现状显著降低了成本。这在很大程度上要归功于电脑游戏行业，该行业的巨大利润空间推动了这一市场的发展。科学界从这些低成本架构的进步中受益，但这只是其最初意图(娱乐)的副产品。即便如此，这些低成本的架构并不是为了处理科学界呈现的数据规模而设计的，而只是为了使不充分的技术在现场和使用上更便宜。海军海洋学办公室主要共享资源中心(NAVO MSRC)可视化中心面临的挑战是为用户提供最先进的分析环境，以查询他们日益庞大的数据集。本文处理由CWO社区生成的数据，所有这些社区都处理随时间变化的大域和高分辨率(垂直、水平或两者兼而有之)。这将导致每个时间步产生非常大的数据集(每个单元格的行、列、层、属性)，并且在尝试从原始数据中提取或“挖掘”信息时，即使是最强大的架构也会受到挑战。与大多数可视化应用程序一样，模型输出处理肉眼看不见的物理参数。这意味着需要有效的方法来显示海洋环流或洋流、海面高度、温度、盐度等。一个类比无疑开创了“数据挖掘”的概念，原始数据代表了一块巨大的矿石，必须从中提取或挖掘有价值的信息(特征)的金块。本文关注为解决上述挑战而构建的技术解决方案，包括算法、数据描述和格式。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

OCEANS '02 MTS/IEEE

自引率

0.00%

发文量

期刊最新文献

Wave height measurements using acoustic surface tracking Implementation of an important wave model on parallel architectures The New Jersey Shelf Observing System Automated model-based localization of marine mammals near California SEA-COOS: Southeast Atlantic Coastal Ocean Observing System