用于开发和测试光通信系统 ML 模型的实验数据集

IF 4 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Journal of Optical Communications and Networking Pub Date : 2024-10-04 DOI:10.1364/JOCN.531788
Caio Santos;Abdelrahmane Moawad;Behnam Shariati;Robert Emmerich;Pooyan Safari;Colja Schubert;Johannes K. Fischer
{"title":"用于开发和测试光通信系统 ML 模型的实验数据集","authors":"Caio Santos;Abdelrahmane Moawad;Behnam Shariati;Robert Emmerich;Pooyan Safari;Colja Schubert;Johannes K. Fischer","doi":"10.1364/JOCN.531788","DOIUrl":null,"url":null,"abstract":"Due to the scarcity of diverse and well-organized public datasets, individual research organizations are often forced to develop and utilize their own datasets. However, the utilization of machine learning (ML) models in optical communications and networks heavily depends on the existence of high-quality datasets, especially covering the various parameters to be optimized in wavelength-division multiplexing (WDM) systems. In this work, we present a public dataset for developing and testing ML models. The dataset is developed in a laboratory setting and includes 12,672 samples including data points with different modulation formats, symbol rates, distances, WDM channel allocation profiles, etc. Each data point offers more than 60 features, revealing almost every aspect of the transmission setup. Moreover, we provide optical spectra of the entire C-band as well as a constellation diagram of the channel under test for all the data points. The diversity and extensiveness of the dataset alongside a well-structured document would allow plenty of use-cases and studies to be carried out covering quality of transmission (QoT) studies, optical spectrum analysis, constellation diagram modeling, digital twin evaluation, etc. Similar to our previous efforts, the current dataset aims to facilitate collaboration by offering a way for fair comparison of research outcomes in data analysis within the domain of optical communication systems.","PeriodicalId":50103,"journal":{"name":"Journal of Optical Communications and Networking","volume":"16 11","pages":"G1-G10"},"PeriodicalIF":4.0000,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Experimental dataset for developing and testing ML models in optical communication systems\",\"authors\":\"Caio Santos;Abdelrahmane Moawad;Behnam Shariati;Robert Emmerich;Pooyan Safari;Colja Schubert;Johannes K. Fischer\",\"doi\":\"10.1364/JOCN.531788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to the scarcity of diverse and well-organized public datasets, individual research organizations are often forced to develop and utilize their own datasets. However, the utilization of machine learning (ML) models in optical communications and networks heavily depends on the existence of high-quality datasets, especially covering the various parameters to be optimized in wavelength-division multiplexing (WDM) systems. In this work, we present a public dataset for developing and testing ML models. The dataset is developed in a laboratory setting and includes 12,672 samples including data points with different modulation formats, symbol rates, distances, WDM channel allocation profiles, etc. Each data point offers more than 60 features, revealing almost every aspect of the transmission setup. Moreover, we provide optical spectra of the entire C-band as well as a constellation diagram of the channel under test for all the data points. The diversity and extensiveness of the dataset alongside a well-structured document would allow plenty of use-cases and studies to be carried out covering quality of transmission (QoT) studies, optical spectrum analysis, constellation diagram modeling, digital twin evaluation, etc. Similar to our previous efforts, the current dataset aims to facilitate collaboration by offering a way for fair comparison of research outcomes in data analysis within the domain of optical communication systems.\",\"PeriodicalId\":50103,\"journal\":{\"name\":\"Journal of Optical Communications and Networking\",\"volume\":\"16 11\",\"pages\":\"G1-G10\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Optical Communications and Networking\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10705755/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Optical Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10705755/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

摘要

由于缺乏种类繁多、组织有序的公共数据集,各个研究机构往往不得不开发和利用自己的数据集。然而,机器学习(ML)模型在光通信和网络中的应用在很大程度上依赖于高质量数据集的存在,尤其是涵盖波分复用(WDM)系统中需要优化的各种参数的数据集。在这项工作中,我们提出了一个用于开发和测试 ML 模型的公共数据集。该数据集是在实验室环境中开发的,包含 12672 个样本,包括不同调制格式、符号率、距离、波分复用信道分配情况等数据点。每个数据点提供 60 多个特征,揭示了传输设置的几乎所有方面。此外,我们还为所有数据点提供了整个 C 波段的光学光谱以及被测信道的星座图。数据集的多样性和广泛性以及结构良好的文档将允许开展大量的用例和研究,包括传输质量(QoT)研究、光学频谱分析、星座图建模、数字孪生评估等。与我们之前的工作类似,当前的数据集旨在通过提供一种公平比较光通信系统领域内数据分析研究成果的方法来促进合作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Experimental dataset for developing and testing ML models in optical communication systems
Due to the scarcity of diverse and well-organized public datasets, individual research organizations are often forced to develop and utilize their own datasets. However, the utilization of machine learning (ML) models in optical communications and networks heavily depends on the existence of high-quality datasets, especially covering the various parameters to be optimized in wavelength-division multiplexing (WDM) systems. In this work, we present a public dataset for developing and testing ML models. The dataset is developed in a laboratory setting and includes 12,672 samples including data points with different modulation formats, symbol rates, distances, WDM channel allocation profiles, etc. Each data point offers more than 60 features, revealing almost every aspect of the transmission setup. Moreover, we provide optical spectra of the entire C-band as well as a constellation diagram of the channel under test for all the data points. The diversity and extensiveness of the dataset alongside a well-structured document would allow plenty of use-cases and studies to be carried out covering quality of transmission (QoT) studies, optical spectrum analysis, constellation diagram modeling, digital twin evaluation, etc. Similar to our previous efforts, the current dataset aims to facilitate collaboration by offering a way for fair comparison of research outcomes in data analysis within the domain of optical communication systems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.40
自引率
16.00%
发文量
104
审稿时长
4 months
期刊介绍: The scope of the Journal includes advances in the state-of-the-art of optical networking science, technology, and engineering. Both theoretical contributions (including new techniques, concepts, analyses, and economic studies) and practical contributions (including optical networking experiments, prototypes, and new applications) are encouraged. Subareas of interest include the architecture and design of optical networks, optical network survivability and security, software-defined optical networking, elastic optical networks, data and control plane advances, network management related innovation, and optical access networks. Enabling technologies and their applications are suitable topics only if the results are shown to directly impact optical networking beyond simple point-to-point networks.
期刊最新文献
Introduction to the Benchmarking in Optical Networks Special Issue Protocol-aware approach for mitigating radiation-induced errors in free-space optical downlinks Security enhancement for NOMA-PON with 2D cellular automata and Turing pattern cascading scramble aided fixed-point extended logistic chaotic encryption In-network stable radix sorter using many FPGAs with high-bandwidth photonics [Invited] Power-consumption analysis for different IPoWDM network architectures with ZR/ZR+ and long-haul muxponders
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1