Sen Shen;Jing Han;Klodian Bardhi;Haiyuan Li;Ruizhi Yang;Yiran Teng;Vaigai Yokar;Shuangyi Yan;Dimitra Simeonidou
{"title":"Unified monitoring and telemetry platform supporting network intelligence in optical networks","authors":"Sen Shen;Jing Han;Klodian Bardhi;Haiyuan Li;Ruizhi Yang;Yiran Teng;Vaigai Yokar;Shuangyi Yan;Dimitra Simeonidou","doi":"10.1364/JOCN.538552","DOIUrl":null,"url":null,"abstract":"In recent years, machine-learning (ML) applications have generated considerable interest and shown great potential in optimizing optical network management, such as quality of transmission estimation, traffic prediction, and resource allocation. However, these applications often require large datasets for training, inference, and updating, while network operators are generally reluctant to disclose their data due to privacy concerns and the sensitivity of operational information. Most open-source datasets typically lack transparency regarding network specifics, such as topology details and device configurations, making data acquisition and ML model training more difficult. In response, this paper presents a unified monitoring and telemetry platform that leverages distributed and centralized time-series databases on InfluxDB, a Kafka-based telemetry pipeline, and advanced ML applications. The separation of distributed and centralized databases improves data management flexibility and scalability. The Kafka-based telemetry pipeline ensures high-throughput, low-latency data streaming with end-to-end latency under 0.05 s through optimized partitioning. Additionally, integrating Kafka and InfluxDB allows for real-time data visualization from multiple sources, improving transparency and supporting real-time data streaming for network applications. By implementing this advanced telemetry and ML architecture, network operators can build a more intelligent, responsive, and resilient optical network infrastructure.","PeriodicalId":50103,"journal":{"name":"Journal of Optical Communications and Networking","volume":"17 2","pages":"139-151"},"PeriodicalIF":4.0000,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Optical Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10856707/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, machine-learning (ML) applications have generated considerable interest and shown great potential in optimizing optical network management, such as quality of transmission estimation, traffic prediction, and resource allocation. However, these applications often require large datasets for training, inference, and updating, while network operators are generally reluctant to disclose their data due to privacy concerns and the sensitivity of operational information. Most open-source datasets typically lack transparency regarding network specifics, such as topology details and device configurations, making data acquisition and ML model training more difficult. In response, this paper presents a unified monitoring and telemetry platform that leverages distributed and centralized time-series databases on InfluxDB, a Kafka-based telemetry pipeline, and advanced ML applications. The separation of distributed and centralized databases improves data management flexibility and scalability. The Kafka-based telemetry pipeline ensures high-throughput, low-latency data streaming with end-to-end latency under 0.05 s through optimized partitioning. Additionally, integrating Kafka and InfluxDB allows for real-time data visualization from multiple sources, improving transparency and supporting real-time data streaming for network applications. By implementing this advanced telemetry and ML architecture, network operators can build a more intelligent, responsive, and resilient optical network infrastructure.
期刊介绍:
The scope of the Journal includes advances in the state-of-the-art of optical networking science, technology, and engineering. Both theoretical contributions (including new techniques, concepts, analyses, and economic studies) and practical contributions (including optical networking experiments, prototypes, and new applications) are encouraged. Subareas of interest include the architecture and design of optical networks, optical network survivability and security, software-defined optical networking, elastic optical networks, data and control plane advances, network management related innovation, and optical access networks. Enabling technologies and their applications are suitable topics only if the results are shown to directly impact optical networking beyond simple point-to-point networks.