In vivo localization of larval zebrafish’s cardiac chambers in lightsheet fluorescence microscopy using a customized reward function module-incorporated Deep-Q-Network model for reinforcement learning
{"title":"In vivo localization of larval zebrafish’s cardiac chambers in lightsheet fluorescence microscopy using a customized reward function module-incorporated Deep-Q-Network model for reinforcement learning","authors":"Hao-Hsuan Chung , Jen-Jee Chen , Huai-Jen Tsai , Po-Sheng Hu","doi":"10.1016/j.ijleo.2025.172255","DOIUrl":null,"url":null,"abstract":"<div><div>Integrating deep learning techniques into optical imaging systems has ramped up the procedures of biomedical investigation and allowed an unprecedented number of ways to acquiring experimental data. This study attempts to apply a Deep-Q Network (DQN)-based reinforcement learning technique to automatically locate the cross section of the ventricular chamber of zebrafish larvae in a light-sheet fluorescence microscopy. Experimentally, a total of 920 cardiac images of zebrafish larvae were acquired, pre-formatted and manually annotated to ensure the maximal utilization and precise interpretation of the data. Subsequently, a YOLOv5 algorithm was used for image recognition, which were implemented as the state of the custom-created environment in reinforcement learning. A reward score can then be derived through the controlling of the motorized moving platform and used to train a positioning model based on the Deep-Q Network (DQN). This reinforcement learning model not only can learn and improve from the past experience, but also optimize its dynamic behavior on the basis of the designs of a reward formula and an architecture of the learning strategy. Such reward feedback system dynamically directs the sample-loaded moving platform toward an image cross-section of larval zebrafish's cardiac chamber with the best contrast and clarity. The training results indicate that the object-searching DQN model is capable of precisely allocating the optimal cross-section of the ventricular chamber, can achieve the task with a success rate of 96 % in 600 training episodes, and has the advantages of efficiency and precision when compared with the Greedy method, a random strategy and a modified reward formula.</div></div>","PeriodicalId":19513,"journal":{"name":"Optik","volume":"326 ","pages":"Article 172255"},"PeriodicalIF":3.1000,"publicationDate":"2025-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optik","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0030402625000439","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
Integrating deep learning techniques into optical imaging systems has ramped up the procedures of biomedical investigation and allowed an unprecedented number of ways to acquiring experimental data. This study attempts to apply a Deep-Q Network (DQN)-based reinforcement learning technique to automatically locate the cross section of the ventricular chamber of zebrafish larvae in a light-sheet fluorescence microscopy. Experimentally, a total of 920 cardiac images of zebrafish larvae were acquired, pre-formatted and manually annotated to ensure the maximal utilization and precise interpretation of the data. Subsequently, a YOLOv5 algorithm was used for image recognition, which were implemented as the state of the custom-created environment in reinforcement learning. A reward score can then be derived through the controlling of the motorized moving platform and used to train a positioning model based on the Deep-Q Network (DQN). This reinforcement learning model not only can learn and improve from the past experience, but also optimize its dynamic behavior on the basis of the designs of a reward formula and an architecture of the learning strategy. Such reward feedback system dynamically directs the sample-loaded moving platform toward an image cross-section of larval zebrafish's cardiac chamber with the best contrast and clarity. The training results indicate that the object-searching DQN model is capable of precisely allocating the optimal cross-section of the ventricular chamber, can achieve the task with a success rate of 96 % in 600 training episodes, and has the advantages of efficiency and precision when compared with the Greedy method, a random strategy and a modified reward formula.
期刊介绍:
Optik publishes articles on all subjects related to light and electron optics and offers a survey on the state of research and technical development within the following fields:
Optics:
-Optics design, geometrical and beam optics, wave optics-
Optical and micro-optical components, diffractive optics, devices and systems-
Photoelectric and optoelectronic devices-
Optical properties of materials, nonlinear optics, wave propagation and transmission in homogeneous and inhomogeneous materials-
Information optics, image formation and processing, holographic techniques, microscopes and spectrometer techniques, and image analysis-
Optical testing and measuring techniques-
Optical communication and computing-
Physiological optics-
As well as other related topics.