Marjolaine Matabos , Pierre Cottais , Riwan Leroux , Yannick Cenatiempo , Charlotte Gasne-Destaville , Nicolas Roullet , Jozée Sarrazin , Julie Tourolle , Catherine Borremans
{"title":"深海间谍:促进科学和海洋知识普及的在线公民科学注释平台","authors":"Marjolaine Matabos , Pierre Cottais , Riwan Leroux , Yannick Cenatiempo , Charlotte Gasne-Destaville , Nicolas Roullet , Jozée Sarrazin , Julie Tourolle , Catherine Borremans","doi":"10.1016/j.ecoinf.2025.103065","DOIUrl":null,"url":null,"abstract":"<div><div>The recent development of deep-sea observatories has enabled the acquisition of high temporal resolution imagery for studying the dynamics of deep-sea communities on hourly to multi-decadal scales. These unprecedented datasets offer valuable insight into the variation of species abundance and biology in relation to changes in environmental conditions. Since 2010, camera systems deployed at hydrothermal vents have acquired over 11 terabytes (TB) of data that cannot be processed by research labs only. Although deep learning offers an alternative to human processing, training algorithms requires large annotated reference datasets. The Deep Sea Spy project allows citizens to contribute to the annotation of pictures acquired with underwater platforms. Based on approximately 4000 photos, each annotated 10 times by independent participants, we were able to develop a data validation workflow that can be applied to similar databases. We compared these annotations with expert-annotated data and analysed the agreement rate among participants for each of the 15,000 annotated individual organisms to optimise the robustness and confidence level in non-expert citizen science. The optimal number of repeat annotations per photo was also analysed to guide the definition of a trade-off between the accuracy and amount of data. An agreement rate of 0.4 (i.e., 4 out of 10 participants detecting one given individual) was established as an efficient threshold to reach counts similar to that obtained from an expert. One important result lies in the robustness of the temporal trends of species abundance as revealed by time-series analyses. Regarding the number of times a photo needs to be annotated, results varied greatly depending on the target species and the difficulty of the associated task. Finally, we present the communication tools and actions deployed during the project and how the platform can serve educational and decision-making purposes. Deep Sea Spy and the proposed workflow have a strong potential to enhance marine environmental observation and monitoring.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103065"},"PeriodicalIF":5.8000,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep sea spy: An online citizen science annotation platform for science and ocean literacy\",\"authors\":\"Marjolaine Matabos , Pierre Cottais , Riwan Leroux , Yannick Cenatiempo , Charlotte Gasne-Destaville , Nicolas Roullet , Jozée Sarrazin , Julie Tourolle , Catherine Borremans\",\"doi\":\"10.1016/j.ecoinf.2025.103065\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The recent development of deep-sea observatories has enabled the acquisition of high temporal resolution imagery for studying the dynamics of deep-sea communities on hourly to multi-decadal scales. These unprecedented datasets offer valuable insight into the variation of species abundance and biology in relation to changes in environmental conditions. Since 2010, camera systems deployed at hydrothermal vents have acquired over 11 terabytes (TB) of data that cannot be processed by research labs only. Although deep learning offers an alternative to human processing, training algorithms requires large annotated reference datasets. The Deep Sea Spy project allows citizens to contribute to the annotation of pictures acquired with underwater platforms. Based on approximately 4000 photos, each annotated 10 times by independent participants, we were able to develop a data validation workflow that can be applied to similar databases. We compared these annotations with expert-annotated data and analysed the agreement rate among participants for each of the 15,000 annotated individual organisms to optimise the robustness and confidence level in non-expert citizen science. The optimal number of repeat annotations per photo was also analysed to guide the definition of a trade-off between the accuracy and amount of data. An agreement rate of 0.4 (i.e., 4 out of 10 participants detecting one given individual) was established as an efficient threshold to reach counts similar to that obtained from an expert. One important result lies in the robustness of the temporal trends of species abundance as revealed by time-series analyses. Regarding the number of times a photo needs to be annotated, results varied greatly depending on the target species and the difficulty of the associated task. Finally, we present the communication tools and actions deployed during the project and how the platform can serve educational and decision-making purposes. Deep Sea Spy and the proposed workflow have a strong potential to enhance marine environmental observation and monitoring.</div></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":\"86 \",\"pages\":\"Article 103065\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-02-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954125000743\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125000743","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
Deep sea spy: An online citizen science annotation platform for science and ocean literacy
The recent development of deep-sea observatories has enabled the acquisition of high temporal resolution imagery for studying the dynamics of deep-sea communities on hourly to multi-decadal scales. These unprecedented datasets offer valuable insight into the variation of species abundance and biology in relation to changes in environmental conditions. Since 2010, camera systems deployed at hydrothermal vents have acquired over 11 terabytes (TB) of data that cannot be processed by research labs only. Although deep learning offers an alternative to human processing, training algorithms requires large annotated reference datasets. The Deep Sea Spy project allows citizens to contribute to the annotation of pictures acquired with underwater platforms. Based on approximately 4000 photos, each annotated 10 times by independent participants, we were able to develop a data validation workflow that can be applied to similar databases. We compared these annotations with expert-annotated data and analysed the agreement rate among participants for each of the 15,000 annotated individual organisms to optimise the robustness and confidence level in non-expert citizen science. The optimal number of repeat annotations per photo was also analysed to guide the definition of a trade-off between the accuracy and amount of data. An agreement rate of 0.4 (i.e., 4 out of 10 participants detecting one given individual) was established as an efficient threshold to reach counts similar to that obtained from an expert. One important result lies in the robustness of the temporal trends of species abundance as revealed by time-series analyses. Regarding the number of times a photo needs to be annotated, results varied greatly depending on the target species and the difficulty of the associated task. Finally, we present the communication tools and actions deployed during the project and how the platform can serve educational and decision-making purposes. Deep Sea Spy and the proposed workflow have a strong potential to enhance marine environmental observation and monitoring.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.