Kasia M. Kenitz, Eric C. Orenstein, Clarissa R. Anderson, Alexander J. Barth, Christian Briseño-Avena, David A. Caron, Melissa L. Carter, Emily Eggleston, Peter J. S. Franks, James T. Fumo, Jules S. Jaffe, Kelsey A. McBeain, Anthony Odell, Kristi Seech, Rebecca Shipe, Jayme Smith, Darcy A. A. Taniguchi, Elizabeth L. Venrick, Andrew D. Barton
{"title":"Convening Expert Taxonomists to Build Image Libraries for Training Automated Classifiers","authors":"Kasia M. Kenitz, Eric C. Orenstein, Clarissa R. Anderson, Alexander J. Barth, Christian Briseño-Avena, David A. Caron, Melissa L. Carter, Emily Eggleston, Peter J. S. Franks, James T. Fumo, Jules S. Jaffe, Kelsey A. McBeain, Anthony Odell, Kristi Seech, Rebecca Shipe, Jayme Smith, Darcy A. A. Taniguchi, Elizabeth L. Venrick, Andrew D. Barton","doi":"10.1002/lob.10584","DOIUrl":null,"url":null,"abstract":"<p>Digital imaging technologies are increasingly used to study life in the ocean. To deal with the large volume of image data collected over space and time, scientists employ various machine learning and deep learning algorithms to perform automated image classification. Training of classifiers requires a large number of expertly curated sets of images, a time-consuming process that requires taxonomic knowledge and understanding of the local ecosystem. The creation of these labeled training sets is the critical bottleneck for building skillful automated classifiers. Here, we discuss how we overcame this barrier by leveraging taxonomic knowledge from a group of specialists in a workshop setting and suggest best practices for effectively organizing image annotation efforts. In our experience, this 2 day workshop proved very insightful and facilitated classification of over 4 years of plankton images obtained at Scripps Pier (La Jolla, CA), focusing on diatoms and dinoflagellates. We highlight the importance of facilitating a dialog between taxonomists and engineers to better integrate ecological goals with computational constraints, and encourage continuous involvement of taxonomic experts for successful implementation of automated classifiers.</p>","PeriodicalId":40008,"journal":{"name":"Limnology and Oceanography Bulletin","volume":"32 3","pages":"89-97"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/lob.10584","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Limnology and Oceanography Bulletin","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/lob.10584","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Digital imaging technologies are increasingly used to study life in the ocean. To deal with the large volume of image data collected over space and time, scientists employ various machine learning and deep learning algorithms to perform automated image classification. Training of classifiers requires a large number of expertly curated sets of images, a time-consuming process that requires taxonomic knowledge and understanding of the local ecosystem. The creation of these labeled training sets is the critical bottleneck for building skillful automated classifiers. Here, we discuss how we overcame this barrier by leveraging taxonomic knowledge from a group of specialists in a workshop setting and suggest best practices for effectively organizing image annotation efforts. In our experience, this 2 day workshop proved very insightful and facilitated classification of over 4 years of plankton images obtained at Scripps Pier (La Jolla, CA), focusing on diatoms and dinoflagellates. We highlight the importance of facilitating a dialog between taxonomists and engineers to better integrate ecological goals with computational constraints, and encourage continuous involvement of taxonomic experts for successful implementation of automated classifiers.
期刊介绍:
All past issues of the Limnology and Oceanography Bulletin are available online, including its predecessors Communications to Members and the ASLO Bulletin. Access to the current and previous volume is restricted to members and institutions with a subscription to the ASLO journals. All other issues are freely accessible without a subscription. As part of ASLO’s mission to disseminate and communicate knowledge in the aquatic sciences.