Boqiang Zhang , Jinhao Yan , Yuhe Gao , GenLiang Yang , Kunpeng Zhang , Junwu Li
{"title":"Automatic granary sweeping strategy using visual large language model","authors":"Boqiang Zhang , Jinhao Yan , Yuhe Gao , GenLiang Yang , Kunpeng Zhang , Junwu Li","doi":"10.1016/j.jspr.2025.102619","DOIUrl":null,"url":null,"abstract":"<div><div>Food security is a fundamental element of human survival. Reducing grain losses and ensuring grain quality have extremely important practical implications. Enhancing the granary's intelligence is particularly important due to several issues affecting residue grain sweeping, including manual inefficiency, incomplete coverage, and expensive equipment. This work proposes a new method called the Residual Grain Sweeping Visual Large Mode (RGSVLM)<sup>1</sup> based on the Visual Large Language Model (VLLM). First, we constructed a semantic dataset containing images of various residual grain dispersal patterns captured in real granary environments. We also introduced an improved version of the Fast Segment Anything Model (FastSAM) algorithm to detect residual grains in the field images, extract visual features, and achieve accurate segmentation. In addition, we crafted prompt engineering that combines image data to produce corresponding textual datasets that effectively reflect the real-world situation. Next, we integrated this dataset with a chain of reasoning framework to fine-tune the visual large language model for specific tasks. This approach compensates for the original model's limitations in logical reasoning, enabling it to simulate human thought processes and generate clear and reasonable answers. In a granary environment, RGSVLM performs better than other models. This study's development and implementation of RGSVLM offers innovative concepts and techniques for building intelligent granaries.</div></div>","PeriodicalId":17019,"journal":{"name":"Journal of Stored Products Research","volume":"112 ","pages":"Article 102619"},"PeriodicalIF":2.7000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Stored Products Research","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022474X25000785","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENTOMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Food security is a fundamental element of human survival. Reducing grain losses and ensuring grain quality have extremely important practical implications. Enhancing the granary's intelligence is particularly important due to several issues affecting residue grain sweeping, including manual inefficiency, incomplete coverage, and expensive equipment. This work proposes a new method called the Residual Grain Sweeping Visual Large Mode (RGSVLM)1 based on the Visual Large Language Model (VLLM). First, we constructed a semantic dataset containing images of various residual grain dispersal patterns captured in real granary environments. We also introduced an improved version of the Fast Segment Anything Model (FastSAM) algorithm to detect residual grains in the field images, extract visual features, and achieve accurate segmentation. In addition, we crafted prompt engineering that combines image data to produce corresponding textual datasets that effectively reflect the real-world situation. Next, we integrated this dataset with a chain of reasoning framework to fine-tune the visual large language model for specific tasks. This approach compensates for the original model's limitations in logical reasoning, enabling it to simulate human thought processes and generate clear and reasonable answers. In a granary environment, RGSVLM performs better than other models. This study's development and implementation of RGSVLM offers innovative concepts and techniques for building intelligent granaries.
期刊介绍:
The Journal of Stored Products Research provides an international medium for the publication of both reviews and original results from laboratory and field studies on the preservation and safety of stored products, notably food stocks, covering storage-related problems from the producer through the supply chain to the consumer. Stored products are characterised by having relatively low moisture content and include raw and semi-processed foods, animal feedstuffs, and a range of other durable items, including materials such as clothing or museum artefacts.