Image Information Collection System Based on Python Web Crawler Technology
Author: DongHao Jin
Journal: CONVERTER, volume 34 1
Published: 2021-07-28 (Journal Article)
DOI: 10.17762/converter.236 (https://doi.org/10.17762/converter.236)
Collecting data from the Internet is key to solving the problem of data sources. This paper studies and develops an image information collection system based on Python web crawler technology, which automates the collection of subject data. Using the urllib, Beautiful Soup, and threading libraries, we design and develop a system framework comprising data crawling, exception handling, robots protocol management, and multithreading management modules. A specific case study illustrates the data acquisition process. Experimental data show that, compared with traditional manual data acquisition, the proposed method greatly improves work efficiency.
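The abstract names four modules (data crawling, exception handling, robots protocol management, and multithreading management) but does not reproduce the paper's code. The following is only a minimal sketch of how such modules might fit together, under stated assumptions: every function and class name is hypothetical, and the standard library's html.parser stands in for Beautiful Soup so that the example has no third-party dependency. The fetch function is passed in as a parameter, so the pipeline can be exercised without network access.

```python
import threading
import urllib.error
import urllib.request
import urllib.robotparser
from html.parser import HTMLParser
from urllib.parse import urljoin


class ImageLinkParser(HTMLParser):
    """Collects the src of every <img> tag (assumed stand-in for Beautiful Soup)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.image_urls = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                # Resolve relative paths against the page URL.
                self.image_urls.append(urljoin(self.base_url, src))


def extract_image_urls(html, base_url):
    """Data crawling step: return absolute image URLs found in an HTML page."""
    parser = ImageLinkParser(base_url)
    parser.feed(html)
    return parser.image_urls


def build_robots_checker(robots_txt, user_agent="*"):
    """Robots protocol step: build a url -> bool check from robots.txt text."""
    rules = urllib.robotparser.RobotFileParser()
    rules.parse(robots_txt.splitlines())
    return lambda url: rules.can_fetch(user_agent, url)


def fetch_bytes(url, timeout=10):
    """Download one resource; errors propagate to the caller."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def crawl_images(urls, is_allowed, fetch=fetch_bytes, workers=4):
    """Fetch allowed URLs in worker threads; return (results, errors) dicts."""
    pending = [u for u in urls if is_allowed(u)]
    results, errors = {}, {}
    lock = threading.Lock()

    def worker():
        while True:
            with lock:
                if not pending:
                    return
                url = pending.pop()
            try:
                data = fetch(url)
            except (urllib.error.URLError, OSError, ValueError) as exc:
                # Exception handling step: record the failure and keep going.
                with lock:
                    errors[url] = exc
            else:
                with lock:
                    results[url] = data

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results, errors
```

In this sketch a single lock guards both the shared work list and the result dictionaries, which keeps the threading logic easy to follow; a queue.Queue would be a reasonable alternative. Calling crawl_images(extract_image_urls(page_html, page_url), build_robots_checker(robots_text)) would filter the image URLs through the robots rules and download the permitted ones in parallel, where page_html, page_url, and robots_text are values the caller would have to supply.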