{"title":"利用Apache Spark计算学生的特征向量进行人脸识别","authors":"D. Kariboz, A. Bogdanchikov, K. Orynbekova","doi":"10.1109/ICECCO48375.2019.9043282","DOIUrl":null,"url":null,"abstract":"With a massive increase of images and videos in the internet, processing these data for face detection and recognition is a bottleneck in current situation. At least there is a lack of frameworks and tools to process lots of data efficiently and not time consuming. System proposed in this paper offers face recognition framework based on Apache Spark to analyze lots images and videos from Instagram. As a people need to be searched and recognized are university students, their feature vectors used for recognition initially computed and can be enlarged at any moment. Spark’s Resilient Distributed Databases gives ability to compute all the intermediate data directly in memory making computations faster meanwhile keeping resilience property.","PeriodicalId":166322,"journal":{"name":"2019 15th International Conference on Electronics, Computer and Computation (ICECCO)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Computing feature vectors of students for face recognition using Apache Spark\",\"authors\":\"D. Kariboz, A. Bogdanchikov, K. Orynbekova\",\"doi\":\"10.1109/ICECCO48375.2019.9043282\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With a massive increase of images and videos in the internet, processing these data for face detection and recognition is a bottleneck in current situation. At least there is a lack of frameworks and tools to process lots of data efficiently and not time consuming. System proposed in this paper offers face recognition framework based on Apache Spark to analyze lots images and videos from Instagram. As a people need to be searched and recognized are university students, their feature vectors used for recognition initially computed and can be enlarged at any moment. Spark’s Resilient Distributed Databases gives ability to compute all the intermediate data directly in memory making computations faster meanwhile keeping resilience property.\",\"PeriodicalId\":166322,\"journal\":{\"name\":\"2019 15th International Conference on Electronics, Computer and Computation (ICECCO)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 15th International Conference on Electronics, Computer and Computation (ICECCO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECCO48375.2019.9043282\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 15th International Conference on Electronics, Computer and Computation (ICECCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECCO48375.2019.9043282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Computing feature vectors of students for face recognition using Apache Spark
With a massive increase of images and videos in the internet, processing these data for face detection and recognition is a bottleneck in current situation. At least there is a lack of frameworks and tools to process lots of data efficiently and not time consuming. System proposed in this paper offers face recognition framework based on Apache Spark to analyze lots images and videos from Instagram. As a people need to be searched and recognized are university students, their feature vectors used for recognition initially computed and can be enlarged at any moment. Spark’s Resilient Distributed Databases gives ability to compute all the intermediate data directly in memory making computations faster meanwhile keeping resilience property.