Author: A. Adamov
Published in: 2014 IEEE 8th International Conference on Application of Information and Communication Technologies (AICT), 2014-10-01
DOI: 10.1109/ICAICT.2014.7035947 (https://doi.org/10.1109/ICAICT.2014.7035947)
Data mining and analysis in depth. Case study of Qafqaz University HTTP server log analysis
Internet services, web and mobile applications, and pervasive communication are widely available today; they meet many of our needs and stimulate the production of tremendous amounts of data. Over 90% of this information is unstructured, meaning that the data has no predefined structure or model. Unstructured data is generally of little use unless data mining or data extraction techniques are applied to it: data has value only if we are able to process and understand it. Although a small part of this volume is structured (logs) or semi-structured (email, websites), even this data is difficult to process and manage without advanced data analytics techniques. This paper provides an example of applying data mining and analysis techniques to data generated by HTTP server logs. Experimental results show that the proposed analysis approach, based on regular expressions, is highly efficient and flexible. The results of such analysis benefit any company concerned about the effectiveness of its Internet presence, giving it important information based on real data.
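To make the regular-expression approach concrete, here is a minimal sketch of how HTTP server log lines might be parsed and summarized. The abstract does not give the paper's actual patterns or log layout, so everything here is an assumption for illustration: the Apache/NCSA Combined Log Format, the names `LOG_PATTERN`, `parse_line`, and `summarize`, and the sample lines, which are invented rather than taken from the Qafqaz University logs.

```python
import re
from collections import Counter

# Assumed layout: Apache/NCSA Combined Log Format (referer and user agent
# optional). The paper's real log format and patterns are not stated.
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ (?P<user>\S+) \[(?P<time>[^\]]+)\] '
    r'"(?P<method>[A-Z]+) (?P<path>\S+) (?P<protocol>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-)'
    r'(?: "(?P<referer>[^"]*)" "(?P<agent>[^"]*)")?'
)


def parse_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    record = match.groupdict()
    record["status"] = int(record["status"])
    # A size of "-" means no response body was sent.
    record["size"] = 0 if record["size"] == "-" else int(record["size"])
    return record


def summarize(lines):
    """Count requests per path and per status code; also count unparsable lines."""
    paths, statuses, skipped = Counter(), Counter(), 0
    for line in lines:
        record = parse_line(line)
        if record is None:
            skipped += 1
            continue
        paths[record["path"]] += 1
        statuses[record["status"]] += 1
    return paths, statuses, skipped


# Invented sample lines, for illustration only.
SAMPLE = [
    '192.0.2.1 - - [01/Oct/2014:10:00:00 +0400] "GET /index.html HTTP/1.1" 200 1024 "-" "Mozilla/5.0"',
    '192.0.2.2 - - [01/Oct/2014:10:00:05 +0400] "GET /missing HTTP/1.1" 404 - "-" "Mozilla/5.0"',
    '192.0.2.1 - - [01/Oct/2014:10:00:09 +0400] "GET /index.html HTTP/1.1" 200 1024',
    'not a log line',
]

if __name__ == "__main__":
    paths, statuses, skipped = summarize(SAMPLE)
    print(paths.most_common(3), dict(statuses), skipped)
```

Named groups keep the pattern readable and make it easy to add or drop fields, which is one plausible reading of the flexibility the abstract claims; lines that do not match are counted rather than silently discarded.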