Pub Date : 2015-02-01DOI: 10.1109/ICOSC.2015.7050815
Ke Ding, Nicholas Pantic, You Lu, S. Manna, M. Husain
Phishing attacks are a form of social engineering technique used for stealing private information from users through emails. A general approach for phishing susceptibility analysis is to profile the user's personality using personality models such as the Five Factor Model (FFM) and find out the susceptibility for a set of phishing attempts. The FFM is a personality profiling system that scores participants on five separate personality traits: openness to experience (O), conscientiousness (C), extraversion (E), agreeableness (A), and neuroticism (N). However, existing approaches don't take into account the fact that based on the content, for example, a phishing email offering an enticing free prize might be very effective on a dominant O-personality (curious, open to new experience), but not to an N-personality (tendency of experiencing negative emotion). Therefore, it is necessary to consider the personality bias of the phishing email contents during the susceptibility analysis. In this paper, we have proposed a method to construct a dictionary based on the semantic similarity of prospective words describing the FFM. Words generated through this dictionary can be used to label the phishing emails according to the personality bias and serve as the key component of a personality bias classification system of phishing emails. We have validated our dictionary construction using a large public corpus of phishing email data which shows the potential of the proposed system in anti-phishing research.
{"title":"Towards building a word similarity dictionary for personality bias classification of phishing email contents","authors":"Ke Ding, Nicholas Pantic, You Lu, S. Manna, M. Husain","doi":"10.1109/ICOSC.2015.7050815","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050815","url":null,"abstract":"Phishing attacks are a form of social engineering technique used for stealing private information from users through emails. A general approach for phishing susceptibility analysis is to profile the user's personality using personality models such as the Five Factor Model (FFM) and find out the susceptibility for a set of phishing attempts. The FFM is a personality profiling system that scores participants on five separate personality traits: openness to experience (O), conscientiousness (C), extraversion (E), agreeableness (A), and neuroticism (N). However, existing approaches don't take into account the fact that based on the content, for example, a phishing email offering an enticing free prize might be very effective on a dominant O-personality (curious, open to new experience), but not to an N-personality (tendency of experiencing negative emotion). Therefore, it is necessary to consider the personality bias of the phishing email contents during the susceptibility analysis. In this paper, we have proposed a method to construct a dictionary based on the semantic similarity of prospective words describing the FFM. Words generated through this dictionary can be used to label the phishing emails according to the personality bias and serve as the key component of a personality bias classification system of phishing emails. We have validated our dictionary construction using a large public corpus of phishing email data which shows the potential of the proposed system in anti-phishing research.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"165 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114182198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2015-02-01DOI: 10.1109/ICOSC.2015.7050793
M. Vora
In a service oriented world, performance plays a vital role in the success of any IT system. For an application running in a production environment, whenever there is a change in the workload or workload pattern, utilization of major server resources like cpus, disks, memory, network etc. will also change. In this paper, we are extending our methodology to estimate the server resource utilization for any given workload pattern by extracting the optimal information from the historic production logs (application logs and resource utilization or system monitoring logs) using a specifically designed genetic algorithm. Across all experimental validations, we find the average absolute error in estimating utilization of server resources was less than 15%. Unlike traditional approaches to estimate overall resource utilization, method presented here, neither requires to estimate service demands for each individual application functions nor does it require to benchmark individual business transactions. Only necessary input to the model is the application logs containing the information about the throughput (for example an access log in case of web application) and system monitoring logs containing aggregate resource utilization information.
{"title":"Workload pattern analysis","authors":"M. Vora","doi":"10.1109/ICOSC.2015.7050793","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050793","url":null,"abstract":"In a service oriented world, performance plays a vital role in the success of any IT system. For an application running in a production environment, whenever there is a change in the workload or workload pattern, utilization of major server resources like cpus, disks, memory, network etc. will also change. In this paper, we are extending our methodology to estimate the server resource utilization for any given workload pattern by extracting the optimal information from the historic production logs (application logs and resource utilization or system monitoring logs) using a specifically designed genetic algorithm. Across all experimental validations, we find the average absolute error in estimating utilization of server resources was less than 15%. Unlike traditional approaches to estimate overall resource utilization, method presented here, neither requires to estimate service demands for each individual application functions nor does it require to benchmark individual business transactions. Only necessary input to the model is the application logs containing the information about the throughput (for example an access log in case of web application) and system monitoring logs containing aggregate resource utilization information.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123968558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2015-02-01DOI: 10.1109/ICOSC.2015.7050818
Qingliang Miao, Yao Meng, Lu Fang, Fumihito Nishino, N. Igata
Scientific publication management services are changing drastically. On the one hand, researchers demand intelligent search services to discover scientific publications. On the other hand, publishers need to incorporate semantic information to better organize their digital assets and make publications more discoverable. For this purpose, we investigate how to manage scientific publications using Linked Data and introduce FELinker, an entity linking component that links scientific publications with DBPedia. In particular, this paper introduces advantages of linking scientific publications with Linked Data, discusses major challenges, and outlines the proposed method. Experiment shows the proposed method could get promising performance in scientific publication linkage.
{"title":"Link scientific publications using linked data","authors":"Qingliang Miao, Yao Meng, Lu Fang, Fumihito Nishino, N. Igata","doi":"10.1109/ICOSC.2015.7050818","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050818","url":null,"abstract":"Scientific publication management services are changing drastically. On the one hand, researchers demand intelligent search services to discover scientific publications. On the other hand, publishers need to incorporate semantic information to better organize their digital assets and make publications more discoverable. For this purpose, we investigate how to manage scientific publications using Linked Data and introduce FELinker, an entity linking component that links scientific publications with DBPedia. In particular, this paper introduces advantages of linking scientific publications with Linked Data, discusses major challenges, and outlines the proposed method. Experiment shows the proposed method could get promising performance in scientific publication linkage.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"258 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124237986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2015-02-01DOI: 10.1109/ICOSC.2015.7050839
Krishna Sapkota, Pathmeswaran Raju, William J. Byrne, C. Chapman
Bioenergy is a renewable energy generated from biomass, while biofuel is a hydrocarbon fuel that is produced from biomass. Recently, bioenergy and biofuel projects are encouraged and supported by many governments and organizations in various ways such as providing incentives, technical supports, information, and decision support tools. Economic model is one of the decision support tools, which helps to estimate the costs and earnings involved in a project. It is constructed with various elements such as concepts, relations, logics, constants and equations. In current economic models, all the elements are hard coded into some programming code, which makes the model less reusable and extendable. To address the issue, we present an ontology-based economic model in this paper. In particular, we have leveraged the Semantic Web technologies to represent the knowledge about the bioenergy and biofuel economics and inferred the equations and other values required for economic calculations. The case study has been carried out in two of the INTERREG Projects and found promising results.
{"title":"Ontology-based economic models for bioenergy and biofuel projects","authors":"Krishna Sapkota, Pathmeswaran Raju, William J. Byrne, C. Chapman","doi":"10.1109/ICOSC.2015.7050839","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050839","url":null,"abstract":"Bioenergy is a renewable energy generated from biomass, while biofuel is a hydrocarbon fuel that is produced from biomass. Recently, bioenergy and biofuel projects are encouraged and supported by many governments and organizations in various ways such as providing incentives, technical supports, information, and decision support tools. Economic model is one of the decision support tools, which helps to estimate the costs and earnings involved in a project. It is constructed with various elements such as concepts, relations, logics, constants and equations. In current economic models, all the elements are hard coded into some programming code, which makes the model less reusable and extendable. To address the issue, we present an ontology-based economic model in this paper. In particular, we have leveraged the Semantic Web technologies to represent the knowledge about the bioenergy and biofuel economics and inferred the equations and other values required for economic calculations. The case study has been carried out in two of the INTERREG Projects and found promising results.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134516140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2015-02-01DOI: 10.1109/ICOSC.2015.7050860
Yuan-Chih Yu
When disaster strikes the urban community, residents may suffer life-threatening, environmental impact, and economic loss. Meanwhile, the cellular and Internet services are prone to be fail because the disaster may cause the network infrastructure damage. As the communication service is so important for the disaster response, we suggest an emergency social networking solution, called ECSN, to conquer such crisis. ECSN is a community-based emergent social networking service suitable for dealing with the tasks on disaster response. To provide the ECSN service, we construct a Disaster Response Portal dedicated designed for disaster management. From the software architecture perspective, it has mobile client agents and server-side services working together to realize the concept of “Community Social Networking”. Through the Disaster Response Portal, the local disaster rescue and response can be integrated with nationwide disaster management. Also, disaster response can be easily planned and manageable at a community scope and benefits other emergent measures taken by disaster prevention, mitigation, preparedness, and recovery. After simulation experiments validate, the result shows the system can work well on the problem domain. Most importantly, the total solution creates a new practicable model for the mobility of urban disaster response.
{"title":"A mobile social networking service for urban community disaster response","authors":"Yuan-Chih Yu","doi":"10.1109/ICOSC.2015.7050860","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050860","url":null,"abstract":"When disaster strikes the urban community, residents may suffer life-threatening, environmental impact, and economic loss. Meanwhile, the cellular and Internet services are prone to be fail because the disaster may cause the network infrastructure damage. As the communication service is so important for the disaster response, we suggest an emergency social networking solution, called ECSN, to conquer such crisis. ECSN is a community-based emergent social networking service suitable for dealing with the tasks on disaster response. To provide the ECSN service, we construct a Disaster Response Portal dedicated designed for disaster management. From the software architecture perspective, it has mobile client agents and server-side services working together to realize the concept of “Community Social Networking”. Through the Disaster Response Portal, the local disaster rescue and response can be integrated with nationwide disaster management. Also, disaster response can be easily planned and manageable at a community scope and benefits other emergent measures taken by disaster prevention, mitigation, preparedness, and recovery. After simulation experiments validate, the result shows the system can work well on the problem domain. Most importantly, the total solution creates a new practicable model for the mobility of urban disaster response.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115658262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2015-02-01DOI: 10.1109/ICOSC.2015.7050813
Carmelo Spiccia, A. Augello, G. Pilato, G. Vassallo
Word prediction generally relies on n-grams occurrence statistics, which may have huge data storage requirements and does not take into account the general meaning of the text. We propose an alternative methodology, based on Latent Semantic Analysis, to address these issues. An asymmetric Word-Word frequency matrix is employed to achieve higher scalability with large training datasets than the classic Word-Document approach. We propose a function for scoring candidate terms for the missing word in a sentence. We show how this function approximates the probability of occurrence of a given candidate word. Experimental results show that the proposed approach outperforms non neural network language models.
{"title":"A word prediction methodology for automatic sentence completion","authors":"Carmelo Spiccia, A. Augello, G. Pilato, G. Vassallo","doi":"10.1109/ICOSC.2015.7050813","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050813","url":null,"abstract":"Word prediction generally relies on n-grams occurrence statistics, which may have huge data storage requirements and does not take into account the general meaning of the text. We propose an alternative methodology, based on Latent Semantic Analysis, to address these issues. An asymmetric Word-Word frequency matrix is employed to achieve higher scalability with large training datasets than the classic Word-Document approach. We propose a function for scoring candidate terms for the missing word in a sentence. We show how this function approximates the probability of occurrence of a given candidate word. Experimental results show that the proposed approach outperforms non neural network language models.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121924075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2015-02-01DOI: 10.1109/ICOSC.2015.7050859
Shailja Sharma, J. Lather, M. Dave
The Current description standards for Web Services such as WSDL and UDDI have a significant drawback of being restricted to the syntactic aspects of service. A service provider registers a service in the universal repository i.e. UDDI so that the service consumers can search and discover the required service that meets the user functional requirements from thousands of registered services. Matching the user request with all services in a particular category of the repository is a cumbersome task. Semantic approaches are required to further assist the user in discovering relevant services. In this paper, we have proposed a semantic approach that gives ranked list of services based on the web based relatedness score and helps the users in the selection of potentially relevant and semantically similar services within a category. The proposed approach has been implemented on 80 OWLS services and the results have shown that the approach gives ranked list of services with ease of the selection process for the user.
{"title":"Google based hybrid approach for discovering services","authors":"Shailja Sharma, J. Lather, M. Dave","doi":"10.1109/ICOSC.2015.7050859","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050859","url":null,"abstract":"The Current description standards for Web Services such as WSDL and UDDI have a significant drawback of being restricted to the syntactic aspects of service. A service provider registers a service in the universal repository i.e. UDDI so that the service consumers can search and discover the required service that meets the user functional requirements from thousands of registered services. Matching the user request with all services in a particular category of the repository is a cumbersome task. Semantic approaches are required to further assist the user in discovering relevant services. In this paper, we have proposed a semantic approach that gives ranked list of services based on the web based relatedness score and helps the users in the selection of potentially relevant and semantically similar services within a category. The proposed approach has been implemented on 80 OWLS services and the results have shown that the approach gives ranked list of services with ease of the selection process for the user.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132458953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2015-02-01DOI: 10.1109/ICOSC.2015.7050814
D. Dou, Hao Wang, Haishan Liu
Semantic Data Mining refers to the data mining tasks that systematically incorporate domain knowledge, especially formal semantics, into the process. In the past, many research efforts have attested the benefits of incorporating domain knowledge in data mining. At the same time, the proliferation of knowledge engineering has enriched the family of domain knowledge, especially formal semantics and Semantic Web ontologies. Ontology is an explicit specification of conceptualization and a formal way to define the semantics of knowledge and data. The formal structure of ontology makes it a nature way to encode domain knowledge for the data mining use. In this survey paper, we introduce general concepts of semantic data mining. We investigate why ontology has the potential to help semantic data mining and how formal semantics in ontologies can be incorporated into the data mining process. We provide detail discussions for the advances and state of art of ontology-based approaches and an introduction of approaches that are based on other form of knowledge representations.
{"title":"Semantic data mining: A survey of ontology-based approaches","authors":"D. Dou, Hao Wang, Haishan Liu","doi":"10.1109/ICOSC.2015.7050814","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050814","url":null,"abstract":"Semantic Data Mining refers to the data mining tasks that systematically incorporate domain knowledge, especially formal semantics, into the process. In the past, many research efforts have attested the benefits of incorporating domain knowledge in data mining. At the same time, the proliferation of knowledge engineering has enriched the family of domain knowledge, especially formal semantics and Semantic Web ontologies. Ontology is an explicit specification of conceptualization and a formal way to define the semantics of knowledge and data. The formal structure of ontology makes it a nature way to encode domain knowledge for the data mining use. In this survey paper, we introduce general concepts of semantic data mining. We investigate why ontology has the potential to help semantic data mining and how formal semantics in ontologies can be incorporated into the data mining process. We provide detail discussions for the advances and state of art of ontology-based approaches and an introduction of approaches that are based on other form of knowledge representations.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127429812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2014-08-31DOI: 10.1109/ICOSC.2015.7050794
Udaya Raj Dhungana, S. Shakya, K. Baral, Bharat Sharma
This paper presents a new model of WordNet that is used to disambiguate the correct sense of polysemy word based on the clue words. The related words for each sense of a polysemy word as well as single sense word are referred to as the clue words. The conventional WordNet organises nouns, verbs, adjectives and adverbs together into sets of synonyms called synsets each expressing a different concept. In contrast to the structure of WordNet, we developed a new model of WordNet that organizes the different senses of polysemy words as well as the single sense words based on the clue words. These clue words for each sense of a polysemy word as well as for single sense word are used to disambiguate the correct meaning of the polysemy word in the given context using knowledge-based Word Sense Disambiguation (WSD) algorithms. The clue word can be a noun, verb, adjective or adverb.
{"title":"Word Sense Disambiguation using WSD specific WordNet of polysemy words","authors":"Udaya Raj Dhungana, S. Shakya, K. Baral, Bharat Sharma","doi":"10.1109/ICOSC.2015.7050794","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050794","url":null,"abstract":"This paper presents a new model of WordNet that is used to disambiguate the correct sense of polysemy word based on the clue words. The related words for each sense of a polysemy word as well as single sense word are referred to as the clue words. The conventional WordNet organises nouns, verbs, adjectives and adverbs together into sets of synonyms called synsets each expressing a different concept. In contrast to the structure of WordNet, we developed a new model of WordNet that organizes the different senses of polysemy words as well as the single sense words based on the clue words. These clue words for each sense of a polysemy word as well as for single sense word are used to disambiguate the correct meaning of the polysemy word in the given context using knowledge-based Word Sense Disambiguation (WSD) algorithms. The clue word can be a noun, verb, adjective or adverb.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123313424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}