{"title":"使用基于 k-轮廓的递归深度图聚类提取搜索任务","authors":"Nurullah Ates , Yusuf Yaslan","doi":"10.1016/j.engappai.2024.109501","DOIUrl":null,"url":null,"abstract":"<div><div>Search engines must accurately predict the implicit intent of users to effectively guide their online search experience and assist them in completing their tasks. Users create time-ordered query logs by performing various queries on search engines to access desired information. Search task extraction groups queries with the same intent into unique clusters, whether these queries come from different tasks within the same session or from the same task across different sessions. Accurate identification of user intent improves the performance of search-guiding processes, including query suggestion, personalized search, and advertisement retrieval. Many existing methods focus on creating graphs that show relationships between queries. However, these methods typically cluster the graph using simple threshold-based techniques rather than leveraging graph topological structure features. Recent studies have introduced deep clustering layers to prevent the model size from growing as the number of queries increases. However, these models rely on labeled data and overlook modern embeddings from language models. We propose a novel k-contour-based graph convolutional network connective proximity clustering layer (CoGCN-C-CL) architecture that clusters graphs without requiring labeled data by leveraging graph topological properties. CoGCN-C-CL simultaneously learns query representations and search tasks. The k-contours identify distinct regions of the graph, while the graph convolutional network (GCN) exploits interactions between nodes within these regions. Experimental results demonstrate that CoGCN-C-CL outperforms existing state-of-the-art search task clustering methods on frequently used search task datasets.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Search task extraction using k-contour based recurrent deep graph clustering\",\"authors\":\"Nurullah Ates , Yusuf Yaslan\",\"doi\":\"10.1016/j.engappai.2024.109501\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Search engines must accurately predict the implicit intent of users to effectively guide their online search experience and assist them in completing their tasks. Users create time-ordered query logs by performing various queries on search engines to access desired information. Search task extraction groups queries with the same intent into unique clusters, whether these queries come from different tasks within the same session or from the same task across different sessions. Accurate identification of user intent improves the performance of search-guiding processes, including query suggestion, personalized search, and advertisement retrieval. Many existing methods focus on creating graphs that show relationships between queries. However, these methods typically cluster the graph using simple threshold-based techniques rather than leveraging graph topological structure features. Recent studies have introduced deep clustering layers to prevent the model size from growing as the number of queries increases. However, these models rely on labeled data and overlook modern embeddings from language models. We propose a novel k-contour-based graph convolutional network connective proximity clustering layer (CoGCN-C-CL) architecture that clusters graphs without requiring labeled data by leveraging graph topological properties. CoGCN-C-CL simultaneously learns query representations and search tasks. The k-contours identify distinct regions of the graph, while the graph convolutional network (GCN) exploits interactions between nodes within these regions. Experimental results demonstrate that CoGCN-C-CL outperforms existing state-of-the-art search task clustering methods on frequently used search task datasets.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2024-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197624016592\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016592","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Search task extraction using k-contour based recurrent deep graph clustering
Search engines must accurately predict the implicit intent of users to effectively guide their online search experience and assist them in completing their tasks. Users create time-ordered query logs by performing various queries on search engines to access desired information. Search task extraction groups queries with the same intent into unique clusters, whether these queries come from different tasks within the same session or from the same task across different sessions. Accurate identification of user intent improves the performance of search-guiding processes, including query suggestion, personalized search, and advertisement retrieval. Many existing methods focus on creating graphs that show relationships between queries. However, these methods typically cluster the graph using simple threshold-based techniques rather than leveraging graph topological structure features. Recent studies have introduced deep clustering layers to prevent the model size from growing as the number of queries increases. However, these models rely on labeled data and overlook modern embeddings from language models. We propose a novel k-contour-based graph convolutional network connective proximity clustering layer (CoGCN-C-CL) architecture that clusters graphs without requiring labeled data by leveraging graph topological properties. CoGCN-C-CL simultaneously learns query representations and search tasks. The k-contours identify distinct regions of the graph, while the graph convolutional network (GCN) exploits interactions between nodes within these regions. Experimental results demonstrate that CoGCN-C-CL outperforms existing state-of-the-art search task clustering methods on frequently used search task datasets.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.