A Printing Workflow Recommendation Tool--Exploiting Correlations between Highly Sparse Case Logs

2006 5th International Conference on Machine Learning and Applications (ICMLA'06) Pub Date : 2006-12-14 DOI:10.1109/ICMLA.2006.10

Ming Zhong, Tong Sun

{"title":"A Printing Workflow Recommendation Tool--Exploiting Correlations between Highly Sparse Case Logs","authors":"Ming Zhong, Tong Sun","doi":"10.1109/ICMLA.2006.10","DOIUrl":null,"url":null,"abstract":"As a user preference prediction mechanism, recommendation techniques have been widely used to support personalized information filtering in current e-commerce applications. We build a recommendation tool into the existing Xerox printing workflow configuration system in order to provide new users with a number of solutions that are possibly of their interests. Such solution recommendations can significantly improve the system efficiency and accuracy by reducing workflow generation overhead and helping users quickly identify their needs. In our work, the main challenge is the high sparsity inherent to our application data - most fields have missing values due to a customer's lack of background or uncertainty on their specific needs. We address this problem by using latent semantic indexing (LSI) to merge original sparse data records into dense and semantic records. The generated dense data are then grouped into clusters based on their correlations. These clusters, together with their user patterns and representative workflows, are used to support efficient online workflow recommendation. Our implemented tool is able to achieve 83% accuracy on a dataset of 4569 case logs with 91% average sparseness","PeriodicalId":297071,"journal":{"name":"2006 5th International Conference on Machine Learning and Applications (ICMLA'06)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 5th International Conference on Machine Learning and Applications (ICMLA'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2006.10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

As a user preference prediction mechanism, recommendation techniques have been widely used to support personalized information filtering in current e-commerce applications. We build a recommendation tool into the existing Xerox printing workflow configuration system in order to provide new users with a number of solutions that are possibly of their interests. Such solution recommendations can significantly improve the system efficiency and accuracy by reducing workflow generation overhead and helping users quickly identify their needs. In our work, the main challenge is the high sparsity inherent to our application data - most fields have missing values due to a customer's lack of background or uncertainty on their specific needs. We address this problem by using latent semantic indexing (LSI) to merge original sparse data records into dense and semantic records. The generated dense data are then grouped into clusters based on their correlations. These clusters, together with their user patterns and representative workflows, are used to support efficient online workflow recommendation. Our implemented tool is able to achieve 83% accuracy on a dataset of 4569 case logs with 91% average sparseness

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一个打印工作流推荐工具——利用高度稀疏的案例日志之间的相关性

推荐技术作为一种用户偏好预测机制，在当前的电子商务应用中被广泛用于支持个性化信息过滤。我们在现有的施乐打印工作流程配置系统中构建了一个推荐工具，以便为新用户提供一些可能感兴趣的解决方案。这样的解决方案建议可以通过减少工作流生成开销和帮助用户快速识别他们的需求来显著提高系统效率和准确性。在我们的工作中，主要的挑战是我们的应用程序数据固有的高稀疏性——由于客户缺乏背景或不确定他们的特定需求，大多数字段都缺少值。我们通过使用潜在语义索引(LSI)将原始的稀疏数据记录合并为密集的语义记录来解决这个问题。然后根据它们的相关性将生成的密集数据分组到簇中。这些集群及其用户模式和代表性工作流用于支持有效的在线工作流推荐。我们实现的工具能够在包含4569个案例日志的数据集上实现83%的准确率，平均稀疏度为91%

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2006 5th International Conference on Machine Learning and Applications (ICMLA'06)

自引率

0.00%

发文量