Katsuhiro Honda, Takaya Nakano, S. Ubukata, A. Notsu
{"title":"具有排他项隶属度的模糊共聚类划分质量研究","authors":"Katsuhiro Honda, Takaya Nakano, S. Ubukata, A. Notsu","doi":"10.1109/ICIEV.2015.7334058","DOIUrl":null,"url":null,"abstract":"Bag-of-Words data analysis is a fundamental issue in web data mining for Big Data utilization, and Co-clustering is often applied to cooccurrence information analysis in such problems of document-keyword association research. In probabilistic partition models such as Multinomial Mixtures and Fuzzy c-Means-type ones, different partition constraints are forced to rows (objects) and columns (items), and then item memberships may not be useful in revealing item partitions. A possible approach in clarifying the interpretability of item partitions is additional penalization for exclusive item memberships, which was shown to emphasize cluster-wise representative items in document analysis. In this paper, the utility of the penalization approach is further studied through comparisons of partition qualities with several benchmark data sets. Several experimental results show that the additional penalty may sometime contribute to slightly improving the partition quality in addition to improvement of interpretability of co-cluster partitions.","PeriodicalId":367355,"journal":{"name":"2015 International Conference on Informatics, Electronics & Vision (ICIEV)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A study on partition quality of Fuzzy Co-clustering with exclusive item memberships\",\"authors\":\"Katsuhiro Honda, Takaya Nakano, S. Ubukata, A. Notsu\",\"doi\":\"10.1109/ICIEV.2015.7334058\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Bag-of-Words data analysis is a fundamental issue in web data mining for Big Data utilization, and Co-clustering is often applied to cooccurrence information analysis in such problems of document-keyword association research. In probabilistic partition models such as Multinomial Mixtures and Fuzzy c-Means-type ones, different partition constraints are forced to rows (objects) and columns (items), and then item memberships may not be useful in revealing item partitions. A possible approach in clarifying the interpretability of item partitions is additional penalization for exclusive item memberships, which was shown to emphasize cluster-wise representative items in document analysis. In this paper, the utility of the penalization approach is further studied through comparisons of partition qualities with several benchmark data sets. Several experimental results show that the additional penalty may sometime contribute to slightly improving the partition quality in addition to improvement of interpretability of co-cluster partitions.\",\"PeriodicalId\":367355,\"journal\":{\"name\":\"2015 International Conference on Informatics, Electronics & Vision (ICIEV)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-06-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Informatics, Electronics & Vision (ICIEV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIEV.2015.7334058\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Informatics, Electronics & Vision (ICIEV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIEV.2015.7334058","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A study on partition quality of Fuzzy Co-clustering with exclusive item memberships
Bag-of-Words data analysis is a fundamental issue in web data mining for Big Data utilization, and Co-clustering is often applied to cooccurrence information analysis in such problems of document-keyword association research. In probabilistic partition models such as Multinomial Mixtures and Fuzzy c-Means-type ones, different partition constraints are forced to rows (objects) and columns (items), and then item memberships may not be useful in revealing item partitions. A possible approach in clarifying the interpretability of item partitions is additional penalization for exclusive item memberships, which was shown to emphasize cluster-wise representative items in document analysis. In this paper, the utility of the penalization approach is further studied through comparisons of partition qualities with several benchmark data sets. Several experimental results show that the additional penalty may sometime contribute to slightly improving the partition quality in addition to improvement of interpretability of co-cluster partitions.