{"title":"MMPKUBase:全面、高质量的中文多模态知识图谱","authors":"Xuan Yi, Yanzeng Li, Lei Zou","doi":"arxiv-2408.01679","DOIUrl":null,"url":null,"abstract":"Multi-modal knowledge graphs have emerged as a powerful approach for\ninformation representation, combining data from different modalities such as\ntext, images, and videos. While several such graphs have been constructed and\nhave played important roles in applications like visual question answering and\nrecommendation systems, challenges persist in their development. These include\nthe scarcity of high-quality Chinese knowledge graphs and limited domain\ncoverage in existing multi-modal knowledge graphs. This paper introduces\nMMPKUBase, a robust and extensive Chinese multi-modal knowledge graph that\ncovers diverse domains, including birds, mammals, ferns, and more, comprising\nover 50,000 entities and over 1 million filtered images. To ensure data\nquality, we employ Prototypical Contrastive Learning and the Isolation Forest\nalgorithm to refine the image data. Additionally, we have developed a\nuser-friendly platform to facilitate image attribute exploration.","PeriodicalId":501480,"journal":{"name":"arXiv - CS - Multimedia","volume":"79 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MMPKUBase: A Comprehensive and High-quality Chinese Multi-modal Knowledge Graph\",\"authors\":\"Xuan Yi, Yanzeng Li, Lei Zou\",\"doi\":\"arxiv-2408.01679\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multi-modal knowledge graphs have emerged as a powerful approach for\\ninformation representation, combining data from different modalities such as\\ntext, images, and videos. While several such graphs have been constructed and\\nhave played important roles in applications like visual question answering and\\nrecommendation systems, challenges persist in their development. These include\\nthe scarcity of high-quality Chinese knowledge graphs and limited domain\\ncoverage in existing multi-modal knowledge graphs. This paper introduces\\nMMPKUBase, a robust and extensive Chinese multi-modal knowledge graph that\\ncovers diverse domains, including birds, mammals, ferns, and more, comprising\\nover 50,000 entities and over 1 million filtered images. To ensure data\\nquality, we employ Prototypical Contrastive Learning and the Isolation Forest\\nalgorithm to refine the image data. Additionally, we have developed a\\nuser-friendly platform to facilitate image attribute exploration.\",\"PeriodicalId\":501480,\"journal\":{\"name\":\"arXiv - CS - Multimedia\",\"volume\":\"79 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Multimedia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.01679\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.01679","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
MMPKUBase: A Comprehensive and High-quality Chinese Multi-modal Knowledge Graph
Multi-modal knowledge graphs have emerged as a powerful approach for
information representation, combining data from different modalities such as
text, images, and videos. While several such graphs have been constructed and
have played important roles in applications like visual question answering and
recommendation systems, challenges persist in their development. These include
the scarcity of high-quality Chinese knowledge graphs and limited domain
coverage in existing multi-modal knowledge graphs. This paper introduces
MMPKUBase, a robust and extensive Chinese multi-modal knowledge graph that
covers diverse domains, including birds, mammals, ferns, and more, comprising
over 50,000 entities and over 1 million filtered images. To ensure data
quality, we employ Prototypical Contrastive Learning and the Isolation Forest
algorithm to refine the image data. Additionally, we have developed a
user-friendly platform to facilitate image attribute exploration.