智能索引和排序管理系统-各种主题的自动搜索索引和排序

IF 0.6 4区 工程技术 Q4 Engineering Nuclear Engineering International Pub Date : 2020-12-20 DOI:10.18034/EI.V8I2.554
Apoorva Ganapathy, Takudzwa Fadziso
{"title":"智能索引和排序管理系统-各种主题的自动搜索索引和排序","authors":"Apoorva Ganapathy, Takudzwa Fadziso","doi":"10.18034/EI.V8I2.554","DOIUrl":null,"url":null,"abstract":"An issue that the majority of the databases face is the static and manual character of indexing activities. This traditional method of indexing and sorting different topics is confirmed to shake the dataset performance somewhat, making downtime and a potential effect in the presentation that is normally addressed by manually indexing operations. Numerous data mining methods can accelerate this process by using proper indexing structures. Choosing the appropriate index generally relies upon the kind of operation that the algorithm performs against the dataset. Topic indexing is the operation of recognizing the principal topics covered by a document. These are helpful for some reasons: as subject headings in libraries, as keywords in scholarly articles, and as hashtags on social media platforms. Knowing a document’s topic assists individuals with deciding its importance quickly. In any case, assigning topics manually is a tedious and redundant task. This paper shows the best way to create them automatically in a way that contends with manual indexing done by humans. This paper also talks about the issues and the techniques for identifying applicable data in a huge variety of documents. The contribution of this thesis to this issue is to foster better content analysis techniques that can be utilized to describe document content with automated index terms. Index terms can be used as meta-data that defines documents and is utilized for seeking various topics. The main point of this paper is to show the way toward creating an automatic indexer which analyzes the topic of documents by integrating proof from word frequencies and proof from the linguistic analysis given by a syntactic parser. The indexer weighs the expressions of a document as per their assessed significance for depicting the topic of a given document based on the content analysis.","PeriodicalId":49736,"journal":{"name":"Nuclear Engineering International","volume":null,"pages":null},"PeriodicalIF":0.6000,"publicationDate":"2020-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Intelligent Indexing and Sorting Management System – Automated Search Indexing and Sorting of Various Topics\",\"authors\":\"Apoorva Ganapathy, Takudzwa Fadziso\",\"doi\":\"10.18034/EI.V8I2.554\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An issue that the majority of the databases face is the static and manual character of indexing activities. This traditional method of indexing and sorting different topics is confirmed to shake the dataset performance somewhat, making downtime and a potential effect in the presentation that is normally addressed by manually indexing operations. Numerous data mining methods can accelerate this process by using proper indexing structures. Choosing the appropriate index generally relies upon the kind of operation that the algorithm performs against the dataset. Topic indexing is the operation of recognizing the principal topics covered by a document. These are helpful for some reasons: as subject headings in libraries, as keywords in scholarly articles, and as hashtags on social media platforms. Knowing a document’s topic assists individuals with deciding its importance quickly. In any case, assigning topics manually is a tedious and redundant task. This paper shows the best way to create them automatically in a way that contends with manual indexing done by humans. This paper also talks about the issues and the techniques for identifying applicable data in a huge variety of documents. The contribution of this thesis to this issue is to foster better content analysis techniques that can be utilized to describe document content with automated index terms. Index terms can be used as meta-data that defines documents and is utilized for seeking various topics. The main point of this paper is to show the way toward creating an automatic indexer which analyzes the topic of documents by integrating proof from word frequencies and proof from the linguistic analysis given by a syntactic parser. The indexer weighs the expressions of a document as per their assessed significance for depicting the topic of a given document based on the content analysis.\",\"PeriodicalId\":49736,\"journal\":{\"name\":\"Nuclear Engineering International\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2020-12-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nuclear Engineering International\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.18034/EI.V8I2.554\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Engineering\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nuclear Engineering International","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.18034/EI.V8I2.554","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 2

摘要

大多数数据库面临的一个问题是索引活动的静态和手动特征。这种对不同主题进行索引和排序的传统方法被证实会在一定程度上影响数据集的性能,造成停机时间和对演示文稿的潜在影响,通常可以通过手动索引操作来解决。许多数据挖掘方法都可以通过使用适当的索引结构来加速这一过程。选择合适的索引通常取决于算法对数据集执行的操作类型。主题索引是识别文档所涵盖的主要主题的操作。这些词之所以有用,是因为它们可以作为图书馆的主题标题、学术文章的关键词,以及社交媒体平台上的标签。了解文档的主题可以帮助人们快速确定其重要性。在任何情况下,手动分配主题都是一项乏味且冗余的任务。本文展示了自动创建索引的最佳方法,这种方法可以与人工索引相媲美。本文还讨论了在大量文档中识别适用数据的问题和技术。本文对这个问题的贡献在于培养更好的内容分析技术,这些技术可用于用自动索引术语描述文档内容。索引项可以用作定义文档的元数据,并用于查找各种主题。本文的主要目的是通过将词频证明和句法分析器给出的语言分析证明结合起来,实现自动索引器对文档主题的分析。索引器根据对描述基于内容分析的给定文档主题的评估意义对文档的表达式进行加权。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Intelligent Indexing and Sorting Management System – Automated Search Indexing and Sorting of Various Topics
An issue that the majority of the databases face is the static and manual character of indexing activities. This traditional method of indexing and sorting different topics is confirmed to shake the dataset performance somewhat, making downtime and a potential effect in the presentation that is normally addressed by manually indexing operations. Numerous data mining methods can accelerate this process by using proper indexing structures. Choosing the appropriate index generally relies upon the kind of operation that the algorithm performs against the dataset. Topic indexing is the operation of recognizing the principal topics covered by a document. These are helpful for some reasons: as subject headings in libraries, as keywords in scholarly articles, and as hashtags on social media platforms. Knowing a document’s topic assists individuals with deciding its importance quickly. In any case, assigning topics manually is a tedious and redundant task. This paper shows the best way to create them automatically in a way that contends with manual indexing done by humans. This paper also talks about the issues and the techniques for identifying applicable data in a huge variety of documents. The contribution of this thesis to this issue is to foster better content analysis techniques that can be utilized to describe document content with automated index terms. Index terms can be used as meta-data that defines documents and is utilized for seeking various topics. The main point of this paper is to show the way toward creating an automatic indexer which analyzes the topic of documents by integrating proof from word frequencies and proof from the linguistic analysis given by a syntactic parser. The indexer weighs the expressions of a document as per their assessed significance for depicting the topic of a given document based on the content analysis.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Nuclear Engineering International
Nuclear Engineering International 工程技术-核科学技术
自引率
0.00%
发文量
0
审稿时长
6-12 weeks
期刊介绍: Information not localized
期刊最新文献
A Review of Cybersecurity and Biometric Authentication in Cloud Network Network Security Framework, Techniques, and Design for Hybrid Cloud Wave Structures for Nonlinear Schrodinger Types Fractional Partial Differential Equations Arise in Physical Sciences Significant of Gradient Boosting Algorithm in Data Management System Handwritten Bangla Numerical Digit Recognition Using Fine Regulated Deep Neural Network
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1