{"title":"后验索引,分类和检索文本数据","authors":"Noah S. Prywes, Allen L. Lang, Susan Zagorsky","doi":"10.1016/0020-0271(74)90039-4","DOIUrl":null,"url":null,"abstract":"<div><p>This paper reports on a series of programs that have been developed to process data-bases, consisting of textual items, and to <em>index</em> and arrange (classify) the data items in accordance with an <em>automatically generated classification system</em>. The programs produce directories and a classification-number ordered data-base on microfilm, where it may be searched using a microfilm reader, or on magnetic tape for input to an on-line computer system for search and retrieval.</p><p>In the automatic indexing, candidate index words or phrases are selected automatically and the user can reject unsuitable candidate words, <em>en masse</em>, based on various listings prepared by the computer.</p><p>The automatic classification of text items in conjunction with the indexing produces a number of useful features. First, it allows the grouping of “like” items on a print-out, on microfilm or in computer storage. This organization of the items is useful for searching and understanding of the content of the data-base. Searches by conjunction of index terms are simplified and can be performed manually (or automatically) using the directories that are produced.</p><p>Automatic processing of raw text data, and requiring little user work make the system attractive where low cost is imperative, such as in private or specialised data-bases.</p></div>","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"10 1","pages":"Pages 15-27"},"PeriodicalIF":0.0000,"publicationDate":"1974-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(74)90039-4","citationCount":"2","resultStr":"{\"title\":\"A-posteriori indexing, classification and retrieval of textual data\",\"authors\":\"Noah S. Prywes, Allen L. Lang, Susan Zagorsky\",\"doi\":\"10.1016/0020-0271(74)90039-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This paper reports on a series of programs that have been developed to process data-bases, consisting of textual items, and to <em>index</em> and arrange (classify) the data items in accordance with an <em>automatically generated classification system</em>. The programs produce directories and a classification-number ordered data-base on microfilm, where it may be searched using a microfilm reader, or on magnetic tape for input to an on-line computer system for search and retrieval.</p><p>In the automatic indexing, candidate index words or phrases are selected automatically and the user can reject unsuitable candidate words, <em>en masse</em>, based on various listings prepared by the computer.</p><p>The automatic classification of text items in conjunction with the indexing produces a number of useful features. First, it allows the grouping of “like” items on a print-out, on microfilm or in computer storage. This organization of the items is useful for searching and understanding of the content of the data-base. Searches by conjunction of index terms are simplified and can be performed manually (or automatically) using the directories that are produced.</p><p>Automatic processing of raw text data, and requiring little user work make the system attractive where low cost is imperative, such as in private or specialised data-bases.</p></div>\",\"PeriodicalId\":100670,\"journal\":{\"name\":\"Information Storage and Retrieval\",\"volume\":\"10 1\",\"pages\":\"Pages 15-27\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1974-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1016/0020-0271(74)90039-4\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Storage and Retrieval\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/0020027174900394\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Storage and Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/0020027174900394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A-posteriori indexing, classification and retrieval of textual data
This paper reports on a series of programs that have been developed to process data-bases, consisting of textual items, and to index and arrange (classify) the data items in accordance with an automatically generated classification system. The programs produce directories and a classification-number ordered data-base on microfilm, where it may be searched using a microfilm reader, or on magnetic tape for input to an on-line computer system for search and retrieval.
In the automatic indexing, candidate index words or phrases are selected automatically and the user can reject unsuitable candidate words, en masse, based on various listings prepared by the computer.
The automatic classification of text items in conjunction with the indexing produces a number of useful features. First, it allows the grouping of “like” items on a print-out, on microfilm or in computer storage. This organization of the items is useful for searching and understanding of the content of the data-base. Searches by conjunction of index terms are simplified and can be performed manually (or automatically) using the directories that are produced.
Automatic processing of raw text data, and requiring little user work make the system attractive where low cost is imperative, such as in private or specialised data-bases.