Pub Date : 1974-02-01DOI: 10.1016/0020-0271(74)90005-9
Martin Dillon
An experiment in superficial indexing was carried out to elaborate the contrast between indexing with a controlled vocabulary and using automated methods as described by salton [1, 2]. Superficial indexing refers to a set of procedures for selecting, organizing and using indexing terms modelled after automatic techniques but requiring neither the full text of articles, abstracts or machine readable text. It represents a minimal effort in manual indexing designed, in part, to allow the development of an automated version.
Retrieval comparisons were made with a manual system using controlled techniques, and though they were not as favorable as the results reported by Salton, still favor development of such techniques over manual systems.
{"title":"An experiment in superficial indexing","authors":"Martin Dillon","doi":"10.1016/0020-0271(74)90005-9","DOIUrl":"10.1016/0020-0271(74)90005-9","url":null,"abstract":"<div><p>An experiment in superficial indexing was carried out to elaborate the contrast between indexing with a controlled vocabulary and using automated methods as described by salton [1, 2]. Superficial indexing refers to a set of procedures for selecting, organizing and using indexing terms modelled after automatic techniques but requiring neither the full text of articles, abstracts or machine readable text. It represents a minimal effort in manual indexing designed, in part, to allow the development of an automated version.</p><p>Retrieval comparisons were made with a manual system using controlled techniques, and though they were not as favorable as the results reported by Salton, still favor development of such techniques over manual systems.</p></div>","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"10 2","pages":"Pages 63-71"},"PeriodicalIF":0.0,"publicationDate":"1974-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(74)90005-9","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131280840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1974-02-01DOI: 10.1016/0020-0271(74)90008-4
A. Merta
{"title":"Mathematics for information workers","authors":"A. Merta","doi":"10.1016/0020-0271(74)90008-4","DOIUrl":"10.1016/0020-0271(74)90008-4","url":null,"abstract":"","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"10 2","pages":"Page 81"},"PeriodicalIF":0.0,"publicationDate":"1974-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(74)90008-4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82354846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1974-02-01DOI: 10.1016/0020-0271(74)90009-6
A. Merta
{"title":"Statistics for information workers","authors":"A. Merta","doi":"10.1016/0020-0271(74)90009-6","DOIUrl":"10.1016/0020-0271(74)90009-6","url":null,"abstract":"","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"10 2","pages":"Page 81"},"PeriodicalIF":0.0,"publicationDate":"1974-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(74)90009-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80609901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1974-01-01DOI: 10.1016/0020-0271(74)90041-2
Rowena W. Swanson
{"title":"Proceedings of the conference on interlibrary communications and information networks","authors":"Rowena W. Swanson","doi":"10.1016/0020-0271(74)90041-2","DOIUrl":"10.1016/0020-0271(74)90041-2","url":null,"abstract":"","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"10 1","pages":"Pages 33-35"},"PeriodicalIF":0.0,"publicationDate":"1974-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(74)90041-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74379174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1974-01-01DOI: 10.1016/0020-0271(74)90038-2
C.J. van Rijsbergen
The purpose of this paper is to report the results of experiments in document clustering using three well known test collections. Automatic classification is briefly introduced. The hypothesis underlying the use of clustering is discussed. A framework for the evaluation of cluster-based retrieval strategies is constructed. These strategies are shown to be dependent on the method of cluster representation (cluster profile) adopted. Finally, a particular cluster-based strategy together with a cluster representation method associated with it is examined and evaluated in detail.
{"title":"Further experiments with hierarchic clustering in document retrieval","authors":"C.J. van Rijsbergen","doi":"10.1016/0020-0271(74)90038-2","DOIUrl":"10.1016/0020-0271(74)90038-2","url":null,"abstract":"<div><p>The purpose of this paper is to report the results of experiments in document clustering using three well known test collections. Automatic classification is briefly introduced. The hypothesis underlying the use of clustering is discussed. A framework for the evaluation of cluster-based retrieval strategies is constructed. These strategies are shown to be dependent on the method of cluster representation (cluster profile) adopted. Finally, a particular cluster-based strategy together with a cluster representation method associated with it is examined and evaluated in detail.</p></div>","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"10 1","pages":"Pages 1-14"},"PeriodicalIF":0.0,"publicationDate":"1974-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(74)90038-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122028964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1974-01-01DOI: 10.1016/0020-0271(74)90039-4
Noah S. Prywes, Allen L. Lang, Susan Zagorsky
This paper reports on a series of programs that have been developed to process data-bases, consisting of textual items, and to index and arrange (classify) the data items in accordance with an automatically generated classification system. The programs produce directories and a classification-number ordered data-base on microfilm, where it may be searched using a microfilm reader, or on magnetic tape for input to an on-line computer system for search and retrieval.
In the automatic indexing, candidate index words or phrases are selected automatically and the user can reject unsuitable candidate words, en masse, based on various listings prepared by the computer.
The automatic classification of text items in conjunction with the indexing produces a number of useful features. First, it allows the grouping of “like” items on a print-out, on microfilm or in computer storage. This organization of the items is useful for searching and understanding of the content of the data-base. Searches by conjunction of index terms are simplified and can be performed manually (or automatically) using the directories that are produced.
Automatic processing of raw text data, and requiring little user work make the system attractive where low cost is imperative, such as in private or specialised data-bases.
{"title":"A-posteriori indexing, classification and retrieval of textual data","authors":"Noah S. Prywes, Allen L. Lang, Susan Zagorsky","doi":"10.1016/0020-0271(74)90039-4","DOIUrl":"10.1016/0020-0271(74)90039-4","url":null,"abstract":"<div><p>This paper reports on a series of programs that have been developed to process data-bases, consisting of textual items, and to <em>index</em> and arrange (classify) the data items in accordance with an <em>automatically generated classification system</em>. The programs produce directories and a classification-number ordered data-base on microfilm, where it may be searched using a microfilm reader, or on magnetic tape for input to an on-line computer system for search and retrieval.</p><p>In the automatic indexing, candidate index words or phrases are selected automatically and the user can reject unsuitable candidate words, <em>en masse</em>, based on various listings prepared by the computer.</p><p>The automatic classification of text items in conjunction with the indexing produces a number of useful features. First, it allows the grouping of “like” items on a print-out, on microfilm or in computer storage. This organization of the items is useful for searching and understanding of the content of the data-base. Searches by conjunction of index terms are simplified and can be performed manually (or automatically) using the directories that are produced.</p><p>Automatic processing of raw text data, and requiring little user work make the system attractive where low cost is imperative, such as in private or specialised data-bases.</p></div>","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"10 1","pages":"Pages 15-27"},"PeriodicalIF":0.0,"publicationDate":"1974-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(74)90039-4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124367138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1974-01-01DOI: 10.1016/0020-0271(74)90040-0
Aaron Tenenbein
The binary search scheme is a method of finding a particular file from a set of ordered files stored in a computer. This scheme involves examining the middle file, and eliminating those files which do not contain the required file. This procedure is repeated until the required file is found.
In this paper an exact expression for the expected number of passes required to find a file selected at random is derived. The expected number of passes is shown to be asymptotically equal to log2k where k is the number of files. The asymptotic approximate values are compared to the exact values.
{"title":"Expected number of passes in a binary search scheme","authors":"Aaron Tenenbein","doi":"10.1016/0020-0271(74)90040-0","DOIUrl":"10.1016/0020-0271(74)90040-0","url":null,"abstract":"<div><p>The binary search scheme is a method of finding a particular file from a set of ordered files stored in a computer. This scheme involves examining the middle file, and eliminating those files which do not contain the required file. This procedure is repeated until the required file is found.</p><p>In this paper an exact expression for the expected number of passes required to find a file selected at random is derived. The expected number of passes is shown to be asymptotically equal to log<sub>2</sub> <em>k</em> where <em>k</em> is the number of files. The asymptotic approximate values are compared to the exact values.</p></div>","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"10 1","pages":"Pages 29-32"},"PeriodicalIF":0.0,"publicationDate":"1974-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(74)90040-0","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130025137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1973-12-01DOI: 10.1016/0020-0271(73)90008-9
Lois L. Earl
The term “word government” is used to convey the concept that some words are habitually used with certain constructions, which in a sense they control or govern. A “governing word” is one which requires one or more of several constructions to complete its meaning, and a “word government table” is a tabulation of such words with information about their parts of speech, patterns of use, meanings and meaning-relationships with other words in the construction. Because governing words are the troublesome words of the language—high usage words which have acquired many meanings and usages—tabulating such words and documenting their use pay rich dividends in text processing applications. As part of a project in automatic indexing and abstracting at the Lockheed Palo Alto Research Laboratory, government tables have been compiled for all governing words beginning with the letters A through R ; it is estimated that the complete tables will contain fewer than 8000 governing word entries. This paper first describes how the format of the tables has developed as the tables have been compiled. The use of the tables for resolving syntactic and semantic ambiguities is indicated in some detail. Some practical applications of word government analysis are discussed. Finally, word government techniques are compared to other approaches which deal with the problems of capturing meaning for text processing purposes, and some of the limitations of word government are noted.
“词语政府”一词用来传达这样一个概念,即一些词语习惯性地与某些结构一起使用,在某种意义上它们控制或统治着这些结构。“支配词”是指需要一个或多个结构来完成其意义的词,而“支配词表”是一个关于这些词的词性、使用模式、意义以及与该结构中其他词的意义关系的信息的表格。由于支配词是语言中比较麻烦的词——具有多种含义和用法的高使用率词,因此将这些词制表并记录它们的用法在文本处理应用中有很大的好处。作为洛克希德帕洛阿尔托研究实验室(Lockheed Palo Alto Research Laboratory)自动索引和摘要项目的一部分,政府表格已经编制了所有以字母a到R开头的主导词;据估计,完整的表将包含少于8000个控制词条目。本文首先描述了表格的格式是如何随着表格的编制而发展的。文中还详细说明了如何使用表来解决语法和语义上的歧义。讨论了词政府分析的一些实际应用。最后,将词政府技术与其他处理为文本处理目的捕获意义问题的方法进行比较,并指出词政府的一些局限性。
{"title":"Use of word government in resolving syntactic and semantic ambiguities","authors":"Lois L. Earl","doi":"10.1016/0020-0271(73)90008-9","DOIUrl":"10.1016/0020-0271(73)90008-9","url":null,"abstract":"<div><p>The term “word government” is used to convey the concept that some words are habitually used with certain constructions, which in a sense they control or govern. A “governing word” is one which requires one or more of several constructions to complete its meaning, and a “word government table” is a tabulation of such words with information about their parts of speech, patterns of use, meanings and meaning-relationships with other words in the construction. Because governing words are the troublesome words of the language—high usage words which have acquired many meanings and usages—tabulating such words and documenting their use pay rich dividends in text processing applications. As part of a project in automatic indexing and abstracting at the Lockheed Palo Alto Research Laboratory, government tables have been compiled for all governing words beginning with the letters A through R ; it is estimated that the complete tables will contain fewer than 8000 governing word entries. This paper first describes how the format of the tables has developed as the tables have been compiled. The use of the tables for resolving syntactic and semantic ambiguities is indicated in some detail. Some practical applications of word government analysis are discussed. Finally, word government techniques are compared to other approaches which deal with the problems of capturing meaning for text processing purposes, and some of the limitations of word government are noted.</p></div>","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"9 12","pages":"Pages 639-664"},"PeriodicalIF":0.0,"publicationDate":"1973-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(73)90008-9","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130318464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1973-12-01DOI: 10.1016/0020-0271(73)90017-X
H. Borko
{"title":"Mathematics for liberal arts students","authors":"H. Borko","doi":"10.1016/0020-0271(73)90017-X","DOIUrl":"10.1016/0020-0271(73)90017-X","url":null,"abstract":"","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"9 12","pages":"Page 725"},"PeriodicalIF":0.0,"publicationDate":"1973-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(73)90017-X","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78171558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}