Journal of the American Society for Information Science and Technology最新文献_第10页

Web-based analyses of E-journal impact: Approaches, problems, and issues 电子期刊影响的网络分析:方法、问题和议题

Journal of the American Society for Information Science and Technology

Pub Date : 2000-11-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1029%3E3.0.CO;2-P

S. P. Harter, C. Ford

This study1 assesses the ways in which citation searching of scholarly print journals is and is not analogous to backlink searching of scholarly e‐journal articles on the WWW, and identifies problems and issues related to conducting and interpreting such searches. Backlink searches are defined here as searches for Web pages that link to a given URL. Backlink searches were conducted on a sample of 39 scholarly electronic journals. Search results were processed to determine the number of backlinking pages, total backlinks, and external backlinks made to the e‐journals and to their articles. The results were compared to findings from a citation study performed on the same e‐journals in 1996. A content analysis of a sample of the files backlinked to e‐journal articles was also undertaken. The authors identify a number of reliability issues associated with the use of “raw” search engine data to evaluate the impact of electronic journals and articles. No correlation was found between backlink measures and ISI citation measures of e‐journal impact, suggesting that the two measures may be assessing something quite different. Major differences were found between the types of entities that cite, and those that backlink, e‐journal articles, with scholarly works comprising a very small percentage of backlinking files. These findings call into question the legitimacy of using backlink searches to evaluate the scholarly impact of e‐journals and e‐journal articles (and by extension, e‐journal authors).

本研究评估了学术印刷期刊的引文检索与学术电子期刊文章在WWW上的反向链接检索的相似之处和不同之处，并确定了与进行和解释此类检索相关的问题。这里将反向链接搜索定义为对链接到给定URL的网页的搜索。反向链接搜索以39种学术电子期刊为样本进行。对搜索结果进行处理，以确定反向链接页面的数量、总反向链接以及对电子期刊及其文章的外部反向链接。这些结果与1996年在同一电子期刊上进行的引文研究的结果进行了比较。还对反向链接到电子期刊文章的文件样本进行了内容分析。作者指出了一些与使用“原始”搜索引擎数据来评估电子期刊和文章影响相关的可靠性问题。没有发现反向链接测量和ISI引用测量之间的相关性，这表明这两种测量方法可能评估的是完全不同的东西。在引用电子期刊文章的实体类型和反向链接的实体类型之间发现了重大差异，学术作品在反向链接文件中所占的比例非常小。这些发现对使用反向链接搜索来评估电子期刊和电子期刊文章(以及电子期刊作者)的学术影响的合法性提出了质疑。

{"title":"Web-based analyses of E-journal impact: Approaches, problems, and issues","authors":"S. P. Harter, C. Ford","doi":"10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1029%3E3.0.CO;2-P","DOIUrl":"https://doi.org/10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1029%3E3.0.CO;2-P","url":null,"abstract":"This study1 assesses the ways in which citation searching of scholarly print journals is and is not analogous to backlink searching of scholarly e‐journal articles on the WWW, and identifies problems and issues related to conducting and interpreting such searches. Backlink searches are defined here as searches for Web pages that link to a given URL. Backlink searches were conducted on a sample of 39 scholarly electronic journals. Search results were processed to determine the number of backlinking pages, total backlinks, and external backlinks made to the e‐journals and to their articles. The results were compared to findings from a citation study performed on the same e‐journals in 1996. A content analysis of a sample of the files backlinked to e‐journal articles was also undertaken. The authors identify a number of reliability issues associated with the use of “raw” search engine data to evaluate the impact of electronic journals and articles. No correlation was found between backlink measures and ISI citation measures of e‐journal impact, suggesting that the two measures may be assessing something quite different. Major differences were found between the types of entities that cite, and those that backlink, e‐journal articles, with scholarly works comprising a very small percentage of backlinking files. These findings call into question the legitimacy of using backlink searches to evaluate the scholarly impact of e‐journals and e‐journal articles (and by extension, e‐journal authors).","PeriodicalId":50013,"journal":{"name":"Journal of the American Society for Information Science and Technology","volume":"8 1","pages":"1159-1176"},"PeriodicalIF":0.0,"publicationDate":"2000-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72977816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 67

Predicting the effectiveness of naïve data fusion on the basis of system characteristics 基于系统特性预测naïve数据融合的有效性

Journal of the American Society for Information Science and Technology

Pub Date : 2000-11-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1030%3E3.0.CO;2-E

K. Ng, P. Kantor

Effective automation of the information retrieval task has long been an active area of research, leading to sophisticated retrieval models. With many IR schemes available, researchers have begun to investigate the benefits of combining the results of different IR schemes to improve performance, in the process called “data fusion.” There are many successful data fusion experiments reported in IR literature, but there are also cases in which it did not work well. Thus, if would be quite valuable to have a theory that can predict, in advance, whether fusion of two or more retrieval schemes will be worth doing. In previous study (Ng & Kantor, 1998), we identified two predictive variables for the effectiveness of fusion: (a) a list‐based measure of output dissimilarity, and (b) a pair‐wise measure of the similarity of performance of the two schemes. In this article we investigate the predictive power of these two variables in simple symmetrical data fusion. We use the IR systems participating in the TREC 4 routing task to train a model that predicts the effectiveness of data fusion, and use the IR systems participating in the TREC 5 routing task to test that model. The model asks, “when will fusion perform better than an oracle who uses the best scheme from each pair?” We explore statistical techniques for fitting the model to the training data and use the receiver operating characteristic curve of signal detection theory to represent the power of the resulting models. The trained prediction methods predict whether fusion will beat an oracle, at levels much higher than could be achieved by chance.

信息检索任务的有效自动化一直是一个活跃的研究领域，导致了复杂的检索模型。有了许多可用的红外方案，研究人员已经开始研究将不同红外方案的结果结合起来提高性能的好处，这一过程被称为“数据融合”。红外文献中报道了许多成功的数据融合实验，但也有不尽如人意的情况。因此，如果有一种理论能够提前预测两种或两种以上检索方案的融合是否值得进行，那将是非常有价值的。在之前的研究中(Ng & Kantor, 1998)，我们确定了融合有效性的两个预测变量:(a)基于列表的输出不相似性度量，以及(b)两个方案性能相似性的成对度量。在本文中，我们研究了这两个变量在简单对称数据融合中的预测能力。我们使用参与TREC 4路由任务的红外系统来训练预测数据融合有效性的模型，并使用参与TREC 5路由任务的红外系统来测试该模型。该模型问道:“什么时候核聚变会比使用每对方案中最佳方案的预言机表现得更好?”我们探索了将模型拟合到训练数据的统计技术，并使用信号检测理论的接收器工作特性曲线来表示所得模型的功率。经过训练的预测方法预测核聚变是否会击败神谕，其水平远远高于偶然实现的水平。

{"title":"Predicting the effectiveness of naïve data fusion on the basis of system characteristics","authors":"K. Ng, P. Kantor","doi":"10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1030%3E3.0.CO;2-E","DOIUrl":"https://doi.org/10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1030%3E3.0.CO;2-E","url":null,"abstract":"Effective automation of the information retrieval task has long been an active area of research, leading to sophisticated retrieval models. With many IR schemes available, researchers have begun to investigate the benefits of combining the results of different IR schemes to improve performance, in the process called “data fusion.” There are many successful data fusion experiments reported in IR literature, but there are also cases in which it did not work well. Thus, if would be quite valuable to have a theory that can predict, in advance, whether fusion of two or more retrieval schemes will be worth doing. In previous study (Ng & Kantor, 1998), we identified two predictive variables for the effectiveness of fusion: (a) a list‐based measure of output dissimilarity, and (b) a pair‐wise measure of the similarity of performance of the two schemes. In this article we investigate the predictive power of these two variables in simple symmetrical data fusion. We use the IR systems participating in the TREC 4 routing task to train a model that predicts the effectiveness of data fusion, and use the IR systems participating in the TREC 5 routing task to test that model. The model asks, “when will fusion perform better than an oracle who uses the best scheme from each pair?” We explore statistical techniques for fitting the model to the training data and use the receiver operating characteristic curve of signal detection theory to represent the power of the resulting models. The trained prediction methods predict whether fusion will beat an oracle, at levels much higher than could be achieved by chance.","PeriodicalId":50013,"journal":{"name":"Journal of the American Society for Information Science and Technology","volume":"2 1","pages":"1177-1189"},"PeriodicalIF":0.0,"publicationDate":"2000-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86530382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 92

Bibliometric information retrieval system (BIRS): A web search interface utilizing bibliometric research results 文献计量学信息检索系统(BIRS):利用文献计量学研究成果的网络检索界面

Journal of the American Society for Information Science and Technology

Pub Date : 2000-11-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1031%3E3.0.CO;2-B

Ying Ding, G. Chowdhury, S. Foo, Weizhong Qian

The aim of this article is to test whether the results obtained from a specific bibliographic research can be applied to a real search environment and enhance the level of utility of an information retrieval session for all levels of end users. In this respect, a Web-based Bibliometric Information Retrieval System (BIRS) has been designed and created, with facilities to assist the end users to get better understanding of their search domain, formulate and expand their search queries, and visualize the bibliographic research results. There are three specific features in the system design of the BIRS: the information visualization feature of the BIRS (cocitation maps) to guide the end users to identify the important research groups and capture the detailed information about the intellectual structure of the search domain; the multilevel browsing feature to allow the end users to go to different levels of interesting topics; and the common user interface feature to enable the end users to search all kinds of databases regardless of different searching systems, different working platforms, different database producer and supplier, such as different Web search engines, different library OPACs, or different on-line databases. A preliminary user evaluation study of BIRS revealed that users generally found it easy to form and expand their queries, and that BIRS helped them acquire useful background information about the search domain. They also pointed out aspects of information visualization, multilevel browsing, and common user interface as novel characteristics exhibited by BIRS.

本文的目的是测试从特定书目研究中获得的结果是否可以应用于实际的搜索环境，并提高信息检索会话对各级最终用户的效用水平。在这方面，我们设计和建立了一个基于网络的文献计量信息检索系统(BIRS)，该系统的设施可以帮助最终用户更好地了解他们的搜索领域，制定和扩展他们的搜索查询，并将书目研究结果可视化。在BIRS的系统设计中有三个具体的特点:BIRS的信息可视化特征，引导最终用户识别重要的研究群体，获取搜索领域的智力结构的详细信息;多层次浏览功能，让最终用户可以浏览不同层次的有趣话题;以及通用的用户界面特性，使最终用户能够搜索各种数据库，而不管不同的搜索系统、不同的工作平台、不同的数据库生产商和供应商，如不同的Web搜索引擎、不同的图书馆opac或不同的在线数据库。一项对BIRS的初步用户评价研究表明，用户普遍认为BIRS很容易形成和扩展他们的查询，并且BIRS帮助他们获得有关搜索域的有用背景信息。他们还指出BIRS具有信息可视化、多层次浏览和通用用户界面等新特点。

{"title":"Bibliometric information retrieval system (BIRS): A web search interface utilizing bibliometric research results","authors":"Ying Ding, G. Chowdhury, S. Foo, Weizhong Qian","doi":"10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1031%3E3.0.CO;2-B","DOIUrl":"https://doi.org/10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1031%3E3.0.CO;2-B","url":null,"abstract":"The aim of this article is to test whether the results obtained from a specific bibliographic research can be applied to a real search environment and enhance the level of utility of an information retrieval session for all levels of end users. In this respect, a Web-based Bibliometric Information Retrieval System (BIRS) has been designed and created, with facilities to assist the end users to get better understanding of their search domain, formulate and expand their search queries, and visualize the bibliographic research results. There are three specific features in the system design of the BIRS: the information visualization feature of the BIRS (cocitation maps) to guide the end users to identify the important research groups and capture the detailed information about the intellectual structure of the search domain; the multilevel browsing feature to allow the end users to go to different levels of interesting topics; and the common user interface feature to enable the end users to search all kinds of databases regardless of different searching systems, different working platforms, different database producer and supplier, such as different Web search engines, different library OPACs, or different on-line databases. A preliminary user evaluation study of BIRS revealed that users generally found it easy to form and expand their queries, and that BIRS helped them acquire useful background information about the search domain. They also pointed out aspects of information visualization, multilevel browsing, and common user interface as novel characteristics exhibited by BIRS.","PeriodicalId":50013,"journal":{"name":"Journal of the American Society for Information Science and Technology","volume":"20 1","pages":"1190-1204"},"PeriodicalIF":0.0,"publicationDate":"2000-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81152768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 49

Protein annotators' assistant: A novel application of information retrieval techniques 蛋白质注释者的助手:信息检索技术的新应用

Journal of the American Society for Information Science and Technology

Pub Date : 2000-10-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1020%3E3.0.CO;2-F

M. Wise

The Protein Annotators' Assistant (or PAA) (http://www.ebi.ac.uk/paa/) is a software system which assists protein annotators in the task of assigning functions to newly sequenced proteins. Working backward from SwissProt, a database which describes known proteins, and a prior sequence similarity search that returns a list of known proteins similar to a query, PAA suggests keywords and phrases which may describe functions performed by the query. In a preprocessing step, a database is built from the protein names that appear in the SwissProt database, and against each protein are listed key words and phrases that are extracted from the corresponding text records. Common words either in general English usage or from the biological domain are removed as the phrases are assembled. This process is assisted by the use of a simple stemming algorithm, which extends the list of stop‐words (i.e., reject words), together with a list of accept‐words. At runtime, the search algorithm, invoked by a user via a Web interface, takes a list of protein names and clusters the named proteins around keywords/phrases shared by members of the list. The assumption is that if these proteins have a particular keyword/phrase in common, and they are related to a query protein, then the keyword/phrase may also describe the query. Overall, PAA employs a number of IR techniques in a novel setting and is thus related to text categorization, where multiple categories may be suggested, except that in this case none of the categories are specified in advance.

Protein Annotators' Assistant(或PAA) (http://www.ebi.ac.uk/paa/)是一个软件系统，它可以帮助蛋白质注释者为新测序的蛋白质分配功能。PAA从SwissProt(一个描述已知蛋白质的数据库)和返回与查询相似的已知蛋白质列表的先前序列相似性搜索向后工作，建议可以描述查询执行的功能的关键字和短语。在预处理步骤中，根据出现在SwissProt数据库中的蛋白质名称建立数据库，并针对每个蛋白质列出从相应文本记录中提取的关键词和短语。在短语的组装过程中，无论是一般英语用法中的常用词还是来自生物领域的常用词都会被删除。该过程通过使用简单的词干提取算法来辅助，该算法扩展了停止词列表(即拒绝词)以及接受词列表。在运行时，由用户通过Web界面调用的搜索算法获取一个蛋白质名称列表，并将命名的蛋白质聚集在列表成员共享的关键字/短语周围。假设这些蛋白质有一个共同的关键字/短语，并且它们与查询蛋白质相关，那么关键字/短语也可以描述查询。总的来说，PAA在新的设置中使用了许多IR技术，因此与文本分类有关，其中可能建议使用多个类别，只是在这种情况下没有预先指定任何类别。

{"title":"Protein annotators' assistant: A novel application of information retrieval techniques","authors":"M. Wise","doi":"10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1020%3E3.0.CO;2-F","DOIUrl":"https://doi.org/10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1020%3E3.0.CO;2-F","url":null,"abstract":"The Protein Annotators' Assistant (or PAA) (http://www.ebi.ac.uk/paa/) is a software system which assists protein annotators in the task of assigning functions to newly sequenced proteins. Working backward from SwissProt, a database which describes known proteins, and a prior sequence similarity search that returns a list of known proteins similar to a query, PAA suggests keywords and phrases which may describe functions performed by the query. In a preprocessing step, a database is built from the protein names that appear in the SwissProt database, and against each protein are listed key words and phrases that are extracted from the corresponding text records. Common words either in general English usage or from the biological domain are removed as the phrases are assembled. This process is assisted by the use of a simple stemming algorithm, which extends the list of stop‐words (i.e., reject words), together with a list of accept‐words. At runtime, the search algorithm, invoked by a user via a Web interface, takes a list of protein names and clusters the named proteins around keywords/phrases shared by members of the list. The assumption is that if these proteins have a particular keyword/phrase in common, and they are related to a query protein, then the keyword/phrase may also describe the query. Overall, PAA employs a number of IR techniques in a novel setting and is thus related to text categorization, where multiple categories may be suggested, except that in this case none of the categories are specified in advance.","PeriodicalId":50013,"journal":{"name":"Journal of the American Society for Information Science and Technology","volume":"31 1","pages":"1131-1136"},"PeriodicalIF":0.0,"publicationDate":"2000-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89250036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

What is a collection? 什么是集合?

Journal of the American Society for Information Science and Technology

Pub Date : 2000-10-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1018%3E3.0.CO;2-T

Hur-Li Lee

Advances in information technology have dramatically changed information seeking, and necessitate an examination of traditional conceptions of library collection. This article addresses the task and reveals four major presumptions associated with collections: tangibility, ownership, a user community, and an integrated retrieval mechanism. Some of these presumptions have served only to perpetuate misconceptions of collection. Others seem to have become more relevant in the current information environment. The emergence of nontraditional media, such as the World Wide Web (WWW), poses two specific challenges: to question the necessity of finite collections, and contest the boundaries of a collection. A critical analysis of these issues results in a proposal for an expanded concept of collection that considers the perspectives of both the user and the collection developer, invites rigorous user-centered research, and looks at the collection as an information-seeking context.

信息技术的进步极大地改变了信息搜索，有必要对图书馆馆藏的传统观念进行审查。本文讨论了这项任务，并揭示了与集合相关的四个主要假设:有形性、所有权、用户社区和集成检索机制。其中一些假设只会使对收集的误解永久化。其他的似乎在当前的信息环境中变得更加相关。非传统媒体的出现，如万维网(WWW)，提出了两个具体的挑战:质疑有限收藏的必要性，以及挑战收藏的边界。对这些问题进行批判性分析后，我们提出了扩展集合概念的建议，该概念考虑了用户和集合开发人员的观点，邀请了严格的以用户为中心的研究，并将集合视为信息查找上下文。

引用次数: 60

Partial orders and measures for language preferences 语言偏好的部分顺序和度量

Journal of the American Society for Information Science and Technology

Pub Date : 2000-10-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1014%3E3.0.CO;2-4

L. Egghe, R. Rousseau

Relative own-language preference depends on two parameters: the publication share of the language, and the self-citing rate. Openness of language L with respect to language J depends on three parameters: the publication share of language L, the publication share of language J, and the citation share of language J among all citations given by language L. It is shown that the relative own-language preference and the openness of one language with respect to another one, can be represented by a partial order. This partial order can be represented by a polygonal line (for the relative own-language preference) or a three-dimensional solid (for openness), somewhat in the same spirit as the Lorenz curve for concentration and evenness. Any function used to measure relative own-language preference or openness of one language with respect to another one should at least respect the corresponding partial orders. This is a minimum requirement for such measures. Depending on the use one wants to make of these measures other requirements become necessary. A logarithmic dependence on the language share(s) seems a natural additional requirement. This would correspond with the logarithmic behavior of psychophysical sensations. We give examples of normalized functions satisfying this additional requirement. It is further investigated if openness partial orders can lead to measures for relative own-language preference. The article ends with some examples related to the language use in some sociological journals.

相对自己的语言偏好取决于两个参数:语言的发表份额和自引用率。语言L相对于语言J的开放性取决于三个参数:语言L的发表份额、语言J的发表份额和语言J在语言L给出的所有引用中的引用份额。结果表明，一种语言相对于另一种语言的开放性和相对于另一种语言的相对自身语言偏好可以用偏序来表示。这种偏序可以用多边形线(对于相对自己的语言偏好)或三维立体(对于开放性)来表示，在某种程度上与洛伦兹曲线的集中和均匀性相同。任何用来衡量一种语言相对于另一种语言的相对偏好或开放性的函数都至少应该尊重相应的部分顺序。这是此类措施的最低要求。根据想要使用这些措施的不同，其他要求也变得必要。对语言共享的对数依赖似乎是一个自然的附加要求。这将与心理物理感觉的对数行为相对应。我们给出了满足这个附加要求的归一化函数的例子。本文进一步研究了开放偏序是否能导致相对自身语言偏好的测量。文章最后列举了一些社会学期刊上的语言使用实例。

{"title":"Partial orders and measures for language preferences","authors":"L. Egghe, R. Rousseau","doi":"10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1014%3E3.0.CO;2-4","DOIUrl":"https://doi.org/10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1014%3E3.0.CO;2-4","url":null,"abstract":"Relative own-language preference depends on two parameters: the publication share of the language, and the self-citing rate. Openness of language L with respect to language J depends on three parameters: the publication share of language L, the publication share of language J, and the citation share of language J among all citations given by language L. It is shown that the relative own-language preference and the openness of one language with respect to another one, can be represented by a partial order. This partial order can be represented by a polygonal line (for the relative own-language preference) or a three-dimensional solid (for openness), somewhat in the same spirit as the Lorenz curve for concentration and evenness. Any function used to measure relative own-language preference or openness of one language with respect to another one should at least respect the corresponding partial orders. This is a minimum requirement for such measures. Depending on the use one wants to make of these measures other requirements become necessary. A logarithmic dependence on the language share(s) seems a natural additional requirement. This would correspond with the logarithmic behavior of psychophysical sensations. We give examples of normalized functions satisfying this additional requirement. It is further investigated if openness partial orders can lead to measures for relative own-language preference. The article ends with some examples related to the language use in some sociological journals.","PeriodicalId":50013,"journal":{"name":"Journal of the American Society for Information Science and Technology","volume":"22 1","pages":"1123-1130"},"PeriodicalIF":0.0,"publicationDate":"2000-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83330778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

Aboutness from a commonsense perspective 从常识的角度来看

Journal of the American Society for Information Science and Technology

Pub Date : 2000-10-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1026%3E3.0.CO;2-Y

P. Bruza, D. Song, Kam-Fai Wong

Information retrieval (IR) is driven by a process that decides whether a document is about a query. Recent attempts spawned from a logic-based information retrieval theory have formalized properties characterizing “aboutness,” but no consensus has yet been reached. The proposed properties are largely determined by the underlying framework within which aboutness is defined. In addition, some properties are only sound within the context of a given IR model, but are not sound from the perspective of the user. For example, a common form of aboutness, namely overlapping aboutness, implies precision degrading properties such as compositional monotonicity. Therefore, the motivating question for this article is: independent of any given IR model, and examined within an information-based, abstract framework, what are commonsense properties of aboutness (and its dual, nonaboutness)? We propose a set of properties characterizing aboutness and nonaboutness from a commonsense perspective. Special attention is paid to the rules prescribing conservative behavior of aboutness with respect to information composition. The interaction between aboutness and nonaboutness is modeled via normative rules. The completeness, soundness, and consistency of the aboutness proof systems are analyzed and discussed. A case study based on monotonicity shows that many current IR systems are either monotonic or nonmonotonic. An interesting class of IR models, namely those that are conservatively monotonic, is identified.

信息检索(IR)由一个流程驱动，该流程决定文档是否与查询有关。最近从基于逻辑的信息检索理论衍生出来的尝试已经形式化了描述“关于性”的属性，但尚未达成共识。建议的属性在很大程度上由定义有关性的底层框架决定。此外，有些属性仅在给定IR模型的上下文中是合理的，但从用户的角度来看并不合理。例如，一种常见的关于性形式，即重叠关于性，意味着精度降低的属性，如组合单调性。因此，本文的激励问题是:独立于任何给定的IR模型，并在基于信息的抽象框架中进行检查，关于性(及其双重非关于性)的常识性属性是什么?我们从常识性的角度提出了一组表征有关性和非有关性的属性。特别注意的是关于信息构成的保守性行为的规则。关于和非关于之间的相互作用通过规范规则建模。对相关证明系统的完备性、可靠性和一致性进行了分析和讨论。基于单调性的实例分析表明，当前许多红外系统要么是单调的，要么是非单调的。识别出一类有趣的IR模型，即那些保守单调的模型。

{"title":"Aboutness from a commonsense perspective","authors":"P. Bruza, D. Song, Kam-Fai Wong","doi":"10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1026%3E3.0.CO;2-Y","DOIUrl":"https://doi.org/10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1026%3E3.0.CO;2-Y","url":null,"abstract":"Information retrieval (IR) is driven by a process that decides whether a document is about a query. Recent attempts spawned from a logic-based information retrieval theory have formalized properties characterizing “aboutness,” but no consensus has yet been reached. The proposed properties are largely determined by the underlying framework within which aboutness is defined. In addition, some properties are only sound within the context of a given IR model, but are not sound from the perspective of the user. For example, a common form of aboutness, namely overlapping aboutness, implies precision degrading properties such as compositional monotonicity. Therefore, the motivating question for this article is: independent of any given IR model, and examined within an information-based, abstract framework, what are commonsense properties of aboutness (and its dual, nonaboutness)? We propose a set of properties characterizing aboutness and nonaboutness from a commonsense perspective. Special attention is paid to the rules prescribing conservative behavior of aboutness with respect to information composition. The interaction between aboutness and nonaboutness is modeled via normative rules. The completeness, soundness, and consistency of the aboutness proof systems are analyzed and discussed. A case study based on monotonicity shows that many current IR systems are either monotonic or nonmonotonic. An interesting class of IR models, namely those that are conservatively monotonic, is identified.","PeriodicalId":50013,"journal":{"name":"Journal of the American Society for Information Science and Technology","volume":"16 1","pages":"1090-1105"},"PeriodicalIF":0.0,"publicationDate":"2000-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82621975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 59

The web as a classroom resource: Reactions from the users 网络作为课堂资源:用户的反应

Journal of the American Society for Information Science and Technology

Pub Date : 2000-10-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1017%3E3.0.CO;2-W

Andrew Large, J. Beheshti

This article presents and discusses interviews with 50 grade-6 primary school students about their experience of using the Web to find information for a class project. The children discuss the quantity and quality of textual and image information on the Web versus traditional print sources, and the reasons why they made very little use of any moving images and sound clips on the Web. They also discuss how they searched for information on the Web and the ways in which this differs from looking for information in printed sources. The children overall demonstrate a sophistication both in their appreciation of the Web's strengths and weaknesses as an information source, and in their information retrieval strategies. In their reaction to the Web compared with traditional print sources, they can be categorized as technophiles, traditionalists, or pragmatists. The results from this research study suggest that although the Web can make an important contribution to information retrieval by school students, for the time being, at any rate, a role also remains both for other electronic sources such as CD-ROMs and print materials that are targeted specifically at young users. The Web needs both a more straightforward interface and more information specifically aimed at the young before it can seriously threaten its rivals.

本文介绍并讨论了对50名六年级小学生的采访，了解他们使用网络查找班级项目信息的经历。孩子们讨论了网络上文本和图像信息的数量和质量，以及他们很少使用网络上任何动态图像和声音剪辑的原因。他们还讨论了如何在Web上搜索信息，以及这与在印刷资源中查找信息的不同之处。总的来说，孩子们在对网络作为信息源的优缺点的理解以及他们的信息检索策略方面都表现出了成熟。在他们对网络的反应与传统印刷资源的比较中，他们可以被归类为技术爱好者、传统主义者和实用主义者。这项研究的结果表明，尽管网络可以对学校学生的信息检索作出重要贡献，但至少就目前而言，其他电子资源，如cd - rom和专门针对年轻用户的印刷材料，也发挥着重要作用。互联网需要更直接的界面和更多专门针对年轻人的信息，才能真正威胁到它的竞争对手。

{"title":"The web as a classroom resource: Reactions from the users","authors":"Andrew Large, J. Beheshti","doi":"10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1017%3E3.0.CO;2-W","DOIUrl":"https://doi.org/10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1017%3E3.0.CO;2-W","url":null,"abstract":"This article presents and discusses interviews with 50 grade-6 primary school students about their experience of using the Web to find information for a class project. The children discuss the quantity and quality of textual and image information on the Web versus traditional print sources, and the reasons why they made very little use of any moving images and sound clips on the Web. They also discuss how they searched for information on the Web and the ways in which this differs from looking for information in printed sources. The children overall demonstrate a sophistication both in their appreciation of the Web's strengths and weaknesses as an information source, and in their information retrieval strategies. In their reaction to the Web compared with traditional print sources, they can be categorized as technophiles, traditionalists, or pragmatists. The results from this research study suggest that although the Web can make an important contribution to information retrieval by school students, for the time being, at any rate, a role also remains both for other electronic sources such as CD-ROMs and print materials that are targeted specifically at young users. The Web needs both a more straightforward interface and more information specifically aimed at the young before it can seriously threaten its rivals.","PeriodicalId":50013,"journal":{"name":"Journal of the American Society for Information Science and Technology","volume":"59 1","pages":"1069-1080"},"PeriodicalIF":0.0,"publicationDate":"2000-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88305267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 167

Network organizational development in the public sector: A case study of the federal emergency management administration (FEMA) 公共部门网络组织发展:联邦紧急事务管理局案例研究

Journal of the American Society for Information Science and Technology

Pub Date : 2000-09-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1004%3E3.0.CO;2-5

Robert C. Ward, G. Wamsley, Aaron Schroeder, David Robins

Research findings from the organizational theory tend to support the position that management uses Information Technology (IT) to maintain existing organizational hierachy and control. Another body of research from information technology advocates suggests that Information Technology's inherent capabilities transform organization hierarchy and control outside of management's control. In addition, advocates from governmental change toward a more responsive type of government advocate adoption of IT as a form of change mechanism. This aritcle explores these conflicting positions. The authors examines one instance of the development of a form of network organization within the federal government, and the processes of IT change that have occurred over the past 20 years. The agency selected for study is the Federal Emergency Mangagement Administration.

组织理论的研究结果倾向于支持管理层使用信息技术(IT)来维持现有的组织层次和控制的立场。信息技术倡导者的另一项研究表明，信息技术的内在能力改变了组织的层次结构和管理控制之外的控制。此外，从政府变革到更具响应性的政府类型的倡导者提倡采用IT作为变革机制的一种形式。本文探讨了这些相互冲突的立场。作者考察了联邦政府内部网络组织形式发展的一个实例，以及过去20年来发生的IT变革过程。被选中进行研究的机构是联邦紧急事务管理局。

引用次数: 22

Interactive query expansion: A user-based evaluation in a relevance feedback environment 交互式查询扩展:在相关反馈环境中基于用户的评估

Journal of the American Society for Information Science and Technology

Pub Date : 2000-09-01 DOI: 10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1002%3E3.0.CO;2-B

E. Efthimiadis

A user-centered investigation of interactive query expansion within the context of a relevance feedback system is presented in this article. Data were collected from 25 searches using the INSPEC database. The data collection mechanisms included questionnaires, transaction logs, and relevance evaluations. The results discuss issues that relate to query expansion, retrieval effectiveness, the correspondence of the on-line-to-off-line relevance judgments, and the selection of terms for query expansion by users (interactive query expansion). The main conclusions drawn from the results of the study are that: (1) one-third of the terms presented to users in a list of candidate terms for query expansion was identified by the users as potentially useful for query expansion. (2) These terms were mainly judged as either variant expressions (synonyms) or alternative (related) terms to the initial query terms. However, a substantial portion of the selected terms were identified as representing new ideas. (3) The relationships identified between the five best terms selected by the users for query expansion and the initial query terms were that: (a) 34% of the query expansion terms have no relationship or other type of correspondence with a query term; (b) 66% of the remaining query expansion terms have a relationship to the query terms. These relationships were: narrower term (46%), broader term (3%), related term (17%). (4) The results provide evidence for the effectiveness of interactive query expansion. The initial search produced on average three highly relevant documents; the query expansion search produced on average nine further highly relevant documents. The conclusions highlight the need for more research on: interactive query expansion, the comparative evaluation of automatic vs. interactive query expansion, the study of weighted Web-based or Web-accessible retrieval systems in operational environments, and for user studies in searching ranked retrieval systems in general.

本文介绍了在相关反馈系统的背景下以用户为中心的交互式查询扩展研究。使用INSPEC数据库从25次搜索中收集数据。数据收集机制包括问卷调查、事务日志和相关性评估。结果讨论了与查询扩展、检索有效性、在线与离线相关性判断的对应关系以及用户选择查询扩展术语(交互式查询扩展)相关的问题。从研究结果中得出的主要结论是:(1)在用于查询扩展的候选术语列表中，提供给用户的术语中有三分之一被用户识别为可能对查询扩展有用。(2)这些术语主要被判断为初始查询术语的变体表达(同义词)或替代(相关)术语。然而，选定的术语中有相当一部分被确定为代表新思想。(3)用户选择的5个最佳查询扩展词与初始查询词之间的关系是:(a) 34%的查询扩展词与查询词没有关系或没有其他类型的对应关系;(b) 66%的剩余查询扩展词与查询词有关系。这些关系是:狭义术语(46%)，广义术语(3%)，相关术语(17%)。(4)研究结果为交互式查询扩展的有效性提供了证据。最初的搜索平均产生三份高度相关的文件;查询扩展搜索平均产生9个高度相关的文档。这些结论强调了需要更多的研究:交互式查询扩展，自动与交互式查询扩展的比较评估，基于web或web可访问的加权检索系统在操作环境中的研究，以及搜索排名检索系统的用户研究。

{"title":"Interactive query expansion: A user-based evaluation in a relevance feedback environment","authors":"E. Efthimiadis","doi":"10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1002%3E3.0.CO;2-B","DOIUrl":"https://doi.org/10.1002/1097-4571(2000)9999:9999%3C::AID-ASI1002%3E3.0.CO;2-B","url":null,"abstract":"A user-centered investigation of interactive query expansion within the context of a relevance feedback system is presented in this article. Data were collected from 25 searches using the INSPEC database. The data collection mechanisms included questionnaires, transaction logs, and relevance evaluations. The results discuss issues that relate to query expansion, retrieval effectiveness, the correspondence of the on-line-to-off-line relevance judgments, and the selection of terms for query expansion by users (interactive query expansion). The main conclusions drawn from the results of the study are that: (1) one-third of the terms presented to users in a list of candidate terms for query expansion was identified by the users as potentially useful for query expansion. (2) These terms were mainly judged as either variant expressions (synonyms) or alternative (related) terms to the initial query terms. However, a substantial portion of the selected terms were identified as representing new ideas. (3) The relationships identified between the five best terms selected by the users for query expansion and the initial query terms were that: (a) 34% of the query expansion terms have no relationship or other type of correspondence with a query term; (b) 66% of the remaining query expansion terms have a relationship to the query terms. These relationships were: narrower term (46%), broader term (3%), related term (17%). (4) The results provide evidence for the effectiveness of interactive query expansion. The initial search produced on average three highly relevant documents; the query expansion search produced on average nine further highly relevant documents. The conclusions highlight the need for more research on: interactive query expansion, the comparative evaluation of automatic vs. interactive query expansion, the study of weighted Web-based or Web-accessible retrieval systems in operational environments, and for user studies in searching ranked retrieval systems in general.","PeriodicalId":50013,"journal":{"name":"Journal of the American Society for Information Science and Technology","volume":"37 1","pages":"989-1003"},"PeriodicalIF":0.0,"publicationDate":"2000-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81502053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 129