Recovery of vanished URLs: Comparing the efficiency of Internet Archive and Google

IF 0.5 | Zone 4 (Management) | Q3 | INFORMATION SCIENCE & LIBRARY SCIENCE | Malaysian Journal of Library & Information Science | Pub Date: 2017-05-31 | DOI: 10.22452/MJLIS.VOL22NO2.3
D. V. Kumar, B. Kumar
Malaysian Journal of Library & Information Science, volume 22, pages 31-43. Journal Article. https://doi.org/10.22452/MJLIS.VOL22NO2.3
Citations: 1

Abstract

This article examines the tendency of URLs to vanish and the recovery of vanished URLs through the Internet Archive and the Google search engine. To that end, the study investigates the URLs cited in articles published in two open access LIS journals during 2009-2013. A total of 226 articles were selected. Of the 5197 citations in these articles, 1094 (21.05 percent) were URLs. The study found that 417 of the 1094 URLs (38.12 percent) were missing, while the remaining 61.88 percent were active when checked with the W3C link checker. HTTP 404 ("page not found") was by far the most common error, accounting for 54.2 percent of all HTTP error messages. The Internet Archive and the Google search engine were then used to recover the vanished URLs. The Internet Archive recovered 66.19 percent of the vanished URLs, whereas Google recovered only 30.70 percent. Recovery through the Internet Archive and Google raised the share of active URLs from 61.88 percent to 87.11 percent and 73.58 percent, respectively. The study concludes that the Internet Archive is a more efficient tool than the Google search engine for recovering vanished URLs.
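The percentages in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is not from the paper: it uses the three counts the abstract reports (5197 citations, 1094 URLs, 417 vanished URLs) and infers the number of recovered URLs from the stated recovery rates, since the abstract does not give those counts. The names `pct`, `recovered_ia`, and `recovered_google` are illustrative. Note that the 38.12 percent figure corresponds to 417 of the 1094 URL citations, not of all 5197 citations.

```python
# Counts reported in the abstract.
TOTAL_CITATIONS = 5197
URL_CITATIONS = 1094
VANISHED_URLS = 417


def pct(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


# URLs that were still active at the time of the link check.
active_urls = URL_CITATIONS - VANISHED_URLS

# ASSUMPTION: recovered counts are not reported in the abstract; they are
# inferred here from the stated recovery rates (66.19% and 30.70%).
recovered_ia = round(VANISHED_URLS * 66.19 / 100)
recovered_google = round(VANISHED_URLS * 30.70 / 100)

summary = {
    "url_share_of_citations": pct(URL_CITATIONS, TOTAL_CITATIONS),
    "vanished_share_of_urls": pct(VANISHED_URLS, URL_CITATIONS),
    "active_share_of_urls": pct(active_urls, URL_CITATIONS),
    "active_after_internet_archive": pct(active_urls + recovered_ia, URL_CITATIONS),
    "active_after_google": pct(active_urls + recovered_google, URL_CITATIONS),
}

for name, value in summary.items():
    print(f"{name}: {value}%")
```

Under these assumptions the computed shares agree with the figures quoted in the abstract, which suggests that the post-recovery rates are measured against all 1094 cited URLs rather than against the vanished subset.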
Source journal: Malaysian Journal of Library & Information Science (INFORMATION SCIENCE & LIBRARY SCIENCE)
CiteScore: 2.00
Self-citation rate: 7.70%
Articles published: 8