Adding the Dimension of Time to HTTP

Michael L. Nelson, H. Sompel
Published in: The SAGE Handbook of Web History. DOI: 10.4135/9781526470546.n14 (https://doi.org/10.4135/9781526470546.n14)
Citations: 5

Abstract

While the web is distributed, most web archives are centralized silos that do not cooperate with each other. This is partially because the technology that is necessary to replay the archived content and keep it from being influenced by material on the live web also makes it difficult for web archives to cooperate. The Memento Protocol (which we played a central role in defining) addresses this problem by defining an extension to the Hypertext Transfer Protocol (HTTP) that allows for standardized, machine-readable integration of both the past web and the present web. The Memento Protocol extends the concept of HTTP content negotiation to include not only well-known dimensions such as Multipurpose Internet Mail Extensions (MIME) types (e.g., JPEG vs. PNG) and file encodings (e.g., gzip vs. compress), but also the dimension of Coordinated Universal Time (UTC) as a universal versioning system. The protocol can be supported by all systems that hold temporal resource versions, including conventional web archives, as well as resource versioning systems such as wikis. The Memento Protocol introduces some standard terminology with which to discuss web archiving, the most fundamental of which are: Original Resource (the resource on the live web), Memento (an archived version of an Original Resource, frozen in time), TimeGate (a resource capable of datetime content negotiation to discover a temporally appropriate Memento), and TimeMap (a machine-readable list of all Mementos for an Original Resource). Furthermore, the Memento Protocol is the first web archiving API, enabling aggregation of access to disparate web archives. Web archiving has been dominated by the Internet Archive’s Wayback Machine, but via the Memento Protocol it is possible to leverage the more than a dozen publicly accessible web archives throughout the world for increased completeness, consistency, verifiability, resilience, and availability.
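To make the idea of datetime content negotiation concrete, the following is a minimal sketch rather than the authors' implementation. It assumes the Accept-Datetime request header defined in the Memento specification (RFC 7089), which the abstract itself does not name, and it models a TimeMap as a plain Python list. The function names, the archive.example.org URIs, and the "closest in time" selection rule are illustrative assumptions; a real TimeGate is free to use its own selection heuristic.

```python
from datetime import datetime, timezone
from email.utils import format_datetime


def accept_datetime_header(when):
    """Format a timezone-aware datetime as an HTTP date in GMT.

    This is the value a client would send in an Accept-Datetime header.
    """
    return format_datetime(when.astimezone(timezone.utc), usegmt=True)


def select_memento(timemap, requested):
    """Toy TimeGate: return the (datetime, uri) pair closest to `requested`.

    Closest-in-time is an assumed heuristic, not a rule stated in the source.
    """
    return min(timemap, key=lambda entry: abs(entry[0] - requested))


# Hypothetical TimeMap for the Original Resource http://example.com/
TIMEMAP = [
    (datetime(2005, 1, 10, tzinfo=timezone.utc),
     "https://archive.example.org/20050110/http://example.com/"),
    (datetime(2007, 6, 2, tzinfo=timezone.utc),
     "https://archive.example.org/20070602/http://example.com/"),
    (datetime(2010, 9, 15, tzinfo=timezone.utc),
     "https://archive.example.org/20100915/http://example.com/"),
]

if __name__ == "__main__":
    wanted = datetime(2007, 5, 31, 20, 35, tzinfo=timezone.utc)
    print("Accept-Datetime:", accept_datetime_header(wanted))
    print("Selected Memento:", select_memento(TIMEMAP, wanted)[1])
```

In this sketch the client expresses the desired point in time as a header value, and the TimeGate resolves it against the TimeMap to a single Memento, which mirrors how the four terms in the abstract relate to one another.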