亚洲欧美第一页_禁久久精品乱码_粉嫩av一区二区三区免费野_久草精品视频

? 歡迎來到蟲蟲下載站! | ?? 資源下載 ?? 資源專輯 ?? 關于我們
? 蟲蟲下載站

?? package-summary.html

?? 網絡爬蟲開源代碼
?? HTML
?? 第 1 頁 / 共 2 頁
字號:
<p>LaHonda in below is reference to meeting of John, Gordon and Stack atLaHonda Cafe on 16th St., on August 8th, 2006.</p><ul><li>Leave off 9.2 GZIP extra fields. Big section on implementing an optionthat has little to do with WARCing. AGREED at LaHonda.</li><li>But, we need to mark gzipped files as being WARC: i.e. that the GZIP is a member per resource. Its useful so readers know how to invokeGZIP (That it has to be done once to get at any record or just need todo per record). Suggest adding GZIP extra field in HEAD ofGZIP member that says 'WARC' (ARC has such a thing currently). NOT NECESSARY per LaHonda meeting.</li><li>IP-Address for dns resource is DNS Server.  Add note to this effect in8.2 DNS.</li><li>Section 6. is truncated -- missing text.  What was intended here? SEEISO DOC.</li><li>In-line ANVL definition (From Kunze).  Related, can labels haveCTLs such as CRLF (Shouldn't)?  When says 'control-chars', does this includeUNICODE control characters (Should)? CHAR is described as ASCII/UTF-8 but theyare not same (Should be UTF-8).  ANVL OR NOT STILL UP IN AIR AFTER LaHonda.Postpone to 0.11 revision.</li><li>Fix examples. Use output of experimental ARC Writer.</li><li>Fix ambiguity in spec. pertaining to 'smallest possible anvl-fields' notcited by Mads Alhof Kristiansen in <a href="ftp://ftp.diku.dk/diku/semantics/papers/D-548.pdf">Digital Preservationusing the WARC File Format</a>.</li></ul><h2>Open Issues</h2><h3>Drop response record type</h3><p><code>resource</code> is sufficent. Let mimetype distingush if capture withresponse headers or not (As per comment at end of <i>8.1 HTTP and HTTPS</i>where it allows that if no response headers, use resource record type andpage mimetype rather than response type plus a mimetype of message/http: Thedifference in record types is not needed distingushing between the twotypes of capture)</p><p>Are there other capture methods that would require a response record,that don't have a mimetype that includes response headers and content?SMTP has rich MIME set to describe responses. Its request ispretty much unrecordable. NNTP and FTP similar.  Because of rich MIME, noneed of a special response type here.</p><p>Related, do we need the <code>request</code> record?Only makes sense for HTTP?</p><p>This proposal is contentious.  Gordon drew scenario where responsewould be needed distingushing local from remote capture if an archivinginstitution purposefully archived without recording headers orif the payload itself was an archived record. In opposition, was suggested thatshould an institution choose to cature in this 'unusual' mode, crawl metadatacould be used consulted to disambiguate confusion on how capture was done (Tobe further investigated.  In general, definition of record types is still in need of work).</p><h3>subject-url</h3><p>The ISO revision suggests that the positional parameter <code>subject-uri</code> be renamed.  Suggest <code>record-url<code>.</p><h3>Other issues</h3><ul><li>Should we allow freeform creation of custom Named Fields ifhave a MIME-like 'X-' or somesuch prefix?</li><li>Nothing on header-line encoding (Section 11 says UTF-8). For completeness should be US-ASCII or UTF-8, no control-chars (especiallyCR or LF), etc.</li><li><code>warcinfo</code><ul><li>What for a scheme?  Using UUID as per G suggestion.</li><li>Also, how to populate description of crawl into warcinfo?'Documentation' <code>Named Field</code> with list of URLs that can be assumedto exist somewhere in the current WARC set (We'd have to make the crawler goget them at start of a crawl).</li><li>I don't want to repeat crawl description for every WARC. How to have thiswarcinfo point at an original?  <code>related-record-id</code> seemsinsufficent.</li><li>If the crawler config. changes, can I just write a warcinfo withdifferences?  How to express?  Or better as metadata about a warcinfo?</li><li>In the pastwe used to get the filename from this URL header field when we unsure of thefilename or it was unavailable (We're reading a Stream).  Won't be able to dothat with UUID for URL.  So, introducing new warcinfo Named Field (optional)'Filename' that will be used when warcinfo is put at start of a file.Allow warcinfo to have a named parameter 'Filename'?</li></ul></li><li><code>revisit</code><ul><li>What to write?  Use a description field or just expect this info to be present in the warcinfo? Example has request header(inside XML).  Better to use associated <code>request</code> record for thiskind of info?</li><li><code>Related-Record-ID</code> (RRID) of original is likelyan onerous requirement. Envisioning an implementation where we'd write<code>revisit</code> records, we'd write such a record where content wasjudged same or where date since last fetch had not changed.  If we're towrite the RRID, then we'd have to maintain table keyed by URL with value ofpage hash or of last modified-date plus associated RRID (actual RRIDURL, not a hash).</li></ul></li><li>Should we allow a <code>Description</code> <code>Named Field</code>.E.g. I add an order file as a metadata record and associate with a<code>warcinfo</code> record.  Description field could say "This is HeritrixOrder file".  Same for seeds.  Alternative is custom XML packaging (Schemecould describe fields such as 'order' file or ANVL packaging using ANVL'comments'.</li><li>Section 11, why was it we said we don't need a parameter or explicitsubtype for special gzip WARC format?  I don't remember?   Reader needs toknow when its reading a stream.  A client would like to know so it wrotestream to disk with right suffix?  Recap. (Perhaps it was looking atthe MAGIC bytes -- if it starts with GZIP MAGIC and includes extra fieldsthat denote it WARC, thats sufficent?).</li><li>Section 7, on truncation, on 7.1, suggest values -- 'time', 'length' --but allow free form description?Leave off 'superior method of indicating truncation' paragraph.  This qualifiercould be added to all sections of doc -- that a subsequent revision of any aspect of the doc. will be superior. Rather than <code>End-Length</code>, like MIME, last record could have<code>Segment-Number-Total</code>, a count of all segments that make upcomplete record.</li></ul><p>From LaHonda, discussion of <code>revisit</code> type. Definition wastighted some by saying revisit is used when you chose not to store the capture.Was thought possible that itNOT require pointer back to an original.  Suggested it might have asimilarity judgment header -- <code>similiarity-value</code> -- with valuesbetween 0 and 1.  Might also have <code>analysis-method</code> and<code>description</code>.  Possible methods discussed included: URI same,length same, hash of content same, judgement based off content of HTTP HEADrequest, etc.  Possible payloads might be: Nothing, a diff, the hash obtained,etc.</p><h2>Unimplemented</h2><ul><li>Record Segmentation (4.8 <code>continuation</code> record typeand the 5.2 <code>Segment-*</code> Named Parameters.  Future TODO.</li><li>4.7 <code>conversion</code> type. Future TODO.</li></ul><h2>TODOs</h2><ul><li>unit tests using <code>multipart/*</code> (JavaMail) reading andwriting records? Try <code>record-id</code> as part boundary.</li><li>Performance: Need to add Record-based buffering.  GZIP'd streamshave some buffering because of the deflater but could probably dow/ more.</li></ul><P><P><DL></DL><HR><!-- ======= START OF BOTTOM NAVBAR ====== --><A NAME="navbar_bottom"><!-- --></A><A HREF="#skip-navbar_bottom" title="Skip navigation links"></A><TABLE BORDER="0" WIDTH="100%" CELLPADDING="1" CELLSPACING="0" SUMMARY=""><TR><TD COLSPAN=2 BGCOLOR="#EEEEFF" CLASS="NavBarCell1"><A NAME="navbar_bottom_firstrow"><!-- --></A><TABLE BORDER="0" CELLPADDING="0" CELLSPACING="3" SUMMARY="">  <TR ALIGN="center" VALIGN="top">  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../../overview-summary.html"><FONT CLASS="NavBarFont1"><B>Overview</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#FFFFFF" CLASS="NavBarCell1Rev"> &nbsp;<FONT CLASS="NavBarFont1Rev"><B>Package</B></FONT>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <FONT CLASS="NavBarFont1">Class</FONT>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="package-use.html"><FONT CLASS="NavBarFont1"><B>Use</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="package-tree.html"><FONT CLASS="NavBarFont1"><B>Tree</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../../deprecated-list.html"><FONT CLASS="NavBarFont1"><B>Deprecated</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../../index-all.html"><FONT CLASS="NavBarFont1"><B>Index</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../../help-doc.html"><FONT CLASS="NavBarFont1"><B>Help</B></FONT></A>&nbsp;</TD>  </TR></TABLE></TD><TD ALIGN="right" VALIGN="top" ROWSPAN=3><EM></EM></TD></TR><TR><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2">&nbsp;<A HREF="../../../../../org/archive/io/warc/package-summary.html"><B>PREV PACKAGE</B></A>&nbsp;&nbsp;<A HREF="../../../../../org/archive/net/package-summary.html"><B>NEXT PACKAGE</B></A></FONT></TD><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2">  <A HREF="../../../../../index.html?org/archive/io/warc/v10/package-summary.html" target="_top"><B>FRAMES</B></A>  &nbsp;&nbsp;<A HREF="package-summary.html" target="_top"><B>NO FRAMES</B></A>  &nbsp;&nbsp;<SCRIPT type="text/javascript">  <!--  if(window==top) {    document.writeln('<A HREF="../../../../../allclasses-noframe.html"><B>All Classes</B></A>');  }  //--></SCRIPT><NOSCRIPT>  <A HREF="../../../../../allclasses-noframe.html"><B>All Classes</B></A></NOSCRIPT></FONT></TD></TR></TABLE><A NAME="skip-navbar_bottom"></A><!-- ======== END OF BOTTOM NAVBAR ======= --><HR>Copyright &copy; 2003-2007 Internet Archive. All Rights Reserved.</BODY></HTML>

?? 快捷鍵說明

復制代碼 Ctrl + C
搜索代碼 Ctrl + F
全屏模式 F11
切換主題 Ctrl + Shift + D
顯示快捷鍵 ?
增大字號 Ctrl + =
減小字號 Ctrl + -
亚洲欧美第一页_禁久久精品乱码_粉嫩av一区二区三区免费野_久草精品视频
欧美久久久一区| 欧美日韩一区不卡| 日韩午夜av一区| 亚洲免费观看高清完整版在线 | 国产精品色在线观看| 丝袜美腿亚洲综合| 色婷婷精品久久二区二区蜜臂av| 久久综合视频网| 日韩不卡一二三区| 色域天天综合网| 中文字幕一区免费在线观看| 国产美女在线观看一区| 91精品国产aⅴ一区二区| 亚洲激情图片一区| 成人黄色软件下载| 久久久久久亚洲综合影院红桃| 日本免费新一区视频| 欧美午夜电影网| 亚洲欧美日韩国产另类专区| 国产成人精品亚洲777人妖| 日韩美女在线视频| 免费成人av资源网| 7777精品伊人久久久大香线蕉的| 一区二区三区在线观看视频| 91小宝寻花一区二区三区| 欧美精彩视频一区二区三区| 国产乱码一区二区三区| 精品国产区一区| 九九**精品视频免费播放| 日韩一卡二卡三卡四卡| 日韩专区在线视频| 欧美精品自拍偷拍| 丝袜诱惑制服诱惑色一区在线观看| 色狠狠av一区二区三区| 亚洲精品自拍动漫在线| 91影院在线观看| 亚洲欧美日韩一区| 色一情一乱一乱一91av| 一区二区三区不卡视频| 欧美亚洲国产一区二区三区va| 一区二区三区蜜桃网| 欧美性色黄大片| 午夜精品视频在线观看| 在线91免费看| 美女任你摸久久| 精品国产乱码久久久久久1区2区| 久久成人免费网站| 久久一二三国产| 国产高清久久久| 国产精品高潮呻吟久久| 99精品国产99久久久久久白柏| 国产精品剧情在线亚洲| 色香蕉久久蜜桃| 天天av天天翘天天综合网| 91精品国产一区二区| 激情综合网最新| 国产欧美精品一区二区色综合朱莉| 成人福利电影精品一区二区在线观看| 国产精品久久久久婷婷| 在线观看不卡一区| 日本亚洲视频在线| 欧美精品一区二区精品网| 成人动漫一区二区在线| 亚洲在线视频一区| 91精品国产欧美一区二区成人| 久久成人久久鬼色| 中文字幕不卡在线播放| 91国在线观看| 青青草视频一区| 欧美国产精品劲爆| 在线免费精品视频| 老司机免费视频一区二区三区| 国产婷婷一区二区| 日本道在线观看一区二区| 日韩不卡手机在线v区| 国产日韩av一区| 色天天综合色天天久久| 全国精品久久少妇| 国产精品美女久久久久久久久 | 国产91色综合久久免费分享| 亚洲欧美电影院| 91精品国产综合久久婷婷香蕉 | 日韩精品1区2区3区| 久久久久久久久伊人| 日本精品一区二区三区高清| 秋霞午夜鲁丝一区二区老狼| 国产精品理论片| 欧美日韩国产另类一区| 国产乱子伦视频一区二区三区| 一区二区三区色| 久久婷婷国产综合精品青草 | 久久精品国产亚洲a| 亚洲色欲色欲www| 欧美一级日韩不卡播放免费| 成人午夜在线视频| 日精品一区二区| 国产精品乱人伦| 日韩视频一区在线观看| 91麻豆swag| 国产在线视频不卡二| 亚洲无人区一区| 国产精品入口麻豆原神| 欧美一区日本一区韩国一区| 波多野结衣中文一区| 日韩电影网1区2区| 亚洲人成在线播放网站岛国| 日韩精品综合一本久道在线视频| 91亚洲国产成人精品一区二区三 | 亚洲欧洲成人精品av97| 日韩午夜在线影院| 色综合久久天天综合网| 国产高清精品在线| 免费在线观看不卡| 亚洲国产人成综合网站| 国产精品久久精品日日| wwwwww.欧美系列| 91精品国产综合久久久蜜臀粉嫩| 色呦呦国产精品| 成人免费毛片嘿嘿连载视频| 久久精品国产99| 午夜精品久久久久久久蜜桃app| 国产精品女上位| 国产午夜亚洲精品午夜鲁丝片 | 成人激情综合网站| 麻豆91精品视频| 日韩专区欧美专区| 亚洲一区二区三区国产| 亚洲欧美一区二区三区孕妇| 国产丝袜欧美中文另类| 久久综合给合久久狠狠狠97色69| 在线不卡欧美精品一区二区三区| 91视频一区二区三区| 大桥未久av一区二区三区中文| 国产在线精品视频| 九九久久精品视频| 老司机精品视频导航| 国产欧美视频一区二区| 日韩成人免费看| 欧美色综合网站| 国产一区日韩二区欧美三区| 婷婷开心激情综合| 亚洲成av人片| 艳妇臀荡乳欲伦亚洲一区| 亚洲欧洲精品一区二区精品久久久 | 日韩精品一二三区| 亚洲成人中文在线| 亚洲国产cao| 亚洲成人你懂的| 亚洲电影一级黄| 亚洲国产wwwccc36天堂| 亚洲成人1区2区| 亚洲1区2区3区4区| 亚洲一本大道在线| 亚洲国产综合在线| 亚洲成人免费电影| 亚洲电影一区二区三区| 水野朝阳av一区二区三区| 午夜精品一区二区三区免费视频| 性做久久久久久免费观看| 日韩综合小视频| 久久99精品一区二区三区| 韩国理伦片一区二区三区在线播放| 久久99热这里只有精品| 国产一区二区在线视频| 国产传媒日韩欧美成人| 成人丝袜视频网| 色综合天天综合网天天狠天天| 色综合久久66| 欧美二区三区91| 日韩免费电影网站| 久久久久久久久免费| 欧美激情中文字幕一区二区| 亚洲女同ⅹxx女同tv| 亚洲一本大道在线| 麻豆久久久久久久| 国产精品一区二区男女羞羞无遮挡 | 青青草成人在线观看| 久草在线在线精品观看| 国产精品综合视频| 9久草视频在线视频精品| 在线观看免费亚洲| 日韩你懂的在线播放| 国产欧美日韩久久| 亚洲一区在线观看网站| 蜜臀av性久久久久av蜜臀妖精| 国产精品影视在线观看| 91视频一区二区三区| 欧美精品乱码久久久久久| 精品国产91九色蝌蚪| 国产精品久久精品日日| 五月天激情综合网| 国产一区二区三区日韩| 色综合中文综合网| 国产欧美一区二区精品忘忧草| 日韩码欧中文字| 天堂影院一区二区| 国产成人精品1024| 欧美性欧美巨大黑白大战| 精品美女一区二区三区| 成人免费在线视频观看|