亚洲欧美第一页_禁久久精品乱码_粉嫩av一区二区三区免费野_久草精品视频

? 歡迎來到蟲蟲下載站! | ?? 資源下載 ?? 資源專輯 ?? 關于我們
? 蟲蟲下載站

?? lexicalcrawlmapper.html

?? 網絡爬蟲開源代碼
?? HTML
?? 第 1 頁 / 共 2 頁
字號:
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"><html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en"><head><meta http-equiv="content-type" content="text/html; charset=UTF-8" /><title>LexicalCrawlMapper xref</title><link type="text/css" rel="stylesheet" href="../../../../stylesheet.css" /></head><body><div id="overview"><a href="../../../../../apidocs/org/archive/crawler/processor/LexicalCrawlMapper.html">View Javadoc</a></div><pre><a name="1" href="#1">1</a>   <em class="comment">/*<em class="comment"> LexicalCrawlMapper</em></em><a name="2" href="#2">2</a>   <em class="comment"> * </em><a name="3" href="#3">3</a>   <em class="comment"> * Created on Sep 30, 2005</em><a name="4" href="#4">4</a>   <em class="comment"> *</em><a name="5" href="#5">5</a>   <em class="comment"> * Copyright (C) 2005 Internet Archive.</em><a name="6" href="#6">6</a>   <em class="comment"> * </em><a name="7" href="#7">7</a>   <em class="comment"> * This file is part of the Heritrix web crawler (crawler.archive.org).</em><a name="8" href="#8">8</a>   <em class="comment"> * </em><a name="9" href="#9">9</a>   <em class="comment"> * Heritrix is free software; you can redistribute it and/or modify</em><a name="10" href="#10">10</a>  <em class="comment"> * it under the terms of the GNU Lesser Public License as published by</em><a name="11" href="#11">11</a>  <em class="comment"> * the Free Software Foundation; either version 2.1 of the License, or</em><a name="12" href="#12">12</a>  <em class="comment"> * any later version.</em><a name="13" href="#13">13</a>  <em class="comment"> * </em><a name="14" href="#14">14</a>  <em class="comment"> * Heritrix is distributed in the hope that it will be useful, </em><a name="15" href="#15">15</a>  <em class="comment"> * but WITHOUT ANY WARRANTY; without even the implied warranty of</em><a name="16" href="#16">16</a>  <em class="comment"> * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the</em><a name="17" href="#17">17</a>  <em class="comment"> * GNU Lesser Public License for more details.</em><a name="18" href="#18">18</a>  <em class="comment"> * </em><a name="19" href="#19">19</a>  <em class="comment"> * You should have received a copy of the GNU Lesser Public License</em><a name="20" href="#20">20</a>  <em class="comment"> * along with Heritrix; if not, write to the Free Software</em><a name="21" href="#21">21</a>  <em class="comment"> * Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA  02111-1307  USA</em><a name="22" href="#22">22</a>  <em class="comment"> */</em><a name="23" href="#23">23</a>  <strong>package</strong> <a href="../../../../org/archive/crawler/processor/package-summary.html">org.archive.crawler.processor</a>;<a name="24" href="#24">24</a>  <a name="25" href="#25">25</a>  <strong>import</strong> java.io.BufferedReader;<a name="26" href="#26">26</a>  <strong>import</strong> java.io.File;<a name="27" href="#27">27</a>  <strong>import</strong> java.io.FileReader;<a name="28" href="#28">28</a>  <strong>import</strong> java.io.IOException;<a name="29" href="#29">29</a>  <strong>import</strong> java.io.InputStreamReader;<a name="30" href="#30">30</a>  <strong>import</strong> java.io.Reader;<a name="31" href="#31">31</a>  <strong>import</strong> java.net.URL;<a name="32" href="#32">32</a>  <strong>import</strong> java.net.URLConnection;<a name="33" href="#33">33</a>  <strong>import</strong> java.util.Iterator;<a name="34" href="#34">34</a>  <strong>import</strong> java.util.SortedMap;<a name="35" href="#35">35</a>  <strong>import</strong> java.util.TreeMap;<a name="36" href="#36">36</a>  <a name="37" href="#37">37</a>  <strong>import</strong> org.archive.crawler.datamodel.CandidateURI;<a name="38" href="#38">38</a>  <strong>import</strong> org.archive.crawler.settings.SimpleType;<a name="39" href="#39">39</a>  <strong>import</strong> org.archive.util.iterator.LineReadingIterator;<a name="40" href="#40">40</a>  <strong>import</strong> org.archive.util.iterator.RegexpLineIterator;<a name="41" href="#41">41</a>  <a name="42" href="#42">42</a>  <a name="43" href="#43">43</a>  <em>/**<em>*</em></em><a name="44" href="#44">44</a>  <em> * A simple crawl splitter/mapper, dividing up CandidateURIs/CrawlURIs</em><a name="45" href="#45">45</a>  <em> * between crawlers by diverting some range of URIs to local log files</em><a name="46" href="#46">46</a>  <em> * (which can then be imported to other crawlers). </em><a name="47" href="#47">47</a>  <em> * </em><a name="48" href="#48">48</a>  <em> * May operate on a CrawlURI (typically early in the processing chain) or</em><a name="49" href="#49">49</a>  <em> * its CandidateURI outlinks (late in the processing chain, after </em><a name="50" href="#50">50</a>  <em> * LinksScoper), or both (if inserted and configured in both places). </em><a name="51" href="#51">51</a>  <em> * </em><a name="52" href="#52">52</a>  <em> * &lt;p>Uses lexical comparisons of classKeys to map URIs to crawlers. The</em><a name="53" href="#53">53</a>  <em> * 'map' is specified via either a local or HTTP-fetchable file. Each</em><a name="54" href="#54">54</a>  <em> * line of this file should contain two space-separated tokens, the</em><a name="55" href="#55">55</a>  <em> * first a key and the second a crawler node name (which should be</em><a name="56" href="#56">56</a>  <em> * legal as part of a filename). All URIs will be mapped to the crawler</em><a name="57" href="#57">57</a>  <em> * node name associated with the nearest mapping key equal or subsequent </em><a name="58" href="#58">58</a>  <em> * to the URI's own classKey. If there are no mapping keys equal or </em><a name="59" href="#59">59</a>  <em> * after the classKey, the mapping 'wraps around' to the first mapping key.</em><a name="60" href="#60">60</a>  <em> * </em><a name="61" href="#61">61</a>  <em> * &lt;p>One crawler name is distinguished as the 'local name'; URIs mapped to</em><a name="62" href="#62">62</a>  <em> * this name are not diverted, but continue to be processed normally.</em><a name="63" href="#63">63</a>  <em> * </em><a name="64" href="#64">64</a>  <em> * &lt;p>For example, assume a SurtAuthorityQueueAssignmentPolicy and</em><a name="65" href="#65">65</a>  <em> * a simple mapping file:</em><a name="66" href="#66">66</a>  <em> * </em><a name="67" href="#67">67</a>  <em> * &lt;pre></em><a name="68" href="#68">68</a>  <em> *  d crawlerA</em><a name="69" href="#69">69</a>  <em> *  ~ crawlerB</em><a name="70" href="#70">70</a>  <em> * &lt;/pre></em><a name="71" href="#71">71</a>  <em> * &lt;p>All URIs with "com," classKeys will find the 'd' key as the nearest</em><a name="72" href="#72">72</a>  <em> * subsequent mapping key, and thus be mapped to 'crawlerA'. If that's</em><a name="73" href="#73">73</a>  <em> * the 'local name', the URIs will be processed normally; otherwise, the</em><a name="74" href="#74">74</a>  <em> * URI will be written to a diversion log aimed for 'crawlerA'. </em><a name="75" href="#75">75</a>  <em> * </em><a name="76" href="#76">76</a>  <em> * &lt;p>If using the JMX importUris operation importing URLs dropped by</em><a name="77" href="#77">77</a>  <em> * a {@link LexicalCrawlMapper} instance, use &lt;code>recoveryLog&lt;/code> style.</em><a name="78" href="#78">78</a>  <em> * </em><a name="79" href="#79">79</a>  <em> * @author gojomo</em><a name="80" href="#80">80</a>  <em> * @version $Date: 2006-09-26 20:38:48 +0000 (Tue, 26 Sep 2006) $, $Revision: 4667 $</em><a name="81" href="#81">81</a>  <em> */</em><a name="82" href="#82">82</a>  <strong>public</strong> <strong>class</strong> <a href="../../../../org/archive/crawler/processor/LexicalCrawlMapper.html">LexicalCrawlMapper</a> <strong>extends</strong> <a href="../../../../org/archive/crawler/processor/CrawlMapper.html">CrawlMapper</a> {<a name="83" href="#83">83</a>      <strong>private</strong> <strong>static</strong> <strong>final</strong> <strong>long</strong> serialVersionUID = 1L;<a name="84" href="#84">84</a>      <a name="85" href="#85">85</a>      <em>/**<em>* where to load map from */</em></em><a name="86" href="#86">86</a>      <strong>public</strong> <strong>static</strong> <strong>final</strong> String ATTR_MAP_SOURCE = <span class="string">"map-source"</span>;

?? 快捷鍵說明

復制代碼 Ctrl + C
搜索代碼 Ctrl + F
全屏模式 F11
切換主題 Ctrl + Shift + D
顯示快捷鍵 ?
增大字號 Ctrl + =
減小字號 Ctrl + -
亚洲欧美第一页_禁久久精品乱码_粉嫩av一区二区三区免费野_久草精品视频
国内成+人亚洲+欧美+综合在线| 中文字幕va一区二区三区| 国产精品综合久久| 一区二区三区四区精品在线视频| 精品欧美久久久| 欧美系列在线观看| 一本大道久久精品懂色aⅴ| 精品一区在线看| 美女脱光内衣内裤视频久久网站| 国产精品欧美一级免费| 精品国产一区二区三区四区四| 欧美在线制服丝袜| av中文字幕在线不卡| 国产在线播精品第三| 无码av免费一区二区三区试看| 最新欧美精品一区二区三区| 精品久久国产字幕高潮| 欧美久久久久中文字幕| 91国内精品野花午夜精品| 成人污视频在线观看| 国产一区二区按摩在线观看| 日本在线不卡视频| 亚洲国产综合色| 一区二区欧美在线观看| 亚洲人成亚洲人成在线观看图片| 国产欧美视频在线观看| 久久综合狠狠综合久久激情| 日韩色在线观看| 91精品国模一区二区三区| 91成人在线免费观看| 91在线一区二区| 91视频.com| 欧美在线视频全部完| 欧美亚洲国产一区二区三区va| 一本大道综合伊人精品热热 | 亚洲国产视频一区二区| 亚洲欧美aⅴ...| 亚洲乱码精品一二三四区日韩在线| 国产精品丝袜一区| 国产精品美女久久久久久久久| 亚洲国产精品成人综合色在线婷婷| 久久久亚洲高清| 国产亚洲精品bt天堂精选| 国产日韩精品一区二区浪潮av| 久久久精品影视| 国产精品欧美一区二区三区| 亚洲人成人一区二区在线观看| 麻豆久久一区二区| 秋霞影院一区二区| 久久狠狠亚洲综合| 国产高清不卡一区| 成人av在线看| 色成年激情久久综合| 91精品黄色片免费大全| 久久综合丝袜日本网| 国产精品人成在线观看免费| 亚洲视频你懂的| 午夜精品免费在线观看| 蜜桃视频第一区免费观看| 国产美女在线精品| 一本色道a无线码一区v| 欧美日韩亚洲综合在线| 26uuu色噜噜精品一区二区| 国产精品久久夜| 午夜视频久久久久久| 久久成人av少妇免费| 成人精品免费视频| 欧美丝袜自拍制服另类| 精品久久人人做人人爱| 国产精品美女久久久久aⅴ| 亚洲精品中文在线观看| 视频一区二区中文字幕| 国产美女精品一区二区三区| 色噜噜偷拍精品综合在线| 7777精品伊人久久久大香线蕉 | 一区二区三区精品| 麻豆91免费观看| av电影天堂一区二区在线| 欧美肥妇毛茸茸| 国产精品无人区| 视频一区欧美精品| 成人app软件下载大全免费| 欧美日韩亚洲综合在线 欧美亚洲特黄一级 | 亚洲女性喷水在线观看一区| 天堂在线亚洲视频| 成人免费看视频| 91精品久久久久久蜜臀| 最新日韩在线视频| 久久精品99国产精品| 在线精品视频免费播放| 久久只精品国产| 天堂久久久久va久久久久| 成人免费观看av| 日韩免费在线观看| 亚洲国产精品久久不卡毛片| 成人午夜视频在线观看| 日韩欧美在线综合网| 亚洲精品午夜久久久| 国产精品77777竹菊影视小说| 欧美日本不卡视频| 亚洲欧美日韩成人高清在线一区| 韩国精品久久久| 欧美日本在线观看| 国产精品久久久久一区二区三区共| 日本视频在线一区| 欧美日韩一区在线观看| 亚洲欧洲精品一区二区三区不卡 | 一本一本大道香蕉久在线精品 | 色先锋aa成人| 日本一区二区三区久久久久久久久不 | 国产综合久久久久久久久久久久| 欧美三级资源在线| 亚洲免费三区一区二区| 风间由美一区二区三区在线观看| 日韩一级片在线观看| 亚洲高清免费视频| 在线精品视频免费播放| 亚洲欧美一区二区三区极速播放| 国产夫妻精品视频| 久久精品亚洲乱码伦伦中文| 国产在线播精品第三| 日韩免费成人网| www.久久精品| 国产精品国产三级国产专播品爱网| 国产一区二区三区精品视频| 精品日韩99亚洲| 美美哒免费高清在线观看视频一区二区 | 日韩精品一区二区在线观看| 日日夜夜精品视频天天综合网| 欧美性生活久久| 一二三区精品视频| 欧美视频一区二区三区四区| 亚洲图片自拍偷拍| 欧美在线观看视频一区二区| 亚洲无人区一区| 欧美精品1区2区3区| 天堂蜜桃91精品| 日韩欧美亚洲另类制服综合在线| 日韩综合小视频| 欧美成人精品二区三区99精品| 免费人成精品欧美精品| 精品久久国产老人久久综合| 久久99精品国产麻豆婷婷洗澡| 欧美变态口味重另类| 精品一区二区三区蜜桃| 久久久国产一区二区三区四区小说 | www.日韩精品| 亚洲欧美日韩久久精品| 在线观看免费视频综合| 天天操天天色综合| 精品嫩草影院久久| 国产福利一区二区三区| 国产精品久久久久久久久免费樱桃| 99这里只有精品| 亚洲一二三四在线| 日韩一区二区三区视频在线观看| 精品一区二区三区欧美| 中文欧美字幕免费| 欧亚洲嫩模精品一区三区| 视频一区在线视频| 久久久精品国产免费观看同学| 成人黄色小视频在线观看| 亚洲午夜精品一区二区三区他趣| 欧美一区二区三区四区视频| 国产高清无密码一区二区三区| 最新国产成人在线观看| 91麻豆精品国产综合久久久久久| 狠狠色丁香婷婷综合| 亚洲摸摸操操av| 日韩欧美123| 91蝌蚪porny九色| 美国欧美日韩国产在线播放| 国产精品天干天干在观线| 欧美三级视频在线播放| 极品少妇xxxx精品少妇偷拍| 亚洲欧美日韩国产成人精品影院| 欧美一区二区视频免费观看| 成人av影院在线| 日本不卡在线视频| 国产精品欧美极品| 91精品国产综合久久久久久久 | 麻豆精品一区二区三区| 国产精品午夜春色av| 欧美日韩国产综合一区二区三区 | 精品国产一区二区亚洲人成毛片 | 欧美精品日韩综合在线| 成人一区二区视频| 美女视频第一区二区三区免费观看网站| 中文字幕免费观看一区| 3d动漫精品啪啪1区2区免费 | 欧美va亚洲va在线观看蝴蝶网| 成人av影院在线| 狠狠网亚洲精品| 亚洲成人av资源| 国产精品短视频| 久久综合九色综合97婷婷| 欧美老年两性高潮| 99精品国产91久久久久久| 激情五月婷婷综合网| 亚洲国产精品一区二区久久 |