亚洲欧美第一页_禁久久精品乱码_粉嫩av一区二区三区免费野_久草精品视频

? 歡迎來到蟲蟲下載站! | ?? 資源下載 ?? 資源專輯 ?? 關于我們
? 蟲蟲下載站

?? arale.html

?? 用java寫的網絡爬蟲
?? HTML
字號:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">

<html>
<head>
<title>Arale User Manual</title>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/>
<style>
body {
background-color : #FFFFFF;
font-family: Verdana, Geneva, Arial, Helvetica, sans-serif;
font-size: x-small;
color: #000000;
}
td, p, li, a {
font-family: Verdana, Geneva, Arial, Helvetica, sans-serif;
font-size: x-small;
}
code, pre {
font-family: monospaced;
font-size: x-small;
}
</style>
</head>

<body>

<p style="font-size:small"><b>Arale User Manual</b>

<p>
author: Flavio Tordini<br>
email: <a href="mailto:flaviotordini@tiscali.it">flaviotordini@tiscali.it</a><br>
web: <a href="http://web.tiscali.it/_flat">http://web.tiscali.it/_flat</a>

<p>
<a href="#intro">Introduction</a><br>
<a href="#get">Getting Arale</a><br>
<a href="#sys">System Requirements</a><br>
<a href="#install">Installing Arale</a><br>
<a href="#run">Running Arale</a><br>
<a href="#settings">Arale settings</a><br>
<a href="#build">Building Arale</a><br>

<p><a name="intro"></a><b>Introduction</b>
<p>
Arale is a java multithreaded web spider. While many bots around are focused on page indexing, Arale is primarly designed for personal use. It fits the needs of advanced web surfers and web developers.
<p>
With Arale you can download entire web sites or specific resources from the web. Some real life cases are:<br>
<li>want to download only images, videos, mp3 or zip files from a site.</li>
<li>manuals, articles, ebooks fragmented in many files to discourage download.</li>
<li>user-unfriendly sites. Popups, banners and tricky javascripts annoying you before you can download a resource.</li>
<p>
<i>Multithreaded</i> means that Arale can download more than one file simultaneously. Arale can easily saturate your bandwith, thus providing the fastest possible download speed for your internet connection.

<p>
If you're developing dynamic sites using technologies such as JSP, PHP, ASP or whatever, you may be interested in rendering dynamic pages to static files.
Arale supports URL renaming: query string is encoded in the static filename and .html extension is appended.
let's make an example:
<p>
original URL: <code>mypage.jsp?myparam=myvalue</code><br>
static filename: <code>mypage.jsp!myparam=myvalue.html</code><br>
<p>
Existing links to renamed URLs are substituted with modified links. This preserves navigation among static files.
Once a dynamic site is trasformed into a set of static files it can be deployed on a server that does not support dynamic pages. For example you may deploy a JSP site in a free web space.

<p>
Currently Arale is a command-line tool. It would be nice to develop a GUI for it. I'd like to have some feedback from users, so if you think it's worth send me an email and tell me what you think. ;)


<p><a name="get"></a><b>Getting Arale</b>
<p>
The latest version of Arale can be downloaded from <a href="http://web.tiscali.it/_flat">http://web.tiscali.it/_flat</a>.
The distribution includes Arale sources along with building scripts (see <a href="#build">Building Arale</a>).

<p><a name="sys"></a><b>System Requirements</b>
<p>
In order to run Arale, you need the Java Development Kit (JDK) or the Java Runtime Environment (JRE) installed on your system.
Arale requires Java 2. The recommended Java version for running Arale is Java 2 version 1.3 or later.
<li><a href="http://java.sun.com/j2se/">Java Development Kit</a></li>
<li><a href="http://java.sun.com/j2se/">Java Runtime Environment</a></li>

<p><a name="install"></a><b>Installing Arale</b>
<p>
Simply extract the Arale distribution archive to a directory. Make sure you have the JAVA_HOME environment variable pointing to Java Development Kit installation directory.
As an option you may set an ARALE_OPTS environment variable. The value of ARALE_OPTS contains command line arguments that should be passed to the Java Virtual Machine when starting Arale. For example, you can define properties or set the maximum Java heap size.

The following sets up the environment on Windows:
<pre>
set JAVA_HOME=c:\jdk1.3.1
set ARALE_HOME=c:\arale
set ARALE_OPTS=-mx32m
</pre>
To complete Arale installation run <code>windows/setup.bat</code> in Arale installation directory. this will create shortcuts to Arale and will integrate Arale with Internet Explorer. Cool!
<p>
on Unix (bash):
<pre>
export JAVA_HOME=/usr/local/jdk-1.3.1
export ARALE_HOME=/usr/local/arale
export ARALE_OPTS=-mx32m
</pre>

<p><a name="run"></a><b>Running Arale</b>
<p>
Running Arale is simple, when you installed it as described in the previous section. Just type <code>arale</code> followed by an URL.
<pre>arale http://web.tiscali.it/_flat</pre>
By default Arale reads its settings from the <code>arale.properties</code> file. You can override this behaviour by typing:
<pre>arale http://web.tiscali.it/_flat -settings mysettings.properties</pre>

Command-line option summary:
<pre>
Usage: arale [&lt;URL&gt;] [&lt;options&gt;]
        -settings &lt;file&gt;: Use specified property file
        -output &lt;dir&gt;: Use specified output directory
        -version: Print Arale version and exit
        -help: Print this message and exit
</pre>


<p><a name="settings"></a><b>Arale settings</b>
<ul>
<li><b>URL</b>: start URL</li>

<li><b>output.directory</b>: this is Arale output directory. It may be a relative or an absolute path. Arale will put all downloaded files in subdirectories by recreating the directory structure found on the remote server.</li>

<li><b>download.tokens</b>: Arale will download URLs that contain these tokens. Tokens are separated by spaces. Just like this: <code>.html .gif .jpg .css</code>. </li> What <i>token</i> means? A token is a series of characters Arale will search for when scanning files. When Arale finds a token specified by this parameter, it then searches for right limit and a left limit of the ipothetic link. Then Arale tries to connect to that URL. If the resource is found then it is immediatly downloaded to disk, otherwise Arale just keeps going.</li>

<li><b>scan.tokens</b>: Arale will scan URLs that contain these tokens. Tokens are separated by spaces. URLs containing these tokens should all have a text/html content type. Resources found with these tokens will be scanned for new links. They will not be downloaded if they are not in the download.tokens list.</li>

<li><b>force.html.scanning</b>: Force scanning of resources having a text/html content type. Even if they're not listed in scan.tokens.</li>

<li><b>ensure.html.scanning</b>: Ensure that only resources having a text/html content type will be scanned. For example a dynamic resource (.jsp, .asp ...) may return any content type, not only text/html.</li>

<li><b>domain.depth</b>: This parameter represents how many domain levels deep should arale follow links. 1 means no domain change. Increasing this value will dramatically increase the number of followed links. For example 2 means Arale will crawl the starting domain plus all domains linked in the starting domain pages.</li>

<li><b>file.minsize</b>: Minimum downloaded file size. All files smaller than this value will be discarded. -1 means Arale will ignore this setting.</li>

<li><b>file.minsize</b>: Maximum downloaded file size. All files bigger than this value will be discarded. -1 means Arale will ignore this setting.</li>

<li><b>file.download.unknown.size</b>: Tells arale wheter to download files whose size cannot be predetermined. Sometimes the web server will not tell the file size, in that Arale will use this setting to decide what to do. this value may be true or false.</li>

<li><b>thread.count</b>: This is the number of threads Arale will allocate. In practice this is the number of simultaneous HTTP connections. Choosing a higher value may increase may increase processing speed, but may also stress your machine and the remote server(s). 1 is the minimum value.</li>

<li><b>pause.milliseconds</b>: The number of milliseconds to pause before starting processing the next URL. Use this setting to make Arale take a breath between URL processing. This may be useful if you're on a LAN and want to avoid creating noticeable bandwidth usage bursts. If you're rendering a dynamic site into static pages, setting this value may increase the process reliability.</li>

<li><b>rename.dynamic.files</b>: Arale will rename dynamic files such as .jsp, .asp, .php to .html. Also link to renamed resources will be substituted. Enable this setting if you're rendering a dynamic site into static pages. This is also great when downloading from dynamic sites.</li>

<li><b>url.leftdelimiters</b>: Characters that delimit a URL on the left side. This is used by Arale parsing methods. You probably will not need to change this setting.</li>

<li><b>url.rightdelimiters</b>: Characters that delimit a URL on the right side. This is used by Arale parsing methods. You probably will not need to change this setting.</li>

</ul>


<p><a name="build"></a><b>Building Arale</b>
<p>
Arale sources are located in the archive named <code>arale-sources.zip</code>.

Arale uses the Jakarta Ant build tool.

Ant can be dowloaded from the Jakarta Project site:
<a href="http://jakarta.apache.org/ant">http://jakarta.apache.org/ant</a>.
Extract Ant archive and set the ANT_HOME environment
variable to the directory you installed Ant.
Refer to Ant documentation for further details.

Once you're done with Ant, simply run the build batch file.
The Ant build script for Arale (<code>build.xml</code>) takes a number a parameters, called tasks. They are:<br>
<code>clean</code> - deletes compiled classes, javadocs, and the arale jar<br>
<code>prepare</code> - creates required directories<br>
<code>compile</code> - compiles source files<br>
<code>dist</code> - creates a jar<br>
<code>javadoc</code> - generates javadoc documentation<br>

<p><b>[end of file]</b>

</body>
</html>

?? 快捷鍵說明

復制代碼 Ctrl + C
搜索代碼 Ctrl + F
全屏模式 F11
切換主題 Ctrl + Shift + D
顯示快捷鍵 ?
增大字號 Ctrl + =
減小字號 Ctrl + -
亚洲欧美第一页_禁久久精品乱码_粉嫩av一区二区三区免费野_久草精品视频
精品国产电影一区二区| 久久99热这里只有精品| 老司机午夜精品| 91在线精品秘密一区二区| 日韩三级精品电影久久久| 国产精品久久久久久久久搜平片| 免费视频最近日韩| 色噜噜狠狠色综合中国| 中文字幕巨乱亚洲| 韩国精品在线观看| 欧美一区永久视频免费观看| 一区二区三区.www| 色综合久久综合网97色综合| 国产欧美日韩亚州综合| 精品一区免费av| 在线播放中文字幕一区| 亚洲午夜久久久久久久久久久| 成人黄色综合网站| 久久精品人人做人人综合 | 亚洲免费观看高清完整版在线观看| 久久国产综合精品| 欧美一级片免费看| 秋霞成人午夜伦在线观看| 欧美日韩精品三区| 亚洲成人av福利| 欧美久久久久久久久| 午夜影视日本亚洲欧洲精品| 欧美日韩亚洲综合在线| 午夜免费久久看| 91精品国产综合久久久蜜臀图片 | 欧美系列在线观看| 伊人开心综合网| 91精品办公室少妇高潮对白| 一区二区三区国产| 欧美三区免费完整视频在线观看| 夜夜精品浪潮av一区二区三区| 在线欧美一区二区| 亚洲第一久久影院| 91精品国产麻豆| 韩日精品视频一区| 国产欧美一区二区精品性色超碰| 成人三级伦理片| 欧美高清在线精品一区| 成人黄页在线观看| 亚洲午夜精品在线| 日韩一级完整毛片| 国产成人aaaa| 亚洲欧美欧美一区二区三区| 欧美日韩综合不卡| 激情成人综合网| 国产精品入口麻豆原神| 色综合天天综合给合国产| 亚洲自拍偷拍欧美| 日韩视频一区二区在线观看| 国产一区二区不卡在线| 亚洲品质自拍视频| 欧美一区二区三区影视| 成人一区二区视频| 亚洲妇女屁股眼交7| 久久久欧美精品sm网站| 91在线你懂得| 久久不见久久见免费视频7| 国产精品拍天天在线| 在线视频欧美精品| 国产一本一道久久香蕉| 亚洲精品免费在线| 精品国产一区二区三区av性色| 成人av在线资源| 日本强好片久久久久久aaa| 国产精品污网站| 91麻豆精品国产无毒不卡在线观看| 国产精品系列在线观看| 亚洲1区2区3区4区| 国产日韩精品久久久| 欧美巨大另类极品videosbest| 国产麻豆9l精品三级站| 石原莉奈一区二区三区在线观看 | 国产成人av电影在线| 婷婷综合五月天| 亚洲欧洲精品一区二区三区| 日韩一区二区在线看| 欧洲日韩一区二区三区| 国产盗摄视频一区二区三区| 日本一区中文字幕| 亚洲欧美另类久久久精品 | 国产不卡视频在线播放| 日韩二区三区四区| 亚洲一卡二卡三卡四卡无卡久久| 国产午夜精品一区二区三区嫩草 | 久久99精品国产麻豆婷婷洗澡| 一区二区在线观看视频| 中文字幕中文字幕在线一区 | 国产成人精品免费网站| 免费成人av在线| 五月天激情综合网| 一区二区三区四区不卡在线| 中文字幕一区二区三区在线不卡| 日韩亚洲欧美一区| 欧美久久久久久久久中文字幕| 成人午夜av影视| 韩国成人福利片在线播放| 午夜国产不卡在线观看视频| 一区二区三区四区乱视频| 亚洲免费观看视频| 亚洲黄色av一区| 亚洲欧美激情一区二区| 亚洲色图欧洲色图婷婷| 亚洲天堂中文字幕| 国产精品电影一区二区三区| 中文字幕在线观看一区| 国产精品久久久久久亚洲伦| 国产精品久久久久久久久快鸭| 欧美经典三级视频一区二区三区| 国产色一区二区| 中文字幕av一区二区三区高 | 青青草国产精品亚洲专区无| 日韩精品一卡二卡三卡四卡无卡| 亚洲不卡av一区二区三区| 亚洲一区二区黄色| 日韩高清不卡在线| 精品一区二区三区视频在线观看| 精品系列免费在线观看| 极品少妇一区二区| 国产精品 欧美精品| 99国产精品视频免费观看| 91麻豆swag| 精品视频免费看| 欧美成人综合网站| 久久婷婷久久一区二区三区| 国产日韩欧美精品在线| 亚洲欧美日韩国产综合| 午夜精品久久久久久久久久| 蜜桃精品视频在线| 国产成人精品免费在线| 在线免费亚洲电影| 精品女同一区二区| 国产精品乱人伦中文| 亚洲成人激情综合网| 久久精品国产亚洲5555| 99久久夜色精品国产网站| 欧美性生活大片视频| 欧美tickling网站挠脚心| 国产精品乱子久久久久| 日韩中文欧美在线| 不卡影院免费观看| 91精品国产欧美日韩| 国产精品久久久久久亚洲毛片 | 波多野结衣欧美| 欧美日韩在线播放三区四区| 精品播放一区二区| 亚洲尤物视频在线| 粉嫩高潮美女一区二区三区| 欧美在线啊v一区| 国产亚洲人成网站| 午夜精品一区在线观看| 成人中文字幕电影| 日韩一区二区免费在线电影| 国产精品高潮久久久久无| 美国十次综合导航| 色综合久久综合| 国产亚洲欧美激情| 日韩成人av影视| 色噜噜狠狠色综合欧洲selulu| 久久夜色精品一区| 日韩国产欧美在线播放| 色综合久久中文字幕综合网| 久久久九九九九| 免费成人性网站| 欧美高清你懂得| 亚洲色欲色欲www| 国产露脸91国语对白| 日韩视频在线你懂得| 亚洲午夜一区二区| 在线精品国精品国产尤物884a| 国产精品久久免费看| 国产综合一区二区| 日韩欧美一级二级三级 | 一卡二卡欧美日韩| 99这里都是精品| 久久久久久9999| 久久精品国产亚洲高清剧情介绍| 欧美欧美欧美欧美| 亚洲一区二区视频| 91成人在线观看喷潮| 最新热久久免费视频| 99热精品国产| 亚洲欧洲精品一区二区三区| 成人激情动漫在线观看| 国产日韩视频一区二区三区| 国产一区视频网站| 久久综合国产精品| 国产一区三区三区| 精品久久久久一区| 激情文学综合丁香| 精品久久久久久最新网址| 精品一区二区在线看| 精品国产123| 国产精品综合二区| 亚洲国产成人午夜在线一区| 成人国产亚洲欧美成人综合网|