◢ <[wiki:Hinet120702/Lab11 實作十二]> | <[wiki:Hinet120702 回課程大綱]> ▲ | ◣
= 實作十三 Lab 13 =
[[PageOutline]]
{{{
#!html
抓抓龍安裝步驟示範
Crawlzilla installation in Practice
}}}
== 下載抓抓龍 ==
* 抓抓龍官方網站: http://crawlzilla.info
* 下載 [http://sourceforge.net/projects/crawlzilla/files/stable/Crawlzilla-1.0/Crawlzilla-1.1.1.tar.gz/download 最新穩定版 Crawlzilla 1.1.1 版本]
== 安裝抓抓龍 ==
{{{
hadoop@debian:~$ lsb_release -a
No LSB modules are available.
Distributor ID: Debian
Description: Debian GNU/Linux 6.0.3 (squeeze)
Release: 6.0.3
Codename: squeeze
hadoop@debian:~$ wget -nv http://nchc.dl.sourceforge.net/project/crawlzilla/stable/Crawlzilla-1.0/Crawlzilla-1.1.1.tar.gz
hadoop@debian:~$ tar zxvf Crawlzilla-1.1.1.tar.gz
hadoop@debian:~$ cd Crawlzilla_Install/
hadoop@debian:~/Crawlzilla_Install$ ./install
System does not has Crawlzilla.
Identify is root
Your system information is:
Debian , 6.0.3
Try to automatically install: sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect
debian will install some packages sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect
System has Sun Java 1.6 above version.
System has ssh.
System has ssh Server (sshd).
System has dialog.
Welcome to use Crawlzilla, this install program will create a new accunt and to assist you to setup the password of crawler.
Set password for crawler:
password:
keyin the password again:
password:
Master IP address is: 192.168.125.1
Master MAC address is: 08:00:27:04:6b:db
Please confirm the install infomation of above :1.Yes 2.No
1
Create crawler and change password.
unpack success!
Check and Set /etc/hosts finished.
Add Crawlzilla service to /etc/init.d
Crawlzilla will startup when booting
update-rc.d: using dependency based boot sequencing
Make the slave installation package
Formatting HDFS...
start up name node [Namenode] ...
starting namenode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-namenode-debian.out
start up job node [JobTracker] ...
starting jobtracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-jobtracker-debian.out
starting datanode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-datanode-debian.out
starting tasktracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-tasktracker-debian.out
Start up tomcat...
.....
Tomcat may not start, please use " crawlzilla " to start
Installed successfully!
You can visit the manage website :http://192.168.125.1:8080
Finish!!!
}}}