Version 9 (modified by shunfa, 14 years ago)

NutchEZ Install Test

Steps

  • Place the install shell script and the *.tar.gz archive in the same directory
  • Run install.sh
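
The two steps above can be guarded by a small pre-flight check. This is only a sketch: the function name `check_install_layout` is hypothetical and is not part of NutchEZ, and it assumes nothing about what install.sh does beyond needing a *.tar.gz next to it.

```shell
#!/bin/sh
# Hypothetical pre-flight check (not part of NutchEZ): confirm that
# install.sh and at least one *.tar.gz archive sit in the same directory.
check_install_layout() {
    dir="${1:-.}"
    if [ ! -f "$dir/install.sh" ]; then
        echo "missing: $dir/install.sh" >&2
        return 1
    fi
    # Look for at least one archive next to the script.
    for archive in "$dir"/*.tar.gz; do
        if [ -f "$archive" ]; then
            return 0
        fi
    done
    echo "missing: no *.tar.gz found in $dir" >&2
    return 1
}

# Example usage (not executed here; the directory is a placeholder):
#   check_install_layout /path/to/dir && (cd /path/to/dir && ./install.sh)
```

Running the check first gives a clear message when the archive was left in another directory, instead of a failure partway through the installer.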

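Post-installation checks

Path | Items to check
/home/nutchuser/nutchez/source | client install file (check the IP and hostname), client archive
/etc/hosts | entries with the same hostname must be commented out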
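
The /etc/hosts check can be done by listing every uncommented line that names the machine. The sketch below assumes that "the same hostname" means the host's own name appearing on more than one active line. The function name `list_host_entries` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print every uncommented line of a hosts file that
# names the given host. More than one line of output suggests a duplicate
# entry that should be commented out.
list_host_entries() {
    hosts_file="$1"
    name="$2"
    awk -v name="$name" '
        $1 ~ /^#/ { next }                          # skip commented-out lines
        {
            for (i = 2; i <= NF; i++) {
                if (substr($i, 1, 1) == "#") break  # trailing comment begins
                if ($i == name) { print; break }
            }
        }
    ' "$hosts_file"
}

# Example usage (not executed here):
#   list_host_entries /etc/hosts "$(hostname)"
```

The helper only reports the lines. Deciding which one to comment out is left to whoever reviews the file.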

Testing

Ubuntu 10.04

  • The Java check could print the following message to remind the user of the troubleshooting steps:
    add-apt-repository "deb http://archive.canonical.com/ lucid partner"
    apt-get update
    apt-get install sun-java6-jdk sun-java6-plugin
    update-java-alternatives -s java-6-sun
    

Ubuntu 9.10

  • The Java check could print the following message to remind the user of the troubleshooting steps:
    apt-get install sun-java6-jdk sun-java6-plugin
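
One way the installer's Java check might print these reminders is sketched below. The function names `print_java_hint` and `check_java` and the message wording are assumptions, not taken from install.sh. The commands are the ones listed for each release in these notes.

```shell
#!/bin/sh
# Hypothetical helper: print the Java troubleshooting steps recorded in
# these notes for a given Ubuntu release number.
print_java_hint() {
    release="$1"
    echo "Java was not found. To install Sun Java 6, run as root:"
    case "$release" in
        10.04)
            echo '  add-apt-repository "deb http://archive.canonical.com/ lucid partner"'
            echo '  apt-get update'
            echo '  apt-get install sun-java6-jdk sun-java6-plugin'
            echo '  update-java-alternatives -s java-6-sun'
            ;;
        9.10)
            echo '  apt-get install sun-java6-jdk sun-java6-plugin'
            ;;
        *)
            echo "  (no steps recorded for Ubuntu $release)"
            ;;
    esac
}

# Show the hint only when no java binary is on PATH.
check_java() {
    if ! command -v java >/dev/null 2>&1; then
        print_java_hint "$1"
        return 1
    fi
}

# Example usage on Ubuntu (not executed here):
#   check_java "$(lsb_release -rs)"
```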
    

Execution

2010/06/10

10/06/10 16:58:42 INFO mapred.JobClient: Task Id : attempt_201006091555_0003_r_000000_0, Status : FAILED
Shuffle Error: Exceeded MAX_FAILED_UNIQUE_FETCHES; bailing-out.
10/06/10 16:58:53 INFO mapred.JobClient: Task Id : attempt_201006091555_0003_r_000000_1, Status : FAILED
Shuffle Error: Exceeded MAX_FAILED_UNIQUE_FETCHES; bailing-out.
10/06/10 16:59:05 INFO mapred.JobClient: Task Id : attempt_201006091555_0003_r_000000_2, Status : FAILED
Shuffle Error: Exceeded MAX_FAILED_UNIQUE_FETCHES; bailing-out.
Exception in thread "main" java.io.IOException: Job failed!
	at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1232)
	at org.apache.nutch.crawl.Generator.generate(Generator.java:472)
	at org.apache.nutch.crawl.Generator.generate(Generator.java:409)
	at org.apache.nutch.crawl.Crawl.main(Crawl.java:116)
nutch crawl is error