Changes between Version 29 and Version 30 of waue/2009/nutch_install
- Timestamp:
- Apr 28, 2009, 1:17:31 PM (16 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
waue/2009/nutch_install
v29 v30 1 2 1 {{{ 3 2 #!html … … 14 13 * 解決中文亂碼問題 15 14 * 搜尋引擎不只是找網頁內的資料,也能爬到網頁內的檔案(如pdf,msword) 16 * 運行在多台node15 * 也可運行在多台node 17 16 18 17 = 環境 = … … 61 60 <property> 62 61 <name>fs.default.name</name> 63 <value>hdfs:// node01:9000/</value>62 <value>hdfs://localhost:9000/</value> 64 63 <description> </description> 65 64 </property> 66 65 <property> 67 66 <name>mapred.job.tracker</name> 68 <value> node01:9001</value>67 <value>localhost:9001</value> 69 68 <description> </description> 70 69 </property> … … 86 85 87 86 = step 2 nutch下載與安裝 = 88 87 == 2.0 設定環境變數 == 88 {{{ 89 $ sudo su - 90 # echo "export JAVA_HOME=/usr/lib/jvm/java-6-sun" >> /etc/bash.bashrc 91 # exit 92 # exit 93 }}} 89 94 == 2.1 下載 nutch 並解壓縮 == 90 95 * nutch 1.0 (2009/03/28 release ) … … 153 158 <property> 154 159 <name>http.agent.url</name> 155 <value> node01</value>160 <value>localhost</value> 156 161 <description>A URL to advertise in the User-Agent header. </description> 157 162 </property> … … 227 232 }}} 228 233 229 == 3.4 完全複製到node2 == 230 234 == 3.4 環境若要設定成叢集才要做 == 235 * 若是單機版則不用處理此節 236 * 完全複製到node2 231 237 {{{ 232 238 $ ssh node02 "sudo chown hadooper:hadooper /opt"