- Timestamp: Sep 13, 2009, 2:56:55 PM (17 years ago)
- Author: waue
- Comment: --
| v3 | v4 | |
| 6 | 6 | [[PageOutline]] |
| 7 | 7 | |
| 8 | | == 1 Hadoop Compute Command: grep == |
| | 8 | = 1 Hadoop Compute Command: grep = |
| 9 | 9 | |
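| 10 | 10 | * The grep command extracts specific characters from documents. In the Hadoop examples, it extracts the strings in the documents that contain the specified text and counts them. |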
| … |
… |
|
| 55 | 55 | |
| 56 | 56 | {{{ |
| 57 | | /opt/hadoop$ bin/hadoop fs -ls lab3_out1 |
| 58 | | /opt/hadoop$ bin/hadoop fs -cat lab3_out1/part-00000 |
| | 57 | $ bin/hadoop fs -ls lab3_out1 |
| | 58 | $ bin/hadoop fs -cat lab3_out1/part-00000 |
| 59 | 59 | }}} |
| 60 | 60 | |
| … |
… |
|
| 114 | 114 | }}} |
| 115 | 115 | |
| 116 | | == 2 Hadoop Compute Command: WordCount == |
| | 116 | = 2 Hadoop Compute Command: WordCount = |
| 117 | 117 | |
| 118 | 118 | * As its name suggests, WordCount counts the occurrences of every word and sorts the results from a to z. |
| … |
… |
|
| 122 | 122 | }}} |
| 123 | 123 | |
| 124 | | Check the output the same way as in 2.1 |
| | 124 | Check the output the same way as before |
| | 125 | |
| 125 | 126 | {{{ |
| 126 | | /opt/hadoop$ bin/hadoop fs -ls lab3_out2 |
| 127 | | /opt/hadoop$ bin/hadoop fs -cat lab3_out2/part-00000 |
| | 127 | $ bin/hadoop fs -ls lab3_out2 |
| | 128 | $ bin/hadoop fs -cat lab3_out2/part-00000 |
| 128 | 129 | }}} |
| 129 | 130 | |
| 130 | | === 2.1 More Compute Commands === |
| | 131 | = 3. Browsing Information with the Web GUI = |
| | 132 | |
| | 133 | * [http://localhost:50030 Check job status through the Map/Reduce Admin page] |
| | 134 | |
| | 135 | * [http://localhost:50070 View the computation results through the NameNode] |
| | 136 | |
| | 137 | = 4. More Compute Commands = |
| 131 | 138 | |
| 132 | 139 | List of runnable commands: |
| … |
… |
|
| 147 | 154 | |
| 148 | 155 | See [http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/examples/package-summary.html org.apache.hadoop.examples] |
| 149 | | |
| 150 | | |
| 151 | | == 3. Browsing Information with the Web GUI == |
| 152 | | |
| 153 | | * [http://localhost:50030 Map/Reduce Administration] |
| 154 | | * [http://localhost:50070 NameNode ] |
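
As a rough illustration of what the grep and WordCount examples described in this page compute, the following is a minimal local sketch in plain Python. It does not use Hadoop at all. The function names `grep_count` and `word_count`, the sample lines, and the sample pattern are all hypothetical and chosen only for illustration. Python regular expressions also differ in places from the Java regular expressions the Hadoop examples use, so treat this as an approximation of the idea, not of the actual implementation.

```python
import re
from collections import Counter


def grep_count(lines, pattern):
    """Count each distinct substring that matches `pattern` across all lines.

    Hypothetical local stand-in for the match-and-count step of the grep example.
    """
    counts = Counter()
    for line in lines:
        # Every match on the line contributes one to that matched string's count.
        counts.update(m.group(0) for m in re.finditer(pattern, line))
    return counts


def word_count(lines):
    """Count whitespace-separated words and return (word, count) pairs sorted by word.

    Hypothetical local stand-in for the WordCount example.
    """
    counts = Counter()
    for line in lines:
        counts.update(line.split())
    # Sorting by word approximates the "a to z" ordering the page describes.
    return sorted(counts.items())


if __name__ == "__main__":
    # Made-up input lines, not taken from any lab data set.
    sample = ["dfs.alpha dfs.beta", "other dfs.alpha"]
    print(dict(grep_count(sample, r"dfs\.[a-z]+")))
    print(word_count(sample))
```

On a real cluster the input lines would come from files stored in HDFS and the results would be written to an output directory such as lab3_out1 or lab3_out2, which the `bin/hadoop fs -cat` commands shown earlier then display.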