Changes between Version 1 and Version 2 of waue/2009/0609


Ignore:
Timestamp:
Jun 9, 2009, 1:24:29 PM (15 years ago)
Author:
waue
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • waue/2009/0609

    v1 v2  
    55 - 希望在完整的看完Nutch的官方網頁後,得到更好的靈感與改進方式
    66
     7 == 更多指令 ==
     8
     9 === readdb ===
     10
     11{{{
     12$ nutch readdb /tmp/search/crawldb -stats
     13
     1409/06/09 12:18:13 INFO mapred.MapTask: data buffer = 79691776/99614720
     15
     1609/06/09 12:18:13 INFO mapred.MapTask: record buffer = 262144/327680
     17
     1809/06/09 12:18:14 INFO crawl.CrawlDbReader: TOTAL urls: 1072
     1909/06/09 12:18:14 INFO crawl.CrawlDbReader: status 1 (db_unfetched):    1002
     20
     2109/06/09 12:18:14 INFO crawl.CrawlDbReader: status 2 (db_fetched):      68
     22
     23}}}
     24 === convdb ===
     25
     26 ===  ===
     27
     28 ===  ===
     29
     30 ===  ===
     31
     32 ===  ===
     33
     34 ===  ===
     35
     36 ===  ===
     37
    738 == 筆記 ==
    839
    9  - 非 nutch crawler