Changes between Version 20 and Version 21 of jazz/13-06-02


Ignore:
Timestamp:
Jun 2, 2013, 10:45:54 AM (11 years ago)
Author:
jazz
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • jazz/13-06-02

    v20 v21  
    2323   - Census (? Index Size : 300GB)
    2424   - Sandbox VM - Windows (?) - pcap (network packet) / screenshot - 8GB/day, 3000 malware - 存在 HDFS
    25    - Similarity Search 相似度搜尋
     25   - 目標:'''Similarity Search 相似度搜尋'''
    2626   - 將 log 透過 MR Job 或 Pig 存成 Lucene Index (?),再匯入 Solr (Index Size: 6GB)
    2727   - 缺點:無法做到遞增索引更新(incremental index update)(也得看是否能區隔遞增的更新資料(incremental data update(?)))