close
Warning:
Can't synchronize with repository "(default)" (Unsupported version control system "svn": /usr/lib/python2.7/dist-packages/libsvn/_delta.so: failed to map segment from shared object: Cannot allocate memory). Look in the Trac log for more information.
- Timestamp:
-
Jun 2, 2013, 10:30:42 AM (13 years ago)
- Author:
-
jazz
- Comment:
-
--
Legend:
- Unmodified
- Added
- Removed
- Modified
-
|
v16
|
v17
|
|
| 18 | 18 | == Solr / Lucene in Practice == |
| 19 | 19 | |
| 20 | | * Threat Connect - http://docs.trendmicro.com/all/ent/tc/en-us/tc_olh/abt-tc.html |
| | 20 | * Threat Connect (TC) - http://docs.trendmicro.com/all/ent/tc/en-us/tc_olh/abt-tc.html |
| 21 | 21 | - Sandbox Report - 1.2M reports / 2.4TB / Hadoop |
| 22 | 22 | - PAFI ( virus scan results ) - 50M reports / 514 GB / HBase |
| 23 | | - Census (? 300GB) |
| | 23 | - Census (? Index Size : 300GB) |
| 24 | 24 | - Sandbox VM - Windows (?) - pcap (network packet) / screenshot - 8GB/day, 3000 malware - 存在 HDFS |
| 25 | 25 | - Similarity Search 相似度搜尋 |
| 26 | 26 | - 將 log 透過 MR Job 或 Pig 存成 Lucene Index (?),再匯入 Solr (Index Size: 6GB) |
| 27 | 27 | - 缺點:無法做到遞增索引更新(incremental index update)(也得看是否能區隔遞增的更新資料(incremental data update(?))) |
| 28 | | - |
| | 28 | - Q1: Census 是自建的系統? |
| | 29 | - Q2: Sandbox 是 Windows VM? malware 是否會故意避開 VM? |
| | 30 | - Q3: 蒐集到的 Sandbox 資料是否有遞增的特性? |
| | 31 | * 如何使用 Solr / Lucene 到 Threat Connect (TC) |
| | 32 | - Q: 必須自己寫 Web UI (RESTful API)? |