Changes between Version 14 and Version 15 of crawlzilla-1.0
- Timestamp:
- Mar 8, 2011, 4:16:35 PM (14 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
crawlzilla-1.0
v14 v15 26 26 27 27 || 目錄1 || 目錄2 || 說明 || 28 || ./user/[admin,username]/ || ./IDB/XXX/meta || admin 為必有資料夾,username 為之後新增的使用者,XXX 為新增索引庫, meta 放每個索引庫的相關檔案 || 29 || || ./IDB/XXX/index~segments || index~segments 為 lucene db 的必要五個資料夾|| 30 || || ./tmp || 該使用者正在運算的IndexDB || 31 || ./workspace || || hadoop 的運算資料夾 || 32 || ./slave/ || || 給 slave 安裝需要的檔案 || 28 || ./workspace/ || || hadoop 的運算資料夾 || 33 29 || ./meta/ || || dialog 產生的中間檔 || 34 30 || ./meta/tmp/ || || 暫存檔 || 31 || ./user/ || || 於後說明 || 32 33 * /home/crawler/crawlzilla/user 下的目錄格式說明 34 35 || 目錄1 || 目錄2 || 說明 || 36 || [admin,_username_]/ || || admin 為必有資料夾,_username_ 為之後新增的使用者 || 37 || || ./webs/ || 內放搜尋網頁的資料夾 (註1)|| 38 || || ./webs/_DBName_/ || 名稱為_DBName_的搜尋網頁 || 39 || || ./IDB/ || 內放該使用者已完成的 indexDB 資料夾 || 40 || || ./IDB/_DBName_/ || _DBName_ 為索引庫名稱 || 41 || || ./IDB/_DBName_/meta/ || meta 放每個索引庫的相關檔案 || 42 || || ./IDB/_DBName_/index~segments/ || index~segments 為 lucene db 的必要五個資料夾|| 43 || || ./tmp/ || 內放該使用者未完成的 indexDB 資料夾 || 44 || || ./tmp/_DBName_/ || _DBName_ 為索引庫名稱 || 45 || || ./tmp/_DBName_/meta/ || meta 放每個索引庫的相關檔案 || 46 || || ./meta/ || 該使用者的個人資訊,如pwd,email 等 || 35 47 36 48 * /opt/crawlzilla/ 37 49 38 50 || 目錄1 || 目錄2 || 說明 || 39 || ./tomcat || ./webapps/UUU/XXX || 對應到 UUU 的 XXX 索引庫 || 40 || ./nutch || || nutch 的目錄 || 51 || ./tomcat/ || || tomcat || 52 || ./tomcat/ || ./webapps/_username_/_DBName_ || 對應到 _username_ 的 _DBName_ 索引庫 (註1)|| 53 || ./nutch/ || || nutch 的目錄 || 54 || ./slave/ || || 給 slave 安裝需要的檔案 || 41 55 || ./main/ || || 放 crawlzilla 的執行檔|| 56 57 註: /home/crawler/crawlzilla/user/_username_/webs/_DBName_ ==鍊結到==> /opt/crawlzilla/tomcat/webapps/_username_/_DBName_ 58 59 如:ln -sf /home/crawler/crawlzilla/user/admin/webs/test_3 /opt/crawlzilla/tomcat/webapps/admin/test_3 42 60 43 61 * /var/log/crawlzilla/ 44 62 45 63 || 目錄1 || 目錄2 || 說明 || 46 || ./hadoop-logs || || ||47 || ./hadoop-pids || || ||48 || ./shell-logs || || ||49 || ./tomcat-logs || || ||64 || ./hadoop-logs/ || || || 65 || ./hadoop-pids/ || || || 66 || ./shell-logs/ || || || 67 || ./tomcat-logs/ || || || 50 68 51 69 == 新舊 檔案\目錄 對照 == 52 70 53 71 || 舊 || ==> || 新 || 說明 || 54 || /home/crawler/crawlzilla/logs || ==> || 刪除此鍊結 || || 55 || /home/crawler/crawlzilla/nutch || ==> || 刪除此鍊結 || || 56 || /home/crawler/crawlzilla/source || ==> || /home/crawler/crawlzilla/slave || || 72 || /home/crawler/crawlzilla/logs || ==> || || 刪除此鍊結 || 73 || /home/crawler/crawlzilla/nutch || ==> || || 刪除此鍊結 || 74 || /home/crawler/crawlzilla/tmp || ==> || /home/crawler/crawlzilla/tmp || 不變 || 75 || /home/crawler/crawlzilla/source || ==> || /opt/crawlzilla/slave || || 57 76 || /home/crawler/crawlzilla/archieve/_DBName_ || ==> || /home/crawler/crawlzilla/user/admin/IDB/_DBName_ || || 58 || /home/crawler/crawlzilla/tmp || ==> || /home/crawler/crawlzilla/tmp || || 77 59 78 || /home/crawler/crawlzilla/urls || ==> || /home/crawler/crawlzilla/meta/urls || || 60 || /home/crawler/crawlzilla/.metadata/_DBName_ || ==> || /home/crawler/crawlzilla/user/admin/IDB/_DBName_/meta || ||79 || /home/crawler/crawlzilla/.metadata/_DBName_ || ==> || /home/crawler/crawlzilla/user/admin/IDB/_DBName_/meta (註2) || || 61 80 || /home/crawler/crawlzilla/.menu_tmp || ==> || /home/crawler/crawlzilla/meta/menu_tmp || || 62 81 || /home/crawler/crawlzilla/system/ || ==> || 於下說明 || || 82 83 * 註2: 0.3 版以前,無論完成與否的IDB中間資料都放在 /home/crawler/crawlzilla/.metadata/。但 1.0 版以後,未完成的 /home/crawler/crawlzilla/user/admin/tmp/_DBName_/meta ,完成之後搬移到 /home/crawler/crawlzilla/user/admin/IDB/_DBName_/meta 63 84 64 85 * /home/crawler/crawlzilla/system: