=================
Specification
=================
Web content on a CDROM
Web content searchable
Cross platform
Minimal installation

=================
CHANGELOG
=================

=================
0.2.0 2007 Jul ??
=================
release candidate
CHG update Tomcat 5.5.23
CHG update JDIC 0.9.3
CHG update BrowserLauncher2 1.2
CHG based on bootstrap not embed
CHG rearrange site/docs
CHG search docs
CHG separate content module
CHG remove accesstoinsight references
CHG new icons, splash, logo
FIX issues with config.properties

KNOWN ISSUES
=================
BUG Aborting from CLI kills mozilla browsers ungracefully
BUG Tray is messed up (expect new features)
BUG Neither Unicode nor 8-bit chars work
BUG honor comment robots content="none" /robots
BUG ERROR on Restart
BUG favicon.ico is invalid

TESTED PLATFORMS
=================
CD and HD with Windows XP SP2
CD and HD with Ubuntu 5.10 Linux 2.6 x86
CD with Mac OS X 10.4.6

PENDING TODO
=================
CHG update Nutch 0.9
TEST 0.2.0 Windows 2000, Windows XP SP1
TEST 0.2.0 Linux KDE, BSD, Solaris
TODO 0.2.0 svn externals
TODO 0.2.0 cleanup vicaya/trunk
TODO 0.2.0 Automate build, user, ati, dev
TODO 0.2.0 LICENSE file (BSD poem, update sf.net)
TODO 0.2.0 IZPACK installer
TODO 0.4.0 content updates
TODO 0.6.0 application updates

=================
0.1.6 2006 May 19
=================
release candidate
CHG Launcher.jar at root
CHG segments in webapp/WEB-INF/segments
CHG one step build, launch, crawl script
CHG CHANGELOG to dev base
CHG remove search hit UNKNOWN_SIZE - UNKNOWN_DATE
FIX search result "surprise me" rather than query
FIX search result documents 0- 0 of 108
FIX search hit navigation

KNOWN ISSUES
=================
BUG honor comment robots content="none" /robots
BUG "? On-line Search" vs "Search"
BUG ERROR on Restart

TESTED PLATFORMS
=================
CD and HD with Windows XP SP2
CD and HD with Ubuntu 5.10 Linux 2.6 x86
CD with Mac OS X 10.4.6

PENDING TODO
=================
TEST 0.2.0 Windows 2000, Windows XP SP1
TEST 0.2.0 Linux KDE, BSD, Solaris
TODO 0.2.0 separate content, search doc
TODO 0.2.0 LICENSE file (BSD poem, update sf.net)
TODO 0.2.0 IZPACK installer
TODO 0.4.0 content updates
TODO 0.6.0 application updates

=================
0.1.5 2006 May 14
=================
simplified
CHG rearranged file structure
CHG config, context properties default and rc
CHG configurable content context path
CHG separate launcher, search, vendor, vicaya
CHG simplified startup scripts
CHG removed JRE config
CHG removed README.html
CHG include doc
CHG two step crawl
FIX ati-seeds.html not valid html
FIX winx86 shutdown port 8005 not released
FIX docs include download info

KNOWN ISSUES
=================
BUG search result hit/doc/page numbers hard-coded
BUG winx86 Windows security alert
WORK-AROUND do not block ports firewall

TESTED PLATFORMS
=================
CD and HD with Windows XP SP2
CD and HD with Ubuntu 5.10 Linux 2.6 x86

PENDING TODO
=================
TEST 0.1.6 Windows 2000, Windows XP SP1
TEST 0.1.6 Linux PowerPC, Linux KDE
TEST 0.1.6 Mac OS X, BSD, Solaris
TODO 0.1.6 separate content, search doc
TODO 0.1.6 segments relative to webapp
TODO 0.2.0 IZPACK installer
TODO 0.4.0 content updates
TODO 0.6.0 application updates

=================
0.1.4 2006 Apr 17
=================
browser launcher
CHG prune unused nutch webapp plugins
CHG VICAYA_ENDPORT env var (default=8005)
CHG dev move 3rdParty to user.home
CHG crawl nutch moved to content/conf
CHG crawl cleanup 50%
CHG write base/conf/server.xml every run
CHG tokenized version number
CHG release docs automated
CHG cleanup README.txt
CHG separate Launcher project
CHG start_crawl.sh moved to distribution
CHG open browser window on start
CHG display splash window on start
CHG start tray icon on start
FIX all localhost:8108/ should default to index.html
FIX all crawl.sh 99% success rate
FIX winx86 env vars reset on each start.bat
FIX linx86 can not write conf/server.xml from CDROM
FIX linx86 incorrect port 8080 and no content from CDROM

KNOWN ISSUES
=================
BUG all search result hit/doc/page numbers are hard-coded
BUG ati-seeds.html not valid html
BUG winx86 shutdown port 8005 not always released
WORK-AROUND set VICAYA_ENDPORT=8006 | unreserved port
BUG winx86 Windows security alert
WORK-AROUND do not block ports
BUG linx86 ibm/jre does not run from CDROM
WORK-AROUND (1) export VICAYA_JAVA=/usr/lib/whatever-j2re-sun-142
WORK-AROUND (2) copy to HD and run locally (set u+w)
BUG linx86 ibm/jre searchpage.xml result "IOException null"
WORK-AROUND export VICAYA_JAVA=$SUN_JRE

=================
0.1.3 2006 Apr 12
=================
library update and config simplification
CHG Nutch 0.7.2 from 0.7.1
CHG simplify env vars (JAVA, CONTENT, SERVER, PROFILE)
TEST tested on CD and HD with Windows XP SP2
TEST tested on CD and HD with Ubuntu 5.10 Linux 2.6 x86
FIX automate web crawl from content seeds

KNOWN ISSUES
=================
BUG all search result hit/doc/page numbers are hard-coded
BUG winx86 shutdown port 8005 not always released
WORK-AROUND modify .vicaya/base/conf/server.xml change first line
BUG winx86 Windows security alert
WORK-AROUND do not block ports
BUG winx86 env vars reset on each start.bat
WORK-AROUND modify start script
BUG linx86 can not write conf/server.xml from CDROM
BUG linx86 incorrect port 8080 and no content from CDROM
WORK-AROUND (1) stop server, manually edit conf/server.xml, restart
WORK-AROUND (2) run from HD
BUG linx86 ibm/jre does not run from CDROM
WORK-AROUND (1) export VICAYA_JAVA=/usr/lib/whatever-j2re-sun-142
WORK-AROUND (2) copy to HD and run locally (set u+w)
BUG linx86 ibm/jre search result "IOEception null" no 'x', works with sun jdk
WORK-AROUND export VICAYA_JAVA=$SUN_JRE

=================
0.1.2 2006 Mar
=================
FIX import development to SVN
FIX remove extraneous libraries and temp files
FIX automatically start CDROM on windows

=================
0.1.1 2006 Mar
=================
bug fix internal release

=================
0.1.0 2006 Mar
=================
unstable development release
FIX Vicaya web server starts automatically from CD on Windows.
FIX Vicaya can be launched manually from CD on Linux
FIX Vicaya runs from HD
FIX TEMP, JAVA, and server instance (HOME) can be configured
TODO a script to build indices added
TODO Vicaya has not been tested on Mac OS X
BUG startup and index locations (ROOT) can be configured (untested)
BUG Vicaya runs slowly from CDROM (startup and searching)

=================
0.0.9 2006 Feb
=================
proof-of-concept completed

=================
0.0.8 2006 Feb
=================
alpha development begins

=================
0.0.7 2006 Jan
=================
Nutch 0.7.1 on Tomcat 5.0

=================
0.0.6 2005 Aug
=================
Nutch on Jetty web server

=================
0.0.5 2005 Jan
=================
Javascript only

=================
0.0.4 2004 Nov
=================
Javascript/Applet communication

=================
0.0.1 2004 Jun
=================
first release
Lucene based Java applet