Uncategorized
MySQL Workbench 5.2.16 Beta 6 has been released
by specialj on Feb.20, 2010, under Uncategorized
2 Comments more...Open Source Search Engines with Web Crawling and HTML Indexing
by specialj on Jan.19, 2010, under Uncategorized
The term search engine is particularly vague. It can mean an online search engine, or it can mean software that provides something similar to the online search engines, or it can mean software that does fulltext indexing and querying. I specifically wanted to find a tool that could be used to create a niche search engine without having to write the whole thing myself as a ruby on rails app. Here’s what I found:
- mnoGoSearch – seems very actively maintained, packages present in Debian and Ubuntu, written in c
- DataparkSearch – seems fairly actively maintained, apparently a branch of mnoGoSearch, written in c
- Nutch – seems fairly actively maintained,written in java, built on top of Lucene
And that’s really it. There is a product called ASPseek but it has long since been abandoned. There is also ht://Dig but it has not been updated in years and is not really in the same league as the above. So, at least for anyone trying to research this area there are not a whole lot options to try.
Another year, another blog
by specialj on Jan.19, 2010, under Uncategorized
I believe this is my 6th blog. So far I have never taken the time to migrate articles from one to another and I am continuing that tradition. Not so good for the internet perhaps but better for me to not have to take the time to get a migration right. I may go back and report some of my older content as appropriate though. I decided to use wordpress this time because I’m supporting some other wordpress users. I haven’t been the biggest wordpress fan in the past but I’m willing to give it another try.