Skip to content
aaronbinns edited this page Sep 14, 2010 · 2 revisions

Welcome to my github wiki.

Here I am hosting a few experimental projects related to full-text search of web archives.

TNH, a.k.a. The New Hotness, started as an experiment to write a Lucene document collector which enforces the ‘hitsPerSite=1’ logic during document collection. Performing this during document collection is a big performance improvement. TNH then took on a life of its own and I started building a complete search web service incorporating the specialized document collector as well as other performance optimizations.

Clone this wiki locally