Hi guys,
For the next release of the Documents and Text Collection (which is the combination of the former Text Analytics and ChemMining Collections) we are considering integration with SOLR (http://en.wikipedia.org/wiki/Solr), an open source enterprise search platform. Those of you who use the D&T Collection will probably already know that we use Apache Lucene for our "Local Text Databases". SOLR is essentially an extension to Lucene, that adds features such as enterprise search capabilities, and faceted search results.
To determine the priority of this work, I'd be interested to hear:
- Does anyone use SOLR in their organization (or has at least investigated it)? What were your experiences, do you still use it, and would you consider integration into D&T, and therefore Pipeline Pilot, as being valuable?
- Even if you are unfamiliar with SOLR, are you interested in having us add enterprise document search capabilities into the D&T Collection? Note, we already have components to search SharePoint, so SOLR integration would complement that with an open source version
- Any other general comments about the Documents and Text Collection -- features you'd like us to add, etc.
Thanks for your feedback
Andrew
Andrew LeBeau
Product Manager, Documents and Text Collection