Publisher: DBI Technologies Primary Category: Search Product Type: Component / Managed/Unmanaged Code - without COM / DLL
Powerful text summarization engine. Extractor is a software text summarization engine. It consumes documents (text, html, email) and using a patented genetic extraction algorithm (GenEx) analyzes the recurrence of words and phrases, their proximity to one another, and the uniqueness of the words to a particular document. The engine returns a list of key words and phrases found in the document together with their relative ranking (how many times was the word/phrase found in the document) along with contextual links back to the position of the key word/phrase in the document itself.
Search web pages, text files, PDF and MS-Office documents on your Web site. FindinSite-JS is a search engine for web sites or intranets, featuring regular indexing of HTML, PDF, DOC, DOCX, XLS, XLSX, PPT, PPTX, TXT, JPEG and TIFF files, great international support and word highlighting for hits in Web pages. FindinSite-JS is a Java servlet that runs in a Java servlet engine or application server. Administrators use the online configuration screen to set up indexing runs and configure all output.
Fast, Flexible, Enterprise Search Solution. SearchBlox is an out-of-the-box Enterprise Search Solution built on top of Apache Lucene. It is fast to deploy, easy to manage and available for both on-premise and cloud deployment. SearchBlox provides developers with a powerful search platform for developing and deploying search applications, without having in-depth knowledge of Apache Lucene. As SearchBlox is a full product with integrated crawlers and web-based index/collection management, developers will save considerable time and effort by using SearchBlox compared to building a search application from scratch using Apache Lucene.