Publisher: DBI Technologies Primary Category: Search Product Type: Component / Managed/Unmanaged Code - without COM / DLL
Powerful text summarization engine. Extractor is a software text summarization engine. It consumes documents (text, html, email) and using a patented genetic extraction algorithm (GenEx) analyzes the recurrence of words and phrases, their proximity to one another, and the uniqueness of the words to a particular document. The engine returns a list of key words and phrases found in the document together with their relative ranking (how many times was the word/phrase found in the document) along with contextual links back to the position of the key word/phrase in the document itself.
Add fully controllable search functionality to your ASP.NET hosted Web site. FindinSite-MS is an ASP.NET search engine for a Web site or intranet using a Microsoft server. It integrates fully with your site, it doesn't show ads from your competitors, it highlights words in Web pages and is available as a package for your site, or as a hosted search. Featuring many file types, regular indexing, customisable templates and great international support. Searches HTML, PDF, DOC, PPT, TXT, JPEG and TIFF files.
Add Multi lingual searching to your applications. dtSearch Text Retrieval Engine for Win & .NET and dtSearch Web with Spider are supplied with stemming rules and a noise-word file for English(US). If you are searching documents written in other languages then this could mean that plurals and noise words are missed. dtSearch Language Extension Packs are available for Eastern and Western European languages to improve your non English(US) search results.
Quickly publish an instantly searchable document collection or mirror an existing Web site to CD/DVD. dtSearch Publish includes over a dozen indexed and fielded data search options. dtSearch Publish highlights hits in HTML, XML and PDF, while displaying links and images. dtSearch Publish also converts Office, ZIP and other file formats to HTML with highlighted hits. For end-users, running the CD/DVD requires no installation on the user's hard drive. dtSearch's proprietary indexing and searching algorithms allow for fast indexing and searching performance even over extremely large databases and other diverse collections of documents. dtSearch Publish algorithms are engineered to maintain consistent indexing speeds regardless of the size of the document set. dtSearch supports Microsoft Access, Excel (*.xls, *.xlsb, *.xlsx), Word (*.doc, *.docx, *.rtf), and PowerPoint (*.ppt, *.pptx) files created by Office 2010.