
Now includes Bulgarian and Slovenian, plus bilingual French/English and German/English stemming.
The dtSearch Text Retrieval Engine and dtSearch Web with Spider are supplied with stemming rules and a noise-word file for English (US). Stemming is the only search expansion option that is on by default in the dtSearch end-user products, because stemming is almost always useful in a search and adds little to the time the search takes. Unlike some other search engines, dtSearch applies stemming at search time, so there is no need to build indexes specifically for stemming and no need to build separate indexes for each language in use.
With the stemming option selected, dtSearch finds plurals and many other variations automatically; for example, a search on "print" will also find "printers", "printing" and "printed". However, if you are searching documents written in other languages, the English stemming rules will miss many word variations that do not occur in English (e.g. verb and noun changes with gender), and unrelated words may be matched in error. Furthermore, the English noise-word list, which is designed to keep unwanted English words out of your index and so keep the index small, is not suitable for other languages; your indexes may contain many words that are not useful in searches and that add to index size.
The solution is to use language-specific files in place of the default US English files. These are supplied as Language Extension Packs, which contain files for many languages. All files are in Unicode format.
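To make the idea concrete, here is a minimal sketch of search-time stemming with swappable per-language rules. It is an illustration only: the suffix lists, noise-word sets and function names (`stem`, `build_index`, `search`) are hypothetical and are not dtSearch's actual stemming rules, file formats or API. The index stores words exactly as written, and the stemming rules are applied only when a search is made, which is why the same index can be searched with a different rule set.

```python
# Illustrative sketch only: toy rules, NOT dtSearch's rule files or API.
from collections import defaultdict

# Hypothetical per-language suffix rules, listed longest first.
SUFFIX_RULES = {
    "en": ["ing", "ers", "er", "ed", "s"],
    "fr": ["euses", "euse", "eurs", "eur", "es", "e", "s"],
}

# Hypothetical per-language noise words (kept out of the index).
NOISE_WORDS = {
    "en": {"the", "a", "of", "and"},
    "fr": {"le", "la", "les", "de", "et"},
}


def stem(word, lang):
    """Strip the first matching suffix, keeping at least three letters."""
    word = word.lower()
    for suffix in SUFFIX_RULES[lang]:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_index(docs, lang):
    """Index words as written (no stemming), skipping noise words."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            if word not in NOISE_WORDS[lang]:
                index[word].add(doc_id)
    return index


def search(index, term, lang):
    """Stem at search time: match every indexed word sharing the term's stem."""
    target = stem(term, lang)
    hits = set()
    for word, doc_ids in index.items():
        if stem(word, lang) == target:
            hits |= doc_ids
    return hits


docs = {
    1: "the printers of the office",
    2: "printing and printed pages",
    3: "a page of text",
}
index = build_index(docs, "en")
print(sorted(search(index, "print", "en")))

# Gender variants share a stem under the toy French rules but not the English ones.
print(stem("vendeuse", "fr") == stem("vendeur", "fr"))
print(stem("vendeuse", "en") == stem("vendeur", "en"))
```

In this sketch the English rules leave "vendeuse" and "vendeur" unrelated, while the French rules reduce both to the same stem, which mirrors the kind of variation the English rules miss in other languages.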
A leading supplier of text retrieval software, dtSearch Corp. develops, manufactures and sells the dtSearch text retrieval product line. dtSearch products have been the smart choice for text retrieval since 1991. The dtSearch product line is known for its "industrial-strength" (PC Magazine) ability to instantly search terabytes of text. The product line includes end-user, enterprise and developer text retrieval products. It also includes publishing capabilities, for publishing large document collections to Web sites or CD/DVD, and spidering capabilities, for remote-site and distributed searching. dtSearch products have received multiple awards and hundreds of excellent press reviews. Fortune 500 companies and others with some of the most demanding document search needs in the world rely on dtSearch. Four out of five of Fortune Magazine’s most profitable companies have dtSearch developer or multi-user licenses. Typical corporate uses of dtSearch products include general information retrieval, Internet/Intranet site searching and access to technical documentation.
Product: dtSearch Language Extension Packs | dtSearch Text Retrieval Engine for Win and .NET (32-bit / 64-bit). | dtSearch Web with Spider (32-bit / 64-bit)
Publisher: dtSearch Corp.
Category: Search
Architecture: 32 Bit | ActiveX Components | ActiveX DLL | ActiveX .NET Ready | ActiveX OCX | C++ / MFC Class Libraries | Components | DLL | JavaBean | Java Class | Java Components | Dev Tools & IT Utilities | Windows Dev Tools | Windows 2000 | Windows 9X / ME | Windows NT | Windows XP
Platform: Access | C++Builder | Embarcadero / CodeGear | Delphi | FrontPage | Internet Explorer | JBuilder | Microsoft | Office | SQL Server Tools | Visual Basic | Visual Basic .NET | Visual C++ | Visual C++ .NET | Visual C# .NET | Visual FoxPro | Visual Studio | Visual Studio .NET
Type: Feature Releases
Published in Development Tool News & Software Component News, October 21, 2009