Screenshot Preview

dtSearch updates Language Packs

Now includes Bulgarian and Slovenian, plus bilingual French/English and German/English stemming.

Highlighting search results in dtSearch.

Highlighting search results in dtSearch.

dtSearch Text Retrieval Engine and dtSearch Web with Spider are supplied with stemming rules and a noise-word file for English(US). Stemming is the only search expansion option which is 'on' by default in the dtSearch end-user products; the reason for this is that stemming is almost always useful when making a search, and adds little to the time required to make a search. Unlike some other search engines, dtSearch applies stemming at search time, there is no need to build indexes specifically to apply stemming and no need to build separate indices for each language in use.

With the stemming option selected dtSearch will find plurals and many other variations; for example a search on print will find printers, printing, printed automatically. However, if you are searching documents written in other languages, the English stemming rules will cause you to miss many word variations which do not occur in English (e.g. verb and noun changes with gender), and you may find that words which are unrelated are found in error. Furthermore, the English noise word list, which is designed to remove unwanted English words from your index to keep the index size small, is not suitable for other languages; your indexes may contain many words which will not be useful in searches and which will add to the size of your indexes.

The solution is to use language specific files in place of the default US English files. These are supplied in the form of Language Extension Packs which contain files for many languages. All files are in Unicode format.

Updates

  • Bulgarian language available in Eastern European Group extension pack
  • Slovenian language available in Eastern European Group extension pack
  • French/English stemming available in Western European Group extension pack
  • German/English stemming available in Western European Group extension pack

About dtSearch Corp.

A leading supplier of text retrieval software, dtSearch Corp. develops, manufactures and sells the dtSearch text retrieval product line. dtSearch products have been the smart choice for Text Retrieval since 1991. The dtSearch product line is known for its "industrial-strength" (PC Magazine) ability to instantly search terabytes of text. dtSearch product line includes end-user, enterprise and developer text retrieval products. dtSearch product line also includes publishing capabilities, for publishing large document collections to Web sites or CD/DVD and Spidering capabilities, for remote site and distributed searching access. dtSearch products have received multiple awards and hundreds of excellent press reviews. Fortune 500 companies and others with some of the most demanding document search needs in the world rely on dtSearch. 4 out of 5 of Fortune Magazine’s most profitable companies have dtSearch developer or multi-user licenses. Typical corporate uses of dtSearch products include general information retrieval, Internet/Intranet site searching and access to technical documentation.

Related News

Product: dtSearch Language Extension Packs | dtSearch Text Retrieval Engine for Win and .NET (32-bit / 64-bit). | dtSearch Web with Spider (32-bit / 64-bit)

Publisher: dtSearch Corp.

Category: Search

Architecture: 32 Bit | ActiveX Components | ActiveX DLL | ActiveX .NET Ready | ActiveX OCX | C++ / MFC Class Libraries | Components | DLL | JavaBean | Java Class | Java Components | Dev Tools & IT Utilities | Windows Dev Tools | Windows 2000 | Windows 9X / ME | Windows NT | Windows XP

Platform: Access | C++Builder | Embarcadero / CodeGear | Delphi | FrontPage | Internet Explorer | JBuilder | Microsoft | Office | SQL Server Tools | Visual Basic | Visual Basic .NET | Visual C++ | Visual C++ .NET | Visual C# .NET | Visual FoxPro | Visual Studio | Visual Studio .NET

Type: Feature Releases

Bookmark with

Delicious  Digg  Facebook  Reddit  Stumble Upon  Twitter