dtSearch Language Extension Packs

by dtSearch Corp. - Product type: Component / Application / ActiveX OCX / ActiveX DLL / DLL / VC++ Class Library / JavaBean / Java Class

Please note: unless otherwise stated, this is an English-language product.

Add multilingual searching to your applications. dtSearch Text Retrieval Engine for Win & .NET and dtSearch Web with Spider are supplied with stemming rules and a noise-word file for English (US) only. If you search documents written in other languages, plurals and other word variations may be missed and noise words may not be recognized. dtSearch Language Extension Packs are available for Eastern and Western European languages to improve your non-English (US) search results.

Language Extension Pack series 400 for dtSearch Text Retrieval Engine for Win & .NET and dtSearch Web with Spider V6.5 or later.

dtSearch Text Retrieval Engine for Win & .NET and dtSearch Web are supplied with stemming rules and a noise-word file for English (US). Stemming is the only search expansion option that is 'on' by default in the dtSearch end-user products, because stemming is almost always useful in a search and adds little to the time the search takes. Unlike some other search engines, dtSearch applies stemming at search time, so there is no need to build indexes specifically to apply stemming and no need to build separate indexes for each language in use.

With the stemming option selected, dtSearch finds plurals and many other variations automatically; for example, a search on print also finds printers, printing, and printed. However, if you are searching documents written in other languages, the English stemming rules will miss many word variations that do not occur in English (e.g. verb and noun changes with gender), and unrelated words may be matched in error.
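
The effect described above can be illustrated with a minimal sketch of search-time stemming. The suffix lists, the stem and expand_query functions, and the sample words are all invented for this illustration; they are not dtSearch's rule format or API. The index holds words unstemmed, and the query term is expanded at search time to every indexed word that shares its stem.

```python
# Toy suffix rules, invented for this sketch; they are NOT dtSearch rule files.
ENGLISH_SUFFIXES = ["ers", "ing", "ed", "er", "s"]
FRENCH_SUFFIXES = ["es", "e", "s"]  # hypothetical, covers gender/plural endings only


def stem(word, suffixes):
    """Strip the first matching suffix, keeping at least three letters."""
    word = word.lower()
    for suffix in suffixes:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def expand_query(term, indexed_words, suffixes):
    """Return every indexed word that shares a stem with the query term."""
    target = stem(term, suffixes)
    return sorted(w for w in indexed_words if stem(w, suffixes) == target)


# The index stores words unstemmed, so one index can serve any rule set.
index_words = {"print", "printers", "printing", "printed",
               "petit", "petite", "petites"}

english_hits = expand_query("print", index_words, ENGLISH_SUFFIXES)
french_with_english_rules = expand_query("petit", index_words, ENGLISH_SUFFIXES)
french_with_french_rules = expand_query("petit", index_words, FRENCH_SUFFIXES)

print(english_hits)
print(french_with_english_rules)
print(french_with_french_rules)
```

In this toy setup the English rules expand print to all four English forms but find only the exact form of the French word, while the hypothetical French rules also reach the feminine and plural forms. Because the rules are applied to the query rather than baked into the index, switching rule sets does not require reindexing.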

Furthermore, the English noise word list, which is designed to remove unwanted English words from your index to keep the index size small, is not suitable for other languages; your indexes may contain many words which will not be useful in searches and which will add to the size of your indexes.
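
As a rough illustration of the noise-word problem, the sketch below filters a French sentence first with an English noise list and then with a French one. The two word lists and the index_terms function are made up for this example and are much shorter than any real noise-word file; they are not taken from dtSearch.

```python
# Tiny illustrative noise-word lists, invented for this sketch.
ENGLISH_NOISE = {"the", "a", "of", "and", "in", "is"}
FRENCH_NOISE = {"le", "la", "les", "de", "et", "un", "une", "est", "dans"}


def index_terms(text, noise_words):
    """Return the set of lowercase words kept after dropping noise words."""
    words = text.lower().split()
    return {w for w in words if w not in noise_words}


french_text = "le chat est dans la maison et le chien est dans le jardin"

with_english_list = index_terms(french_text, ENGLISH_NOISE)
with_french_list = index_terms(french_text, FRENCH_NOISE)

print(len(with_english_list), sorted(with_english_list))
print(len(with_french_list), sorted(with_french_list))
```

With the English list, none of the French function words are removed, so the index keeps nine distinct terms; with the French list only the four content words remain.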

The solution is to use language-specific files in place of the default US English files. These are supplied in the form of dtSearch Language Extension Packs, which contain files for many languages (see the list below). All files are in Unicode format.

Language Extension Packs 400 Series

Western European Group (Lep402)

  • Danish
  • Dutch
  • English
  • Finnish
  • French*
  • German*
  • Italian
  • Norwegian
  • Portuguese
  • Spanish
  • Swedish
  • * LEP400 and LEP402 also include unique bilingual French/English and German/English stemming and noise-word files, which enable search expansion on indexes and documents containing a mix of French or German and English text.

Eastern European Group (Lep403)

  • Belarusian
  • Bulgarian (new)
  • Czech
  • Estonian
  • Greek
  • Hungarian
  • Latvian
  • Lithuanian
  • Polish
  • Russian
  • Slovak
  • Slovenian (new)
  • Turkish
  • Ukrainian

Language Packs include:

  • Stemming rule files and noise-word files for each supported language.
  • Test files to check the operation of stemming in all the supplied languages.
  • A Stemming Language Selector application, which changes the stemming rules from the Windows Start menu.
  • One year of online technical support and updates.

