December 11th, 2002

searchtools.com

Product Report: dtSearch Web

This is an off-site copy of the corresponding Product report page on the SearchTools.com website, and it is designed to allow you to comment on the product and/or the reporting. For more information about the topic of search and tools visit SearchTools.com where you can browse many articles, in-depth analysis and overviews of external resources.

dtSearch Web

Product Information

Platforms: Windows NT 4.0, Windows 2000, Windows XP, Linux in beta test
Price: $999 for unlimited concurrent use on a single server, $2,500 for 3-server package

Also available as a code library and CD-ROM indexer

Features

  • Windows GUI administration interface.
  • Uses the Windows Task Scheduler.
  • Simple configuration to connect to IIS web server via ASP and ISAPI
  • Spider for web site robot indexing comes with all versions of the software
  • Also indexes local file systems and mounted servers.
  • Can include based on URL path and exclude based on file type extension
  • Can exclude text between <!--BeginNoIndex--> and <!--EndNoIndex--> tags.
  • Incremental index updates.
  • Supports French, German, Italian, Spanish, Dutch, Swedish, Danish, Portuguese (Brazilian/European), Finnish and Norwegian
  • Unicode support, including Arabic
  • Indexes and searches nested XML fields
  • ODBC interface to databases
  • Indexes HTML, Outlook email directories, PDF, Microsoft Word, WordPerfect, Microsoft Access, PowerPoint, RTF, ZIP archives and XML.
  • Supports multiple indexes, each containing 4 to 8 gigabytes of text
  • Search type option for forms, selecting among "All words", "Any words", "Exact phrase", and "Boolean" search types
  • Searching using phrases, Boolean operators, Natural-Language queries, fuzzy logic, stemming and phonetic matches.
  • Can search specified fields and meta tags.
  • Advanced search interface lists search zones, query options, fuzziness, number of results and sorting options.
  • Default search results page shows results in a side frame.
  • Match word highlighting in search results
  • Can display web pages and office documents in browser with match words marked.
  • Extensive customization, including ASP integration
  • Customizable logging of search requests.
  • Distributed search version, FindPlus, available
  • API supports Java JNI, C++, Visual C++, Visual Basic, .NET, and Delphi
  • Used by PDF WebSearch
  • Demo download available
  • Company has been in business since 1991.
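
The marker-based exclusion and the form search types in the list above can be illustrated with a minimal Python sketch. This is not dtSearch code and does not use its API: the function names (strip_noindex, strip_tags, matches) and the sample page are hypothetical, and only the two marker comments and the search type names come from the feature list. Boolean search is left out to keep the sketch short.

```python
import re

# The two marker comments quoted in the feature list.
NOINDEX = re.compile(r"<!--BeginNoIndex-->.*?<!--EndNoIndex-->", re.DOTALL)
TAG = re.compile(r"<[^>]+>")


def strip_noindex(html):
    """Drop everything between the BeginNoIndex and EndNoIndex markers."""
    return NOINDEX.sub(" ", html)


def strip_tags(html):
    """Very rough tag removal, good enough for this illustration only."""
    return TAG.sub(" ", html)


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def matches(query, text, search_type):
    """Apply one of three form search types: 'all', 'any' or 'phrase'."""
    q = tokenize(query)
    t = tokenize(text)
    if not q:
        return False
    if search_type == "all":
        return all(word in t for word in q)
    if search_type == "any":
        return any(word in t for word in q)
    if search_type == "phrase":
        n = len(q)
        return any(t[i:i + n] == q for i in range(len(t) - n + 1))
    raise ValueError("unknown search type: " + search_type)


# Hypothetical sample page: the navigation block is marked as not indexable.
page = (
    "<p>Full text search</p>"
    "<!--BeginNoIndex--><p>navigation menu</p><!--EndNoIndex-->"
    "<p>engine review</p>"
)
text = strip_tags(strip_noindex(page))

print(matches("navigation", text, "any"))      # False: excluded from indexing
print(matches("review search", text, "all"))   # True: both words occur
print(matches("text search", text, "phrase"))  # True: words are adjacent
print(matches("search text", text, "phrase"))  # False: wrong order
```

A real indexer would build an inverted index rather than scan text on every query; the scan here only makes the difference between the search types visible.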

Articles & Reviews

  • Review: Search Technologies: dtSearch, CRN (Computer Reseller News) Test Center, May 7, 2002, by Mario Morejon
    Describes the whole line of dtSearch products. The dtSearch Index manager can recognize many current and legacy file formats. The search engine itself offers Boolean, natural-language, stemming, synonym and fuzzy matching, plus a thesaurus. APIs are available for Visual Basic, ActiveX, C, C++, and Java. Praises the documentation for its detailed explanations of parameters and its examples. The spider has limited controls, mainly for depth of search. The Web Search engine uses the same internal code and displays results quickly, but converting documents to HTML for Web viewing is slow. See also the overview "In Search Of The Enterprise", which includes analysis and reseller information.

  • Emedicine.com Selects dtSearch to Help Power Its World Medical Library, press release, March 14, 2001

  • Web Digital Publishing Goes dtSearch with PDF WebSearch, dtSearch case study

Product Report: Engenium Semetric

Engenium Semetric

Product Information

Platform: Windows NT, 2000
Price: contact company

Features

  • Identifies concepts based on words in context, recognizing associations.
  • Uses proprietary mathematical algorithms incorporating machine learning and vector space modeling.
  • Dynamically maps relationships among words in large sets of documents.
  • Can find relevant documents even when the search term does not appear.
  • High-level API in Java and COM for integration with portals and systems.
  • Has been deployed for resume searching, patents, portals, and government knowledge management systems.
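
The claim that relevant documents can be found even when the search term does not appear can be illustrated with a generic vector space sketch in Python. Engenium's algorithms are proprietary and are not reproduced here: this is a textbook co-occurrence approach swapped in for illustration, and the function names and sample documents are hypothetical. Each term is represented by the documents it occurs in, and a document is scored by how strongly its terms are associated with the query term.

```python
import math
from collections import Counter, defaultdict


def term_vectors(docs):
    """Map each term to a sparse vector of {document index: count}."""
    vectors = defaultdict(Counter)
    for doc_id, text in enumerate(docs):
        for term in text.lower().split():
            vectors[term][doc_id] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def score(query_term, docs):
    """Score each document by the mean association of its terms with the query."""
    vectors = term_vectors(docs)
    q = vectors.get(query_term.lower())
    if q is None:
        return [0.0] * len(docs)
    scores = []
    for text in docs:
        terms = set(text.lower().split())
        total = sum(cosine(q, vectors[t]) for t in terms)
        scores.append(total / len(terms) if terms else 0.0)
    return scores


# Hypothetical mini-collection: only the first document contains "automobile".
docs = [
    "automobile car dealer",
    "car repair manual",
    "banana bread recipe",
]
scores = score("automobile", docs)
for text, value in zip(docs, scores):
    print(round(value, 3), text)
```

The second document never mentions "automobile", yet it receives a non-zero score because "car" co-occurs with "automobile" elsewhere in the collection; the third document shares no associated terms and scores zero.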

Product Report: Universal Knowledge Processor

Universal Knowledge Processor

Product Information

Platform: Microsoft Windows
Price: contact company

Features

  • Faceted metadata browse and search engine
  • Multithreaded COM component written in ATL.
  • Uses ASP for integration
  • Includes a rules-based classification engine
  • Optional HyperMedia search engine
  • Scales up to at least 114,000 documents

Articles

Dynamic Taxonomies: A model for Large Information Bases, IEEE Transactions on Knowledge and Data Engineering, May/June 2000, by Giovanni M. Sacco
This academic paper starts with the problems of searching and browsing huge data sets and of expressing multiple taxonomies. It describes dynamic taxonomies derived from documents by analyzing their concepts, and a visual framework for browsing the content. This allows users to find appropriate documents in a few clicks, even in very large data sets. Because the system displays only categories that contain documents fitting the current criteria, users never reach a state with no matching documents.
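
The last point, that only categories with matching documents are offered, can be sketched in a few lines of Python. This is an illustration of the idea as summarized above, not code from the paper or from Universal Knowledge Processor; the function names and the sample documents and categories are hypothetical.

```python
from collections import Counter


def refine(docs, selected):
    """Return the documents that carry every selected category."""
    return {doc_id: cats for doc_id, cats in docs.items() if selected <= cats}


def remaining_categories(docs, selected):
    """Count the other categories present among the matching documents.

    Categories with no matching document never appear in the result,
    so any category offered to the user leads to at least one document.
    """
    matching = refine(docs, selected)
    counts = Counter(
        cat for cats in matching.values() for cat in cats if cat not in selected
    )
    return dict(counts)


# Hypothetical collection: each document is tagged with a set of categories.
docs = {
    "d1": {"search", "windows"},
    "d2": {"search", "linux"},
    "d3": {"classification", "windows"},
}

offered = remaining_categories(docs, {"search"})
for category in sorted(offered):
    print(category, offered[category])
```

After "search" is selected, "classification" is no longer offered because no remaining document carries it, while "linux" and "windows" are each offered with a count of one.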