SearchTools Blog (searchtools) wrote,
SearchTools Blog
searchtools

Product Report: Zebra

This is an off-site copy of the corresponding Product report page on the SearchTools.com website, and it is designed to allow you to comment on the product and/or the reporting. For more information about the topic of search and tools visit SearchTools.com where you can browse many articles, in-depth analysis and overviews of external resources.

Zebra

Product Information

Platform: Unix, Windows (Win32)
Price: free, open source, GNU GPL (commercial support available)

Features

  • Indexes both free text and fielded information.
  • Indexes local files, no robot for remote spidering.
  • Input filters for recognizing structured text in SGML, XML, MARC and other ASCII text formats.
  • Free text and Boolean queries, regular expressions and right truncation.
  • Approximate matching (for spelling mistakes).
  • Responds to Z39.50 distributed search clients.

Articles & Reviews

  • Comparing Open Source Indexers Infomotions Musings; May 29, 2001 by Eric Lease Morgan
    Describes the history and features of eight open-source search engines, freeWAIS-sf (aging code and hard to install, but good for searching email and public domain etexts); Harvest (powerful gathering features for frequently-changing data stores, good with structured documents); ht://Dig (tricky to configure, no phrase searching, automatic stemming and match word highlighting); Isearch (weak documentation and support, easy to install, dated interface, Z39.50 support); MPS Information Server (zippy indexing of both text and structured data, Z39.50 support, Perl API, limited documentation); SWISH-E (simple to install engine, CGIs in Perl and PHP still beta, good for HTML pages, recognizes new META tags, sorts results by field; WebGlimpse (easy to install and configure, requires commercial version for customized output); Yaz/Zebra (mainly Z39.50, no Perl API, mainly a toolkit to index and respond to distributed client queries). Article also points out that chaotic information is less than helpful and encourages organization, structure and vocabulary control.
Subscribe

  • Post a new comment

    Error

    default userpic

    Your reply will be screened

    Your IP address will be recorded 

    When you submit the form an invisible reCAPTCHA check will be performed.
    You must follow the Privacy Policy and Google Terms of use.
  • 0 comments