XML, Indexing and Query

This is hardly my area, but I came across two projects that seem to overlap. One is Zebra, the high-performance, general-purpose structured text indexing and retrieval engine from Index Data. It is a GPL-licensed solution that provides out-of-the-box support for Z39.50 and SRU/W. While I’ve not figured out how to configure it for MODS records, it is quite fast on the example documents included in the distribution.

The other is XmlIndexer, a Mono-based project from Edd Dumbill that uses dotLucene to provide a similar sort of functionality. As with Zebra, you specify an index, and then run the tools against it.

I can’t help but wonder why nobody outside of the library world seems to have heard about SRU/W and CQL, though. It would be nice to see these worlds converge a bit more.
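For anyone who hasn't run into them: an SRU search is just an HTTP GET with a CQL query in the query string. Here is a rough sketch in Python of what building such a request might look like. The parameter names (`operation`, `version`, `query`, `maximumRecords`) come from SRU 1.1; the server address and the helper name `build_sru_url` are made up for illustration, and the `dc.title` and `dc.creator` indexes assume the server exposes the Dublin Core context set, which a given installation may not.

```python
from urllib.parse import urlencode


def build_sru_url(base_url, cql_query, max_records=10, version="1.1"):
    """Build an SRU searchRetrieve URL for a CQL query (illustrative helper)."""
    params = {
        "operation": "searchRetrieve",       # the SRU search operation
        "version": version,                  # SRU protocol version
        "query": cql_query,                  # the CQL query string
        "maximumRecords": str(max_records),  # cap on records returned
    }
    # urlencode percent-escapes the quotes, spaces and '=' inside the CQL
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint; substitute whatever your server actually listens on.
BASE_URL = "http://localhost:9999/example"

# CQL: boolean operators joining index/relation/term clauses.
CQL = 'dc.title = "indexing" and dc.creator = "smith"'

url = build_sru_url(BASE_URL, CQL, max_records=5)
print(url)
```

The response comes back as XML, so any ordinary HTTP client and XML parser should be enough to work with it.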

One Comment

  1. Also, Cheshire3 fills the same sort of role with the same functionality. Zebra: C; Cheshire3: Python; XMLIndexer: C#.

    Take your pick of language :)

    – Rob Sanderson


Creative Commons License