Two new open source bibliographic projects of note. The first is called TEI XML Bibliography Project, and comes from Paul Tremblay. Paul was originally aiming for something more ambitious (e.g. a new bibliographic schema), but I think our conversations convinced him it’d be better to focus on something less grand.
I’ve tried to convince him of the importance of BiblioX to bibliographic formatting in XML, but obviously failed. There are thousands of bibliographic styles in circulation, and it seems hopelessly unworkable to write individual XSLT stylesheets for each and every one of them, for each and every output format. While BiblioX still needs a lot of work, the basic principle of a document-and bibliographic-data-agnostic XSLT-based formatting system is sound.
I also stumbled on yet another new project called Bibliophile. This one is a little different in that it is not a standalone software project, but rather an effort at standardization.
Bibliophile is an initiative to align the development of bibliographic databases for the web. It aims to promote standards, discussion among users on necessary features and a variety of specific solutions for different fields of research.
I like this. It gets even more interesting when they say they’re standardizing on MODS.
However, it’s also a little disheartening to see so many projects (usually based on PHP and MySQL) reinventing the wheel over and over again, and often not very well. In particular, it’s one thing to support MODS for data exchange, but these projects really need rich internal data models up to the task of representing MODS data. And yet, they all seem to start and end with the BibTeX data model. Please, people, understand that BibTeX has a very limited—totally flat—data model that is not at all sufficient for scholars outside of the hard sciences, and which takes zero advantage of the power of relational databases and (and even more) XML.
So where to look instead for inspiration? RefDB already has a better data model than BibTeX, and is currently being revamped with a richer MODS-compatible data model (see here for some of the code). And someone is working on a PHP-based interface, which will likely ultimately be based on a PHP module similar to the current RefDB Perl module.
For something more radical, how about LibDB? LibDB is written by Perl hacker Morbus Iff, and is based on principles in the FRBR (pdf). Its SQL schema has separate tables for works, for people and their roles, for events, etc., and Morbus is seriously considering revamping it as a plug-in for the open source CMS system Drupal. While begun as a project to store videos, it is designed to store any bibliographic metadata (save, yet, for what librarians call “analyticals” – articles, book chapters, etc.).
update: David Wilson just pointed me to another potentially-interesting-but-flawed project that basis its data model on bibtex called B3.