Archive of UserLand's first discussion group, started October 5, 1998.
Re: Versions, categorization, dmoz.org
Author: A.M. Kuchling Posted: 9/9/1999; 9:17:17 AM Topic: rss channels via email Msg #: 10818 (In response to 10815)
Note that those data dumps are *not* well-formed XML; there are various glitches in the data. I hacked together a Python script that churns through the 300 MB of data and outputs a corrected version that xmlwf reports as well-formed XML; look for xmlfix.py at http://starship.python.net/crew/amk/zope/ODP.html

It would be nice if the data were available in smaller chunks. I'm working on an ODP viewer for Zope, and have a shell-like tool for editing the data; a slide about it is at http://starship.python.net/crew/amk/zope/odp-slides/sld015.htm . This tool lets you read in the full database, delete chunks, eventually add new material, and then output RDF for your modified version. Slapping an XML-RPC interface on top of it should be relatively easy; I hadn't thought of that.
All of this stuff is still unfinished and unreleased.
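
For readers wondering what this kind of cleanup pass might look like, here is a minimal sketch in modern Python. It is not xmlfix.py. The post does not say which glitches the dumps contain, so the two repairs below (dropping characters that XML 1.0 forbids and escaping bare ampersands) are assumptions, and the names fix_line, fix_file, and is_well_formed are hypothetical. The well-formedness check uses the standard library's expat binding, the same parser that xmlwf is built on.

```python
import re
from xml.parsers import expat

# Assumed glitch 1: characters outside the XML 1.0 "Char" production.
_ILLEGAL_CHARS = re.compile(
    "[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)
# Assumed glitch 2: an ampersand that does not begin an entity or
# character reference such as &amp; or &#38; or &#x26;.
_BARE_AMP = re.compile(r"&(?!(?:[A-Za-z_][\w.-]*|#[0-9]+|#x[0-9A-Fa-f]+);)")


def fix_line(line: str) -> str:
    """Return the line with illegal characters removed and bare '&' escaped."""
    line = _ILLEGAL_CHARS.sub("", line)
    return _BARE_AMP.sub("&amp;", line)


def fix_file(src_path: str, dst_path: str) -> None:
    """Stream a large dump line by line so it never has to fit in memory."""
    with open(src_path, encoding="utf-8", errors="replace") as src, \
         open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            dst.write(fix_line(line))


def is_well_formed(document: str) -> bool:
    """Report whether expat can parse the whole document without error."""
    parser = expat.ParserCreate()
    try:
        parser.Parse(document.encode("utf-8"), True)
    except expat.ExpatError:
        return False
    return True
```

Working line by line keeps memory use flat on a dump of this size, at the cost of not being able to repair problems that span lines, such as mismatched tags.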
This page was archived on 6/13/2001; 4:52:31 PM.
© Copyright 1998-2001 UserLand Software, Inc.