dmozDump v.0.1
by Marshall Roch

Copyright (C) 2002 Marshall Roch
All rights reserved.

----------------------------------------
Use of this script allows you to take
content from the Open Directory Project
(www.dmoz.org).  Use of this content is
also subject to their license agreement.
(www.dmoz.org/license.html)

/*-------------------------------------------*/
/* This program is free software; you can    */
/* redistribute it and/or modify it under    */
/* the terms of the GNU General Public       */
/* License as published by the Free Software */
/* Foundation; either version 2 of the       */
/* License, or (at your option) any later    */
/* version.                                  */
/*                                           */
/* This program is distributed in the hope   */
/* that it will be useful, but WITHOUT ANY   */
/* WARRANTY; without even the implied        */
/* warranty of MERCHANTABILITY or FITNESS    */
/* FOR A PARTICULAR PURPOSE. See the GNU     */
/* General Public License for more details.  */
/*                                           */
/* You should have received a copy of the    */
/* GNU General Public License along with     */
/* this program; if not, write to the Free   */
/* Software Foundation, Inc., 59 Temple      */
/* Place, Suite 330, Boston, MA 02111-1307   */
/* USA                                       */
/*-------------------------------------------*/

DESCRIPTION:
   dmozDump fetches a page from dmoz.org and extracts the
title, URL, and description of each link on it.  The
information is then inserted into a database.  dmozDump uses
PHPLib's database abstraction layer (the MySQL file is
included), so it is not tied to a single DBMS.


INSTALLATION:

   1.  If a table does not exist in your database for links,
       create one.

   2.  Modify dump_config.php to reflect your database's
       connection information.

       To use a DBMS other than MySQL, download PHPLib,
       upload the appropriate file, and change the require()
       in dump_config.php.

   3.  Change table/field names to fit your database.

   4.  Upload all files in this distribution to your server.
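
   As an illustration of step 1, here is a minimal sketch of
a links table for MySQL.  The table name and every column
name below are hypothetical (they are not taken from
dmozDump), so rename them to match the names you set in
step 3.

```php
<?php
// Hypothetical schema: "links" and all column names are assumptions
// made for this example, not names used by dmozDump itself.
$createLinksTable = <<<SQL
CREATE TABLE links (
    id          INT UNSIGNED NOT NULL AUTO_INCREMENT PRIMARY KEY,
    category_id INT UNSIGNED NOT NULL DEFAULT 0,
    title       VARCHAR(255) NOT NULL,
    url         VARCHAR(255) NOT NULL,
    description TEXT
)
SQL;

// Print the statement so it can be pasted into a MySQL client.
echo $createLinksTable, "\n";
```

   The category_id column is only needed if your links are
organized into categories.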


USE:

   1.  If your link database is organized into categories,
       input the category ID number (or however your links
       are categorized) in the first field.

   2.  Visit dmoz.org in a new browser window, navigate to
       the page you wish to dump, and copy the URL into the
       second field.

   3.  Copy the URL (NOT the title! It should look like
       "http://...com") of the first link that you wish to
       dump.  NOTE:  this script will ONLY dump links between
       the <hr> tags (actual links, not categories, etc).

   4.  Copy the last couple of words from the last link 
       description that you wish to dump.  This text must be
       unique on the page to tell the script where to stop 
       grabbing text.

   5.  Submit the form and verify that the links were parsed
       correctly.  If not, correct the data at the bottom of
       the page and resubmit the bottom form.

   6.  Once everything is confirmed to be correct, hit the
       "Dump to DB" button.  The links will be inserted.

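   To show why steps 3 and 4 ask for a starting URL and a
unique closing phrase, here is a rough sketch of that kind of
extraction.  It is not dmozDump's actual code: the function
name extract_links() is hypothetical, and the markup it
expects (<a href="URL">Title</a> - Description) is an
assumption about the page layout.

```php
<?php
// Hypothetical helper, not part of dmozDump.  It cuts the page down to
// the text between the first wanted URL and the closing phrase, then
// pulls the URL, title, and description out of each link in that slice.
function extract_links($html, $firstUrl, $endText)
{
    // Find the first URL the user asked for.
    $urlPos = strpos($html, $firstUrl);
    if ($urlPos === false) {
        return array();
    }

    // Back up to the start of the <a ...> tag that contains it.
    $start = strripos(substr($html, 0, $urlPos), '<a');
    if ($start === false) {
        return array();
    }

    // Find the closing phrase; it must appear after the first URL.
    $endPos = strpos($html, $endText, $urlPos);
    if ($endPos === false) {
        return array();
    }
    $slice = substr($html, $start, $endPos + strlen($endText) - $start);

    // Assumed markup: <a href="URL">Title</a> - Description
    $pattern = '~<a\s+href="([^"]+)"[^>]*>(.*?)</a>\s*-\s*(.*?)(?=<li>|<a\s|$)~is';
    preg_match_all($pattern, $slice, $matches, PREG_SET_ORDER);

    $links = array();
    foreach ($matches as $m) {
        $links[] = array(
            'url'         => $m[1],
            'title'       => trim(strip_tags($m[2])),
            'description' => trim(strip_tags($m[3])),
        );
    }
    return $links;
}

// Usage with made-up sample markup: the third link lies past the
// closing phrase, so it is left out of the result.
$sample = "<hr>\n<ul>\n"
    . "<li><a href=\"http://www.example.com/\">Example One</a> - First sample description.\n"
    . "<li><a href=\"http://www.example.org/\">Example Two</a> - Second sample ends here.\n"
    . "<li><a href=\"http://www.example.net/\">Example Three</a> - Not wanted.\n"
    . "</ul>\n<hr>\n";
print_r(extract_links($sample, 'http://www.example.com/', 'ends here.'));
```

   If the closing phrase appeared earlier on the page, the
slice would end too soon, which is why step 4 asks for text
that is unique.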

TERMS OF USE:
   This script is provided free of charge, as long as the
copyright and DMOZ usage notices remain intact.  I am not
responsible if the use of this code causes any damage to
your server or your database.  I am also not liable if you
get in trouble with DMOZ for any reason.


MODIFICATION:
   You may modify this script freely, but you must leave
the original copyright information and clearly state that the
script has been modified and who made the modification.
