How can I download the whole dblp dataset?
The whole dblp dataset is available as one big XML file. You can find the XML dump of dblp in this directory. The directory contains:
dblp.xml
is an XML file which contains all bibliographic records,dblp.xml.gz
is a compressed version of this file (gzip),dblp.dtd
is the document type definition you need to validate the XML file.
The DTD file should be copied in in your local working directory.
The XML file is rebuilt every time a new build of the dblp web pages has been computed. You can always check for the timestamp of the XML file in the directory to learn when a new build is ready.
You can find information on how to parse the dblp XML file here.