Most data fields in dblp XML records can provide attributes type, label and aux. The use of these attributes is limited at the moment but we are plannning a more extensive use in the future. Each attribute is used for a specific purpose.
type: A fine-grained description of the element content. Type is from a controlled vocabulary (see below). Example:
<note type="affiliation">University of Trier</note>
where 'affiliation' denotes that this note holds information on an affiliation (unlike a note element without type which can store arbitrary information). A data element can have more than one type. In this case, the type tags are separated by blanks. Example:
<ee type="oa archive">http...</ee>
denotes this electronic edition (web link) element as open access and marks it as not available from the original source (check archive.org instead).
label: is similar to type but contains free text descriptive information (not from a controlled vocabulary). However, there are content guidelines for some situations. Example
<note type="affiliation" label="since 2010">University of Trier</note>
denotes that the author is at this institution since 2010.
aux: reference to an auxiliary record. Auxiliary records contain additional information for the record and its data elements. The information is not listed in the primary record because it is too large, experimental or cannot be provided under ODC-BY licence. Example for a data element with an aux attribute:
<article key="conf/abc/Dow17"> <author aux="a1">John Dow</author> ...
where conf/abc/Dow17 and a1 together reference the auxiliary record.
The following vocabulary is defined for the attribute type (note that this list is subject to changes as we extend the vocabulary).
Note is a very generic data element that holds additional information for publication records or person records (person records represent information about an author, such as the affiliation). Depending on the type of record, different vocabularies are defined. At the moment, attribute type is used mostly for element note. The following values are defined:
|publication||source||Denotes the source of the record metadata.|
|publication||isbn||An ISBN number. Usually, isbn numbers are provided in the isbn element. A note with type isbn is used if an isbn is assigned to multiple publications.|
|publication||dnb||A DNB GND resource URL for the document.|
|publication||urn||A URN of the document.|
|publication||doi||The DOI of the document. Usually, DOIs in the ee element. A note with type doi is mostly used if a DOI is assigned to multiple publications.|
|publication||reviewid||Deprecated: id of a review paper identifying the document which was reviewed. Historically used for http://dblp.org/db/journals/dr/.|
|publication||rating||Deprecated: rating given as part of a review. Historically used for http://dblp.org/db/journals/dr/.|
|person||affiliation||Name of institution associated with the person.|
|person||unicode name (future: uname)|
Name of the person, using numeric unicode entities.
<note type="unicode name">白皛</note>
|person||award||Name of an award held by this person.|
Name of another person who might be confused with this author. Used rarely, mostly when authors with similar name have common coauthors or work at the same institution.
The ee element stores a link to an external electronic resource (e.g., a web page provided by the publisher, or a pdf document). The following vocabulary is defined for the type attribute of ee elements:
|oa||The field contains a link to an openly accessible version of the publication.|
|archive||The resource is no longer available. The ee field is retained for lookup in the wayback machine (see archive.org).|
The attribute label can store free text. For some use cases, we have defined content guidelines. We expect most strings stored in the attribute label to follow these guidelines. However, we do not guarantee compliance (e.g., in case of an unforseen situation that requires a specific content)
The following sections describe use cases with content guidelines:
note element with type affiliation
<note type="affiliation" label="X">University of Trier</note>
The label attribute describes the time the author spent at this institution. Note that for most affiliations, we do not provide these data as they are often difficult to determine. The following values of label are common:
- former: the author is no longer at this institution, but the end of engagement is not known
- since yyyy: the author is still at this institution
- untill yyyy: start of engagement unknown
- yyyy-xxxx: exact dates of engagement are known (e.g., 2008-2011)