ORCID Public Data File (2019)

folder orcid-dump-2019 (40 files)
fileORCID_2019_activites_0.tar.gz 4.55GB
fileORCID_2019_activites_1.tar.gz 4.50GB
fileORCID_2019_activites_2.tar.gz 3.94GB
fileORCID_2019_activites_3.tar.gz 4.22GB
fileORCID_2019_activites_4.tar.gz 4.40GB
fileORCID_2019_activites_5.tar.gz 4.66GB
fileORCID_2019_activites_6.tar.gz 3.91GB
fileORCID_2019_activites_7.tar.gz 4.43GB
fileORCID_2019_activites_8.tar.gz 4.06GB
fileORCID_2019_activites_9.tar.gz 4.51GB
fileORCID_2019_activites_X.tar.gz 4.38GB
fileORCID_2019_summaries.json.gz 8.14GB
fileORCID_2019_summaries.sample_10k.json.gz 10.27MB
fileORCID_2019_summaries.tar.gz 11.55GB
fileORCID_2019_summaries_json.tar.gz 10.56GB
file__ia_thumb.jpg 3.57kB
fileorcid-dump-2019_meta.sqlite 24.58kB
fileorcid-dump-2019_meta.xml 1.57kB
fileorcid-logo.png 2.03kB
fileorcid-logo_thumb.jpg 1.67kB
Type: Dataset

title= {ORCID Public Data File (2019)},
journal= {},
author= {ORCID},
year= {},
url= {https://orcid.figshare.com/articles/dataset/ORCID_Public_Data_File_2019/9988322},
abstract= {These files contain a snapshot of all public data in the ORCID Registry associated with an ORCID record that was created or claimed by an individual as of October 1st, 2019. ORCID publishes this file once per year under a Creative Commons CC0 1.0 Universal public domain dedication. This means that, to the extent possible under law, ORCID has waived all copyright and related or neighbouring rights to the Public Data File. For more information on the file, see https://orcid.org/content/orcid-public-data-file-use-policy

The file contains the public information associated with each user's ORCID record. The data is available in XML format and is further divided into separate files for easier management. One file contains the full record summary for each record. The rest of the data is divided into 11 files which contain the activities for each record including full work data.

Below is more complete description of how the data is structured.

Summaries file

Name: ORCID_2019_summaries.tar.gz
Description: Contains all the existing summaries, when extracted, it will generate the following file structure: summaries/[3 digits checksum]/[iD].xml
Example: If you are looking for the summary of iD '0000-0002-7869-831X', decompress the file and you will find the summary under 'summaries/31X/0000-0002-7869-831X.xml'.

Activities files


- ORCID_2019_activites_0.tar.gz
- ORCID_2019_activites_1.tar.gz
- ORCID_2019_activites_2.tar.gz
- ORCID_2019_activites_3.tar.gz
- ORCID_2019_activites_4.tar.gz
- ORCID_2019_activites_5.tar.gz
- ORCID_2019_activites_6.tar.gz
- ORCID_2019_activites_7.tar.gz
- ORCID_2019_activites_8.tar.gz
- ORCID_2019_activites_9.tar.gz
- ORCID_2019_activites_X.tar.gz

Description: Consists of 11 .tar.gz files, each file contains the public activities that belongs to an iD that contains a given checksum. The file hierarchy is as follows:
[checksum]/[3 digits checksum]/[iD]/[activity type]/[iD]_[activity_type]_[putcode].xml


If you are looking for the public activities that belong to `0000-0002-7869-831X:

Decompress the file 'ORCID_2019_activites_X.tar.gz'.
You will find all the public activities under 'X/31X/0000-0002-7869-831X/' which are then sub-divided in folders for each activity type.

If you are looking for all the employments that belong to '0000-0002-7869-831X':

Decompress the file 'ORCID_2019_activites_X.tar.gz',
Navigate to 'X/31X/0000-0002-7869-831X/employments'.

If you are looking for the employment with put-code '7923980' that belongs to '0000-0002-7869-831X' :

Decompress the file 'ORCID_2019_activites_X.tar.gz'.
You will find that employment under 'X/31X/0000-0002-7869-831X/employments/0000-0002-7869-831X_employments_7923980.xml'.},
keywords= {},
terms= {},
license= {https://creativecommons.org/share-your-work/public-domain/cc0/},
superseded= {}

Send Feedback