ORCID Public Data File (2020)

orcid-dump-2020 (40 files)
ORCID_2020_10_activities_0.tar.gz 6.13GB
ORCID_2020_10_activities_1.tar.gz 5.75GB
ORCID_2020_10_activities_2.tar.gz 5.25GB
ORCID_2020_10_activities_3.tar.gz 5.55GB
ORCID_2020_10_activities_4.tar.gz 5.93GB
ORCID_2020_10_activities_5.tar.gz 6.13GB
ORCID_2020_10_activities_6.tar.gz 5.14GB
ORCID_2020_10_activities_7.tar.gz 5.68GB
ORCID_2020_10_activities_8.tar.gz 5.54GB
ORCID_2020_10_activities_9.tar.gz 5.93GB
ORCID_2020_10_activities_X.tar.gz 6.03GB
ORCID_2020_10_summaries.json.gz 11.21GB
ORCID_2020_10_summaries.sample_10k.json.gz 10.97MB
ORCID_2020_10_summaries.tar.gz 16.25GB
__ia_thumb.jpg 3.79kB
history/files/ORCID_2020_10_summaries.sample_10k.json.gz.~1~ 54.95kB
orcid-dump-2020_meta.sqlite 26.62kB
orcid-dump-2020_meta.xml 1.74kB
orcid-logo.png 2.03kB
orcid-logo_thumb.jpg 1.67kB
Type: Dataset

title= {ORCID Public Data File (2020)},
journal= {},
author= {ORCID},
year= {},
url= {https://orcid.figshare.com/articles/dataset/ORCID_Public_Data_File_2020/13066970},
abstract= {These files contain a snapshot of all public data in the ORCID Registry associated with an ORCID record that was created or claimed by an individual as of October 1st, 2020. ORCID publishes this file once per year under a Creative Commons CC0 1.0 Universal public domain dedication. This means that, to the extent possible under law, ORCID has waived all copyright and related or neighbouring rights to the Public Data File. For more information on the file, see https://orcid.org/content/orcid-public-data-file-use-policy

The file contains the public information associated with each user's ORCID record. The data is available in XML format and is further divided into separate files for easier management. One file contains the full record summary for each record. The rest of the data is divided into 11 files which contain the activities for each record including full work data.

Below is more complete description of how the data is structured.

Summaries file

Name: ORCID_2020_10_summaries.tar.gz
Description: Contains all the existing summaries, when extracted, it will generate the following file structure: summaries/[3 digits checksum]/[iD].xml
Example: If you are looking for the summary of iD '0000-0002-7869-831X', decompress the file and you will find the summary under 'summaries/31X/0000-0002-7869-831X.xml'.

Activities files


- ORCID_2020_10_activites_0.tar.gz
- ORCID_2020_10_activites_1.tar.gz
- ORCID_2020_10_activites_2.tar.gz
- ORCID_2020_10_activites_3.tar.gz
- ORCID_2020_10_activites_4.tar.gz
- ORCID_2020_10_activites_5.tar.gz
- ORCID_2020_10_activites_6.tar.gz
- ORCID_2020_10_activites_7.tar.gz
- ORCID_2020_10_activites_8.tar.gz
- ORCID_2020_10_activites_9.tar.gz
- ORCID_2020_10_activites_X.tar.gz

Description: Consists of 11 .tar.gz files, each file contains the public activities that belongs to an iD that contains a given checksum. The file hierarchy is as follows:
[checksum]/[3 digits checksum]/[iD]/[activity type]/[iD]_[activity_type]_[putcode].xml


If you are looking for the public activities that belong to `0000-0002-7869-831X:

Decompress the file 'ORCID_2020_10_activites_X.tar.gz'.
You will find all the public activities under 'X/31X/0000-0002-7869-831X/' which are then sub-divided in folders for each activity type.

If you are looking for all the employments that belong to '0000-0002-7869-831X':

Decompress the file 'ORCID_2020_10_activites_X.tar.gz',
Navigate to 'X/31X/0000-0002-7869-831X/employments'.

If you are looking for the employment with put-code '7923980' that belongs to '0000-0002-7869-831X' :

Decompress the file 'ORCID_2020_10_activites_X.tar.gz'.
You will find that employment under 'X/31X/0000-0002-7869-831X/employments/0000-0002-7869-831X_employments_7923980.xml'.
keywords= {},
terms= {},
license= {https://creativecommons.org/share-your-work/public-domain/cc0/},
superseded= {}

Send Feedback