The public data file is currently published as an uncompressed tar file that contains two copies of the data: one in XML and one in JSON. To save on bandwidth I suggest that you:

* Publish two separate files, one for each format. Users then only need to retrieve the data file they need; it's very unlikely that anyone will want both formats.
* Compress the tar file using gzip.

I'm not sure what your plans are for updating the file, but I'd suggest that you keep some historical copies to allow some analysis, e.g. of growth in the number of ORCIDs.

You may also want to consider depositing the file with the Internet Archive.

Can I also suggest that you get an Open Data Certificate for the data? https://certificates.theodi.org/
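
For what it's worth, here is a rough sketch of how the two bandwidth suggestions above (one file per format, each gzipped) could be done in a single pass over the existing tar. I don't know how the file is actually laid out or built, so this assumes the two copies can be told apart by their `.xml` and `.json` file extensions; the function name `split_by_format` and the output names `public_data_xml.tar.gz` and `public_data_json.tar.gz` are made up for illustration.

```python
import tarfile
from pathlib import Path


def split_by_format(src, out_dir, formats=("xml", "json")):
    """Split one uncompressed tar into one gzip-compressed tar per format.

    Assumption: members are distinguished only by file extension.
    Output file names are hypothetical, not the publisher's real names.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    # One output archive per format, e.g. public_data_xml.tar.gz
    outputs = {fmt: str(out_dir / f"public_data_{fmt}.tar.gz") for fmt in formats}
    writers = {fmt: tarfile.open(path, "w:gz") for fmt, path in outputs.items()}

    try:
        # "r:" opens the source as a plain, uncompressed tar
        with tarfile.open(str(src), "r:") as source:
            for member in source:
                if not member.isfile():
                    continue  # skip directories and special entries
                suffix = Path(member.name).suffix.lstrip(".").lower()
                if suffix in writers:
                    # Stream the member straight into the matching archive
                    writers[suffix].addfile(member, source.extractfile(member))
    finally:
        for writer in writers.values():
            writer.close()

    return outputs
```

Calling `split_by_format("dump.tar", "out")` (with `dump.tar` standing in for the current download) would leave two smaller archives in `out/`, so someone who only wants JSON never has to fetch the XML copy.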