This repository contains instructions for extracting and transforming OpenAlex data for analysis with Google BigQuery.
$ aws s3 sync 's3://openalex' 'openalex-snapshot' --no-sign-request
$ sbatch openalex_hpc.sh
$ gsutil -m cp -r /scratch/users/haupka/works gs://bigschol
$ bq load --ignore_unknown_values --source_format=NEWLINE_DELIMITED_JSON subugoe-collaborative:openalex.works gs://bigschol/works/*.gz schema_openalex_work.json
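The four steps above can be chained in one script. The following is a minimal sketch, not part of this repository: the `run` and `run_pipeline` helpers and the `DRY_RUN` variable are hypothetical names introduced for illustration. It also assumes that `sbatch` is called with `--wait`, because a plain `sbatch` returns as soon as the job is submitted and the upload would otherwise start before the transformation has finished. By default the script only prints the commands; set `DRY_RUN=0` to execute them.

```shell
#!/bin/sh
set -eu

# Hypothetical helper: print the command when DRY_RUN=1 (the default),
# execute it otherwise.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Hypothetical wrapper around the four commands listed above.
run_pipeline() {
  # 1. Download the OpenAlex snapshot from the public S3 bucket.
  run aws s3 sync 's3://openalex' 'openalex-snapshot' --no-sign-request

  # 2. Transform the snapshot on the HPC cluster.
  #    Assumption: --wait is added so the script blocks until the job ends.
  run sbatch --wait openalex_hpc.sh

  # 3. Upload the transformed files to Google Cloud Storage.
  run gsutil -m cp -r /scratch/users/haupka/works gs://bigschol

  # 4. Load the files into BigQuery. The wildcard is quoted so that bq,
  #    not the local shell, resolves it.
  run bq load --ignore_unknown_values \
    --source_format=NEWLINE_DELIMITED_JSON \
    subugoe-collaborative:openalex.works \
    'gs://bigschol/works/*.gz' \
    schema_openalex_work.json
}

run_pipeline
```

Running the script without setting `DRY_RUN` prints one line per step, which makes it easy to check paths and bucket names before starting a long transfer.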