
btskinner/downloadipeds

Purpose

Use this script to batch download Integrated Postsecondary Education Data System (IPEDS) files. The downloaded files are not unzipped or processed in any way. This script simply saves you the trouble of having to point and click your way through the data center.

Only those files listed in ipeds_file_list.txt will be downloaded. The default behavior is to download each of the following file types into its own subdirectory:

  1. Data file
  2. Dictionary file

You can also choose to download other data versions and/or program files:

  1. Data file (Stata version)
  2. Stata program file (default if you ask for Stata version data)
  3. SPSS program file
  4. SAS program file

IMPORTANT NOTE

The default behavior is to download ALL OF IPEDS. If you don't want everything, modify ipeds_file_list.txt to include only those files that you want. Simply erase those you don't want, keeping one file name per row.
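
If you would rather not trim the list by hand, the sketch below shows one way to filter it in R. It assumes the list holds one file name per line, as described above; the helper name subset_file_list and the 2014 file names are hypothetical and used only for illustration.

```r
## keep only the file names that match a regular expression
## (hypothetical helper, not part of downloadipeds.R)
subset_file_list <- function(file_names, pattern) {
  file_names <- trimws(file_names)                  # drop stray whitespace
  file_names <- file_names[nzchar(file_names)]      # drop empty rows
  file_names[grepl(pattern, file_names)]
}

## example: keep only names ending in 2015
all_files <- c("HD2015", "IC2015", "HD2014", "IC2014")
subset_file_list(all_files, "2015$")

## to apply this to the real list (back up the original first):
## all_files <- readLines("ipeds_file_list.txt")
## writeLines(subset_file_list(all_files, "2015$"), "ipeds_file_list.txt")
```

The pattern is an ordinary regular expression, so you can select by survey prefix, year, or both.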

To Run

For the default options, run the script downloadipeds.R as is. It will create subdirectories as needed to store data and program files. You may wish to place the script in its own directory. Just make sure that ipeds_file_list.txt is in the same directory. For example, let's say you place both files in a directory called ipeds:

./ipeds
|__ downloadipeds.R
|__ ipeds_file_list.txt
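
Before running, you may want to confirm that both files really are side by side. The following is a minimal sketch of such a check; the helper name missing_files is hypothetical and not part of the script.

```r
## return the names of required files that are absent from a directory
## (hypothetical helper, not part of downloadipeds.R)
missing_files <- function(dir = ".") {
  needed <- c("downloadipeds.R", "ipeds_file_list.txt")
  needed[!file.exists(file.path(dir, needed))]
}

## example: check the current working directory
missing_files(".")

## if nothing is missing, run the script from that directory:
## setwd("ipeds")
## source("downloadipeds.R")
```

An empty result means both files were found.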

If you run it with the default options, two new subdirectories will be created, one for the data files and one for the dictionary files. Using the above example, your directory will look like this after it is finished:

./ipeds
|__ downloadipeds.R
|__ ipeds_file_list.txt
|__ /data
|   |__ HD2015.zip
|   |__ IC2015.zip
|   |__ <...>
|
|__ /dictionary
    |__ HD2015_Dict.zip
    |__ IC2015_Dict.zip
    |__ <...>
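
Because the script leaves everything zipped, you may want to extract the archives afterwards. Here is a rough sketch using utils::unzip() from base R; the helper name unzip_all and the output directory data_unzipped are assumptions made for this example.

```r
## unzip every .zip file in zip_dir into out_dir and
## return the paths of the extracted files
## (hypothetical helper, not part of downloadipeds.R)
unzip_all <- function(zip_dir, out_dir) {
  zips <- list.files(zip_dir, pattern = "\\.zip$", full.names = TRUE)
  dir.create(out_dir, showWarnings = FALSE, recursive = TRUE)
  extracted <- lapply(zips, function(z) utils::unzip(z, exdir = out_dir))
  as.character(unlist(extracted))
}

## example: extract all downloaded data files
## unzip_all("data", "data_unzipped")
```

Extracting into a separate directory keeps the original downloads untouched.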

To download other program scripts or Stata versions of the data, change the following options in downloadipeds.R from FALSE to TRUE:

## -----------------------------------------------------------------------------
## CHOOSE WHAT YOU WANT (TRUE == Yes, FALSE == No)
## -----------------------------------------------------------------------------

## default
primary_data = TRUE
dictionary = TRUE

## STATA version
## (NB: downloading Stata version of data will also get Stata program files)
stata_data = FALSE

## other program files
prog_spss = FALSE
prog_sas  = FALSE
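
To see how these options relate to the subdirectories you end up with, here is an illustrative sketch. It is not taken from downloadipeds.R; the helper name planned_subdirs is hypothetical, and the subdirectory names are the ones listed in the size table in this README.

```r
## list the subdirectories a given set of options would be expected to fill
## (hypothetical helper; illustrates the options, not the script's own code)
planned_subdirs <- function(primary_data = TRUE, dictionary = TRUE,
                            stata_data = FALSE, prog_spss = FALSE,
                            prog_sas = FALSE) {
  c(if (primary_data) "data",
    if (dictionary) "dictionary",
    ## asking for Stata data also brings the Stata program files
    if (stata_data) c("stata_data", "stata_prog"),
    if (prog_spss) "spss_prog",
    if (prog_sas) "sas_prog")
}

## defaults
planned_subdirs()
## with Stata data requested
planned_subdirs(stata_data = TRUE)
```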

Data size

As of 10 December 2016, downloading all IPEDS files (setting all options to TRUE) requires approximately 1.5 GB of disk space. Granted, you probably don't need both regular and Stata versions of the data files (which are the bulk of the directory size). Here are the approximate subdirectory file sizes if you download all data files from all years:

Subdirectory     Approximate Size
./data           770.5 MB
./dictionary     16.6 MB
./sas_prog       5.1 MB
./spss_prog      4.9 MB
./stata_data     673.9 MB
./stata_prog     5.7 MB
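
To estimate the space you need for a particular combination of options, you can add up the relevant rows. The sketch below uses the approximate sizes listed above (as of 10 December 2016); the variable names are only for illustration.

```r
## approximate subdirectory sizes in MB, copied from the list above
sizes_mb <- c(data = 770.5, dictionary = 16.6, sas_prog = 5.1,
              spss_prog = 4.9, stata_data = 673.9, stata_prog = 5.7)

## everything: about 1476.7 MB, or roughly 1.5 GB
total_mb <- sum(sizes_mb)

## default options only (data + dictionary): about 787.1 MB
default_mb <- sum(sizes_mb[c("data", "dictionary")])

round(c(total = total_mb, default = default_mb), 1)
```

Current totals are likely larger, since new IPEDS files have been released since these figures were recorded.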

Combine

To combine multiple IPEDS data files into a single dataset, you may find the following script useful: Combine IPEDS