Data Extraction
SodaPop is designed to provide access to analysis-ready subsets of data collections. Please do not attempt to download entire datasets or large subsets of variables as this may cause system errors. PRI affiliates can access complete datasets using the PopNet (UNIX) or Paris (Windows) servers. Other members of the Penn State community may contact the Data Archivist for assistance in accessing complete datasets.
Downloading
To access the data extraction page, click the download tab,
and then on the link in the middle column that corresponds to the
dataset you wish to extract. Select the desired variables, the number
of observations to be printed, enter a new dataset name, and select the
output format (SAS Version 9 Dataset, Comma Delimited, Tab Delimited,
Stata Data File, or SPSS Data File). When you click the extract button,
an extracted file is saved to a temporary directory that is swept
nightly and an extract output page is generated that contains a proc
contents of the extracted dataset and a proc print (listing of values)
of the selected number of observations. To download the extracted data,
click the link at the top of the extract output page.
If
you wish to analyze data in another software package, open SAS, create
an extract from the SAS query window, and export data using the SAS
Export Wizard to your preferred file format. Additional file
conversions can be accomplished with Stat/Transfer
software (note that Stat/Transfer is not available as standard software in Penn
State computer labs).
Pre-Selected Variables
SodaPop may pre-select variables for you. Typically these are the variables required to identify a unique record in the file. In some cases, when a unique identifier does not exist, we have created one.
