Sections
Personal tools
You are here: Home Help Data Extraction
Document Actions

Data Extraction

SodaPop is designed to provide access to analysis-ready subsets of data collections. Please do not attempt to download entire datasets or large subsets of variables as this may cause system errors. PRI affiliates can access complete datasets using the PopNet (UNIX) or Paris (Windows) servers. Other members of the Penn State community may contact the Data Archivist for assistance in accessing complete datasets.

Downloading

To access the data extraction page, click the download tab, and then on the link in the middle column that corresponds to the dataset you wish to extract. Select the desired variables, the number of observations to be printed, enter a new dataset name, and select the output format (SAS Version 9 Dataset, Comma Delimited, Tab Delimited, Stata Data File, or SPSS Data File). When you click the extract button, an extracted file is saved to a temporary directory that is swept nightly and an extract output page is generated that contains a proc contents of the extracted dataset and a proc print (listing of values) of the selected number of observations. To download the extracted data, click the link at the top of the extract output page.

If you wish to analyze data in another software package, open SAS, create an extract from the SAS query window, and export data using the SAS Export Wizard to your preferred file format. Additional file conversions can be accomplished with Stat/Transfer software (note that Stat/Transfer is not available as standard software in Penn State computer labs).

Pre-Selected Variables

SodaPop may pre-select variables for you. Typically these are the variables required to identify a unique record in the file. In some cases, when a unique identifier does not exist, we have created one.


Contact the SodaPop Team last modified 11-02-2006 15:13
about help
 

Privacy and Legal Statements | Copyright Information
Copyright ©2008, The Pennsylvania State University

Powered by Plone, the Open Source Content Management System