The CHRR Database Investigator


Table of Contents

 

4.0 Variables Window - Variables List

When the variables list is displayed each variable in the NLSY is depicted by a series of identifiers that are column heads: Name, Description, Reference No., Question Name, Survey Year, Area of Interest, and sometimes Data Value. The original display is sorted in ascending order by variable Name. The contents of the variable Name field will be used in SAS, SPSS, and Stata Dictionary packages to identify the variable. For the CHRR Database Investigator software, the contents of the Name field is derived from the Reference Number field minus the decimal point (.). The Reference Number (e.g., R00001.00) is a unique, longitudinal identifier for every variable in the NLSY79 and is machine-assigned sequentially. The contents of the Name field (e.g., R0000100) may be changed to something more mnemonic in one of the statistical packages. Once it has been changed it may no longer be a unique identifier.

If you click on one of the other column heads, the list of variables will resort in ascending order by that identifier. Sorting the variables can be a useful way bringing together longitudinal data from various years, or those with characteristics (Description, Question Name) similar to each other. Sorting can also show you which variables in a group were created for you or were responses to questions.

To sort by Question Name, left-click on the Question Name column heading. You will notice:

  1. All the questions with '*Created*" in the Question Name column sort to the top;
  2. Below them the question names sort into groups with similar names; and
  3. The descriptions within the similar groups often resemble one another.

This process of sorting the variables can help you identify longitudinal items and the scope of their frequency. It can help you distinguish between questions with like Question Names but with differing Descriptions or it can show a correlation between the Question Names and Descriptions.

Now, left-click on the Name column heading to sort the variables in their default order. Your results should look like Figure 2 below.

Figure 2 The Variables Window - List of COMMON Variables

The variable at the top of the list "R0000100   IDENTIFICATION CODE 79 INT... etc." should be highlighted. From this window, the Variables window, you will 'tag' or select variables for your extracts. To select a variable for extract, just place a checkmark in its 'checkbox' under the column entitled Name. To check the box, left-click on it.

However, at this point you may not know enough about a variable to select it for extract. If you are unfamiliar with a variable, this list serves as little more than a reminder or description of what the variable is. To become more familiar with a specific variable, you must examine it in the codebook.

Double-click on R0000100   IDENTIFICATION CODE 79 INT...to open the codebook entry for the identification code variable.


Table of Contents

5.0 Text/Cdbk Window - Individual Variable

When you double-click on "R0000100   IDENTIFICATION CODE 79 INT..." the Text/Cdbk window will appear as shown in Figure 3 displaying detailed information about the variable. The format you see is the codebook format of a variable. R00001.   [CASEID]  IDENTIFICATION CODE 79 INT represents the case ID of the respondent. It is an arbitrary, yet unique, identifier for each respondent as explained in the NOTE on the codebook page. This identifier preserves the confidentiality of the respondent and the respondent's household. (Ignore the Documentation Links: Codebook Supplement-Appendix 2 = NLSY79 User's Guide: NLSY79 Codebook Supplement Main File 1979-1998, Appendix 2, it is a dead end for R00001.)


NOTE:  THIS NUMBER WAS ASSIGNED ACCORDING TO THE
RECORD NUMBER OF EACH RESPONDENT ON THE TAPE.
THE FIRST RESPONDENT WAS ASSIGNED ID#1, THE SECOND2:
WAS ASSIGNED ID#2, ETC.

5.1 CASEID

In most, if not all, of your data extracts the CaseID or case identification code should be included. The case identification code will help you resolve inconsistencies in a particular case. For example, suppose a case consistently reported an annual salary of around $50,000: one year it's $48,500, the next it's $50,000, the following year it's $51,250, and then it's $513,750 and back to $52,750 the next. Using the case identification code you could go to that 'outlying' case and look at the particular datum for the anomaly. From this observation you could then decide whether to keep the case in the extract or to throw it out because of the conflicting data. You can change to a particular case by using the Data\View Case... option in the Main Menu of the software.

Figure 3 Text/Cdbk Window - R0000100 IDENTIFICATION CODE 79 INT Variable

As shown in Figure 3 the codebook page includes additional information about a variable. This information will be more relevant when you look at other variables. It is a good idea in the beginning to familiarize yourself with this information about each of the variables that you will want to use in an extract.


Table of Contents

 

6.0 Process of Variable Identification

The process used in Figures 1 - 3 is a good way to become familiar with NLSY variables.
  • Begin in the Contents window (top frame) by double-clicking on an Index to view its contents. The contents of an index are groupings of like variables. You must open the group to view the variables.
  • Select one of the content-groups and double-click on it to view the variables contained therein. Sort the variables by Description (click on the column heading "Description") to more easily identify like variables. Note the likenesses and differences of these sub groups. Are the variables created?
  • Then select one of the variables in the group and double-click on it to view the specific details about the individual variable in the Text/Cdbk window. Look for documentation links, notes, or references to other variables.
  • Lastly, be aware that in the Documents subdirectory on the CD (Contents window, top frame, Documents subdirectory) the hypertext document NLSY79 User's Guide contains more detailed information about some of the variables. The process of finding this detailed information is a little more involved and will be pointed out in the examples below or skip to a summary of the Process of Finding Variable Details.

Next section, 7.0 The COMMON Variables.


Table of Contents

4-6-var&text-windows.html