This Picostat.com statistics page describes the nlsyV data set, an extract from the National Longitudinal Survey of Youth. The nlsyV data set is part of the mi R package. To load it, run data("nlsyV", package = "mi") at the R console; this creates a data frame called nlsyV in your workspace. If R reports that the package or the data set cannot be found, install the package with install.packages("mi") and then load the data again. If you need to download R, you can get it from the R project website. A CSV (comma-separated values) version of the nlsyV data set is also available for download; the file is about 7,977 bytes.
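The loading steps described above can be sketched as a short R session. This is a minimal sketch, assuming an internet connection for the install step; the CRAN mirror URL passed to `repos` is an assumption and can be replaced with any mirror.

```r
# Install 'mi' only if it is not already available.
# The mirror URL is an assumption; any CRAN mirror should work.
if (!requireNamespace("mi", quietly = TRUE)) {
  install.packages("mi", repos = "https://cloud.r-project.org")
}

# Load the data set from the package without attaching the whole package.
data("nlsyV", package = "mi")

# Inspect the result: dimensions, column types, and the first few rows.
dim(nlsyV)
str(nlsyV)
head(nlsyV)
```

Passing `package = "mi"` to `data()` avoids having to call `library(mi)` first, which is useful when only the data set is needed.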
National Longitudinal Survey of Youth Extract
This dataset pertains to children and their families in the United States and
is intended to illustrate missing data issues. Note that although the original
data are longitudinal, this extract is not.
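Because the data set is intended to illustrate missing data issues, a natural first step is to count the missing values in each column. The sketch below uses only base R; `summarize_missing` is a hypothetical helper name introduced here for illustration and is not part of the mi package.

```r
# Hypothetical helper: per-column count and percentage of missing values.
summarize_missing <- function(df) {
  n_missing <- unname(colSums(is.na(df)))
  data.frame(
    variable    = names(df),
    n_missing   = as.integer(n_missing),
    pct_missing = round(100 * n_missing / nrow(df), 1)
  )
}

# Apply it to nlsyV when the 'mi' package is installed.
if (requireNamespace("mi", quietly = TRUE)) {
  data("nlsyV", package = "mi")
  print(summarize_missing(nlsyV))
  # Number of rows with no missing values in any column.
  print(sum(complete.cases(nlsyV)))
}
```

The helper works on any data frame, so it can also be applied to the CSV version of the data after reading it with `read.csv()`.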
A data frame with 400 randomly subsampled observations on the following 7 variables.
a numeric vector with data on the Peabody Picture Vocabulary Test (Revised) administered at 36 months
indicator for whether child was first-born
indicator for whether mother was married when child was born
a numeric vector with data on family income in year after the child was born
a numeric vector with data on the age of the mother when the child was born
educational status of mother when child was born (1 = less than high school, 2 = high school graduate, 3 = some college, 4 = college graduate)
race of mother (1 = black, 2 = Hispanic, 3 = white)
Note that momed would typically be an ordered factor while momrace would typically be an unordered factor, but both are numeric in this data.frame in order to illustrate the mechanism for changing the type of a variable.
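To make the note above concrete, the following sketch recodes the two numeric columns as factors using plain base R. It is not the mi package's own type-changing mechanism. The toy data frame and the helper name `recode_mother_vars` are assumptions made for illustration; only the column names momed and momrace and the code labels come from the variable descriptions above.

```r
# Toy stand-in using the same numeric coding as the documentation;
# the values are made up and are not taken from nlsyV.
toy <- data.frame(
  momed   = c(1, 3, NA, 4, 2),
  momrace = c(3, 1, 2, NA, 3)
)

# Hypothetical helper: convert the numeric codes to labelled factors.
recode_mother_vars <- function(df) {
  # Education has a natural order, so use an ordered factor.
  df$momed <- factor(
    df$momed,
    levels  = 1:4,
    labels  = c("less than high school", "high school graduate",
                "some college", "college graduate"),
    ordered = TRUE
  )
  # Race has no natural order, so use a plain (unordered) factor.
  df$momrace <- factor(
    df$momrace,
    levels = 1:3,
    labels = c("black", "Hispanic", "white")
  )
  df
}

toy_f <- recode_mother_vars(toy)
str(toy_f)
```

Missing values stay missing after the conversion, and the ordered factor supports comparisons such as `toy_f$momed >= "some college"`. The same helper could be applied to nlsyV itself once the data are loaded.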
National Longitudinal Survey of Youth, 1997, http://www.bls.gov/nls/nlsy97.htm
Dataset imported from https://www.r-project.org.