R Dataset / Package mosaicData / Whickham


On this Picostat.com statistics page, you will find information about the Whickham data set which pertains to Data from the Whickham survey. The Whickham data set is found in the mosaicData R package. You can load the Whickham data set in R by issuing the following command at the console data("Whickham"). This will load the data into a variable called Whickham. If R says the Whickham data set is not found, you can try installing the package by issuing this command install.packages("mosaicData") and then attempt to reload the data. If you need to download R, you can go to the R project website. You can download a CSV (comma separated values) version of the Whickham R data set. The size of this file is about 21,262 bytes.

Data from the Whickham survey


Data on age, smoking, and mortality from a one-in-six survey of the electoral roll in Whickham, a mixed urban and rural district near Newcastle upon Tyne, in the UK. The survey was conducted in 1972-1974 to study heart disease and thyroid disease. A follow-up on those in the survey was conducted twenty years later.




A data frame with 1314 observations on women for the following variables.

  • outcome survival status after 20 years: a factor with levels Alive Dead

  • smoker smoking status at baseline: a factor with levels No Yes

  • age age (in years) at the time of the first survey


This dataset contains a subset of the survey sample: women who were classified as current smokers or as never having smoked. The data were synthesized from the summary description tables given in the Appleton et al al paper.


DR Appleton, JM French, MPJ Vanderpump. "Ignoring a covariate: an example of Simpson's paradox". (1996) American Statistician, 50(4):340-341.



Dataset imported from https://www.r-project.org.

Title Authored on Content type
R Dataset / Package psych / bfi March 9, 2018 - 1:06 PM Dataset
OpenIntro Statistics Dataset - scotus_healthcare August 9, 2020 - 2:38 PM Dataset
R Dataset / Package psych / withinBetween March 9, 2018 - 1:06 PM Dataset
R Dataset / Package Stat2Data / Kids198 March 9, 2018 - 1:06 PM Dataset
R Dataset / Package Ecdat / Wages1 March 9, 2018 - 1:06 PM Dataset
Attachment Size
dataset-32411.csv 20.76 KB
Dataset License
GNU General Public License v2.0
Documentation License
GNU General Public License v2.0