R Dataset / Package survival / flchain


On this Picostat.com statistics page, you will find information about the flchain data set which pertains to Assay of serum free light chain for 7874 subjects.. The flchain data set is found in the survival R package. You can load the flchain data set in R by issuing the following command at the console data("flchain"). This will load the data into a variable called flchain. If R says the flchain data set is not found, you can try installing the package by issuing this command install.packages("survival") and then attempt to reload the data. If you need to download R, you can go to the R project website. You can download a CSV (comma separated values) version of the flchain R data set. The size of this file is about 332,439 bytes.

Assay of serum free light chain for 7874 subjects.


This is a stratified random sample containing 1/2 of the subjects from a study of the relationship between serum free light chain (FLC) and mortality. The original sample contains samples on approximately 2/3 of the residents of Olmsted County aged 50 or greater.




A data frame with 7874 persons containing the following variables.


age in years


F=female, M=male


the calendar year in which a blood sample was obtained


serum free light chain, kappa portion


serum free light chain, lambda portion


the FLC group for the subject, as used in the original analysis


serum creatinine


1 if the subject had been diagnosed with monoclonal gammapothy (MGUS)


days from enrollment until death. Note that there are 3 subjects whose sample was obtained on their death date.


0=alive at last contact date, 1=dead


for those who died, a grouping of their primary cause of death by chapter headings of the International Code of Diseases ICD-9


In 1995 Dr. Robert Kyle embarked on a study to determine the prevalence of monoclonal gammopathy of undetermined significance (MGUS) in Olmsted County, Minnesota, a condition which is normally only found by chance from a test (serum electrophoresis) which is ordered for other causes. Later work suggested that one component of immunoglobulin production, the serum free light chain, might be a possible marker for immune disregulation. In 2010 Dr. Angela Dispenzieri and colleagues assayed FLC levels on those samples from the original study for which they had patient permission and from which sufficient material remained for further testing. They found that elevated FLC levels were indeed associated with higher death rates.

Patients were recruited when they came to the clinic for other appointments, with a final random sample of those who had not yet had a visit since the study began. An interesting side question is whether there are differences between early, mid, and late recruits.

This data set contains an age and sex stratified random sample that includes 7874 of the original 15759 subjects. The original subject identifiers and dates have been removed to protect patient identity. Subsampling was done to further protect this information.


The primary investigator (A Dispenzieri) and statistician (T Therneau) for the study.


A Dispenzieri, J Katzmann, R Kyle, D Larson, T Therneau, C Colby, R Clark, G Mead, S Kumar, LJ Melton III and SV Rajkumar (2012). Use of monclonal serum immunoglobulin free light chains to predict overall survival in the general population, Mayo Clinic Proceedings 87:512-523.

R Kyle, T Therneau, SV Rajkumar, D Larson, M Plevak, J Offord, A Dispenzieri, J Katzmann, and LJ Melton, III, 2006, Prevalence of monoclonal gammopathy of undetermined significance, New England J Medicine 354:1362-1369.


age.grp <-  cut(flchain$age, c(49,54, 59,64, 69,74,79, 89, 110),
               labels= paste(c(50,55,60,65,70,75,80,90),
                             c(54,59,64,69,74,79,89,109), sep='-'))
table(flchain$sex, age.grp)

Dataset imported from https://www.r-project.org.

Title Authored on Content type
R Dataset / Package psych / bfi March 9, 2018 - 1:06 PM Dataset
OpenIntro Statistics Dataset - scotus_healthcare August 9, 2020 - 2:38 PM Dataset
R Dataset / Package psych / withinBetween March 9, 2018 - 1:06 PM Dataset
R Dataset / Package Stat2Data / Kids198 March 9, 2018 - 1:06 PM Dataset
R Dataset / Package Ecdat / Wages1 March 9, 2018 - 1:06 PM Dataset
Attachment Size
dataset-22069.csv 324.65 KB
Dataset License
GNU General Public License v2.0
Documentation License
GNU General Public License v2.0