On this R-data statistics page, you will find information about the mgus data set which pertains to Monoclonal gammapothy data. The mgus data set is found in the survival R package. You can load the mgus data set in R by issuing the following command at the console data("mgus"). This will load the data into a variable called mgus. If R says the mgus data set is not found, you can try installing the package by issuing this command install.packages("survival") and then attempt to reload the data. If you need to download R, you can go to the R project website. You can download a CSV (comma separated values) version of the mgus R data set. The size of this file is about 11,493 bytes.
Monoclonal gammapothy data
Natural history of 241 subjects with monoclonal gammapothy of undetermined significance (MGUS).
mgus: A data frame with 241 observations on the following 12 variables.
R project statistics dataset table
||age in years at the detection of MGUS
||year of diagnosis
||for subjects who progress to a plasma cell malignancy
||the subtype of malignancy: multiple myeloma (MM) is the
||most common, followed by amyloidosis (AM), macroglobulinemia (MA),
||and other lymphprolifative disorders (LP)
||days from MGUS until diagnosis of a plasma cell malignancy
||days from diagnosis to last follow-up
||1= follow-up is until death
||albumin level at MGUS diagnosis
||creatinine at MGUS diagnosis
||hemoglobin at MGUS diagnosis
||size of the monoclonal protein spike at diagnosis
mgus1: The same data set in start,stop format. Contains the id, age, sex, and laboratory variable described above along with
R project statistics dataset table
||sequential intervals of time for each subject
||=1 if the interval ends in an event
||a factor containing the event type: censor, death, or plasma cell malignancy
||event number for each subject: 1 or 2
Plasma cells are responsible for manufacturing immunoglobulins, an important part of the immune defense. At any given time there are estimated to be about 10^6 different immunoglobulins in the circulation at any one time. When a patient has a plasma cell malignancy the distribution will become dominated by a single isotype, the product of the malignant clone, visible as a spike on a serum protein electrophoresis. Monoclonal gammapothy of undertermined significance (MGUS) is the presence of such a spike, but in a patient with no evidence of overt malignancy. This data set of 241 sequential subjects at Mayo Clinic was the groundbreaking study defining the natural history of such subjects. Due to the diligence of the principle investigator 0 subjects have been lost to follow-up.
Three subjects had MGUS detected on the day of death. In data set
mgus1 these subjects have the time to MGUS coded as .5 day before the death in order to avoid tied times.
These data sets were updated in Jan 2015 to correct some small errors.
Mayo Clinic data courtesy of Dr. Robert Kyle.
R Kyle, Benign monoclonal gammopathy – after 20 to 35 years of follow-up, Mayo Clinic Proc 1993; 68:26-36.
# Create the competing risk curves for time to first of death or PCM
sfit <- survfit(Surv(start, stop, event) ~ sex, mgus1, subset=(enum==1))
print(sfit) # the order of printout is the order in which they plotplot(sfit, xscale=365.25, lty=c(2,1,2,1), col=c(1,1,2,2),
xlab="Years after MGUS detection", ylab="Proportion")
legend(0, .8, c("Death/male", "Death/female", "PCM/male", "PCM/female"),
lty=c(1,1,2,2), col=c(2,1,2,1), bty='n')title("Curves for the first of plasma cell malignancy or death")
# The plot shows that males have a higher death rate than females (no
# surprise) but their rates of conversion to PCM are essentially the same.
Dataset imported from https://www.r-project.org.