On this Picostat.com statistics page, you will find information about the Titanic data set which pertains to Survival of passengers on the Titanic. The Titanic data set is found in the datasets R package. You can load the Titanic data set in R by issuing the following command at the console data("Titanic"). This will load the data into a variable called Titanic. If R says the Titanic data set is not found, you can try installing the package by issuing this command install.packages("datasets") and then attempt to reload the data. If you need to download R, you can go to the R project website. You can download a CSV (comma separated values) version of the Titanic R data set. The size of this file is about 62,279 bytes.
Survival of passengers on the Titanic
This data set provides information on the fate of passengers on the
fatal maiden voyage of the ocean liner ‘Titanic’, summarized according
to economic status (class), sex, age and survival.
A 4-dimensional array resulting from cross-tabulating 2201
observations on 4 variables. The variables and their levels are as
No || Name || Levels
1 || Class || 1st, 2nd, 3rd, Crew
2 || Sex || Male, Female
3 || Age || Child, Adult
4 || Survived || No, Yes
The sinking of the Titanic is a famous event, and new books are still
being published about it. Many well-known facts—from the
proportions of first-class passengers to the ‘women and
children first’ policy, and the fact that that policy was not
entirely successful in saving the women and children in the third
class—are reflected in the survival rates for various classes of
These data were originally collected by the British Board of Trade in
their investigation of the sinking. Note that there is not complete
agreement among primary sources as to the exact numbers on board,
rescued, or lost.
Due in particular to the very successful film ‘Titanic’, the last
years saw a rise in public interest in the Titanic. Very detailed
data about the passengers is now available on the Internet, at sites
such as Encyclopedia Titanica
Dawson, Robert J. MacG. (1995),
The ‘Unusual Episode’ Data Revisited.
Journal of Statistics Education, 3.
The source provides a data set recording class, sex, age, and survival
status for each person on board of the Titanic, and is based on data
originally collected by the British Board of Trade and reprinted in:
British Board of Trade (1990),
Report on the Loss of the ‘Titanic’ (S.S.).
British Board of Trade Inquiry Report (reprint).
Gloucester, UK: Allan Sutton Publishing.
mosaicplot(Titanic, main = "Survival on the Titanic")
## Higher survival rates in children?
apply(Titanic, c(3, 4), sum)
## Higher survival rates in females?
apply(Titanic, c(2, 4), sum)
## Use loglm() in package 'MASS' for further analysis ...
Dataset imported from https://www.r-project.org.