R Dataset / Package Ecdat / nonEnglishNames

Documentation

This page documents the nonEnglishNames data set (Names with Character Set Problems), which is part of the Ecdat R package. To load it, attach the package with library(Ecdat) and then run data("nonEnglishNames") at the console; this creates a variable called nonEnglishNames. If R reports that the data set cannot be found, install the package with install.packages("Ecdat") and then try loading the data again. If you need R itself, you can download it from the R project website. A CSV (comma-separated values) version of the nonEnglishNames data set is also available for download; the file is about 231 bytes.
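The loading steps described above can be sketched as a small helper. This is only an illustrative sketch: the function name load_non_english_names is hypothetical (it is not part of Ecdat), and it assumes the data set ships with whatever version of Ecdat is installed. It returns NULL rather than failing when the package or the data set is unavailable.

```r
# Hypothetical helper (not part of Ecdat): load nonEnglishNames defensively.
load_non_english_names <- function() {
  # Ecdat not installed: the caller should run install.packages("Ecdat") first
  if (!requireNamespace("Ecdat", quietly = TRUE)) {
    return(NULL)
  }
  # Load into a private environment so the global workspace is not touched
  env <- new.env()
  suppressWarnings(data("nonEnglishNames", package = "Ecdat", envir = env))
  # Assumption: the installed Ecdat version contains this data set
  if (!exists("nonEnglishNames", envir = env, inherits = FALSE)) {
    return(NULL)
  }
  get("nonEnglishNames", envir = env)
}

nen <- load_non_english_names()
if (!is.null(nen)) {
  str(nen)  # inspect the structure of the data.frame
}
```

Loading into a separate environment is a design choice here, not a requirement; calling data("nonEnglishNames") after library(Ecdat) places the object in the current workspace instead.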


Names with Character Set Problems

Description

A data.frame describing names containing character codes rare or non-existent in standard English text, e.g., with various accent marks that may not be coded consistently in different locales or by different software.

Usage

data(nonEnglishNames)

Format

A data.frame with two columns:

nonEnglish

a character vector containing names that often have non-standard characters, with those characters replaced by "_"

English

a character vector containing a standard English-character translation of nonEnglish
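To make the two-column layout concrete, here is a minimal sketch of how such a table could be used as a lookup. Everything in it is an assumption for illustration: the rows of lookup are made up and are not the actual contents of nonEnglishNames, the function to_english is hypothetical, and this is not necessarily how subNonStandardCharacters works. The sketch treats each "_" as a single-character wildcard and assumes the names contain no other regular-expression metacharacters.

```r
# Made-up rows for illustration only; NOT the real contents of nonEnglishNames.
lookup <- data.frame(
  nonEnglish = c("Ra_l", "Fr_d_ric"),
  English    = c("Raul", "Frederic"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: replace names matching a nonEnglish pattern
# with the corresponding English spelling; other names pass through.
to_english <- function(x, table) {
  # Turn each "_" into "." so it matches any single character,
  # and anchor the pattern so only whole names match
  patterns <- paste0("^", gsub("_", ".", table$nonEnglish, fixed = TRUE), "$")
  out <- x
  for (i in seq_along(patterns)) {
    hit <- grepl(patterns[i], x)
    out[hit] <- table$English[i]
  }
  out
}

to_english(c("Ra_l", "Fr_d_ric", "Smith"), lookup)
# returns c("Raul", "Frederic", "Smith")
```

The input here is plain ASCII on purpose. How "." matches an accented character depends on the locale and encoding in use, which is the same inconsistency the data set is meant to address.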

See Also

grepNonStandardCharacters, subNonStandardCharacters

Examples

data(nonEnglishNames)
all.equal(ncol(nonEnglishNames), 2)

Dataset imported from https://www.r-project.org.

Attachment Size
dataset-98983.csv 231 bytes
Dataset License
GNU General Public License v2.0
Documentation License
GNU General Public License v2.0

This documentation is licensed under GPLv3 or later.