Spira(CLEg)R Documentation

Smoker dataset Spira et al. 2004

Description

Number of genes: 9968. Other covariates including age and gender are available in the original data. The training set consists of 38 subjects and the test data 37.

Data y=1 y=–1 Total
train 26 12 38
test 26 11 37

The response y=1 corresonds to a smoker or previous smoker while y=-1 corresponds to an individual who never smoked.

The original data was not divided into training and test. There are 27 males in the training and test datasets. There are 10 females in the training dataset and 9 in the test.

Usage

data(Spira)

Format

List with 4 named elements, X, y, Xt, yt, which are respectively the training design matrix, training classes, test design matrix and test classes.

Details

Source

www.pnas.org/cgi/doi/10.1073/pnas.0401422101

References

Avrum Spira, Jennifer Beanea, Vishal Shah, Gang Liu, Frank Schembri, Xuemei Yang, John Palma, and Jerome S. Brody (2004). Effects of cigarette smoke on the human airway epithelial cell transcriptome. PNAS July 6, 2004 vol. 101 no. 27

Examples

data(Spira)

[Package CLEg version 2.1 Index]