khanmiss               package:impute               R Documentation

_K_h_a_n _m_i_c_r_o_a_r_r_a_y _d_a_t_a _w_i_t_h _r_a_n_d_o_m _m_i_s_s_i_n_g _v_a_l_u_e_s

_D_e_s_c_r_i_p_t_i_o_n:

     A text file containing the Khan micorarray data with random
     missing values introduced for illustrative purposes

_U_s_a_g_e:

        data(khanmiss)

_F_o_r_m_a_t:

     The data set 'khanmiss' consists of 2310 rows and 65 columns. Row
     1 has the sample labels, Row 2 has the class labels. The remaining
     rows are gene expression. Column 1 is a dummy gene number. Column
     2 is the gene name. Remaining columns are gene expression.

     Please note that this dataset was derived from the original by
     introducing some random missing values purely for the purpose of
     illustration.

_S_o_u_r_c_e:

     Khan, J.  and  Wei, J.S.  and Ringner, M. and  Saal, L. and 
     Ladanyi, M. and Westermann, F.  and  Berthold, F. and Schwab, M.
     and  Antonescu, C. and Peterson, C. and and Meltzer, P. (2001)
     Classification and diagnostic prediction of cancers using gene
     expression profiling and artificial neural network. Nature
     Medicine 7, 673-679.

_R_e_f_e_r_e_n_c_e_s:

     Robert Tibshirani, Trevor Hastie, Balasubramanian Narasimhan, and
     Gilbert Chu  (2002). Diagnosis of multiple cancer types by
     shrunken centroids of gene expression PNAS 99: 6567-6572.  
     Available at www.pnas.org

_E_x_a_m_p_l_e_s:

       data(khanmiss)

