classifyKNNsc           package:maigesPack           R Documentation

_F_u_n_c_t_i_o_n _t_o _d_o _d_i_s_c_r_i_m_i_n_a_t_i_o_n _a_n_a_l_y_s_i_s, _b_y _t_h_e _s_e_a_r_c_h _a_n_d _c_h_o_o_s_e _m_e_t_h_o_d

_D_e_s_c_r_i_p_t_i_o_n:

     Function to search by groups of few genes, also called cliques,
     that can discriminate (or classify) between two distinct
     biological sample types, using the k nearest neighbours method.
     This function uses the search and choose method.

_U_s_a_g_e:

     classifyKNNsc(obj=NULL, sLabelID="Classification", func="wilcox.test",
                   facToClass=NULL, gNameID="GeneName", geneGrp=1, path=NULL,
                   nGenes=3, cliques=100, kn=5)

_A_r_g_u_m_e_n_t_s:

     obj: object of class 'maiges' to search the classifiers.

sLabelID: character string with the identification of the sample label
          to be used.

    func: string specifying the function to be used to search by the
          initial one-dimensional classifiers, like 'wilcox.test' or
          't.test'.

facToClass: named list with 2 character vectors specifying the samples
          to be compared. If NULL (default) the first 2 types of
          sLabelID are used.

 gNameID: character string with the identification of gene label ID.

 geneGrp: character or integer specifying the gene group to be tested
          ('colnames' of 'GeneGrps' slot). If both 'geneGrp' and 'path'
          are NULL all genes are used. Defaults to 1 (first group).

    path: character or integer specifying the gene network to be tested
          ('names' of 'Paths' slot). If both 'geneGrp' and 'path' are
          NULL all genes are used. Defaults to NULL.

  nGenes: integer specifying the number of genes in the clique, or
          classifier.

 cliques: integer specifying the number of cliques or classifiers to be
          generated.

      kn: number of neighbours for the _knn_ method.

_D_e_t_a_i_l_s:

     This function implements the method known as Search and choose
     proposed by Cristo (2003). If you want to use an exhaustive search
     use the function 'classifyKNN'.

     This function uses the function 'knn.cv' from package _class_ to
     construct k-nearest neighbour classifiers. It is possible to use
     the functions 'classifyLDAsc' and 'classifySVMsc' to search by
     classifiers using Fisher's linear discriminant analysis and
     support vector machinnes, respectively.

_V_a_l_u_e:

     The result of this function is an object of class 'maigesClass'.

_A_u_t_h_o_r(_s):

     Elier B. Cristo, adapted by Gustavo H. Esteves
     <gesteves@vision.ime.usp.br>

_R_e_f_e_r_e_n_c_e_s:

     Cristo, E.B. Metodos Estatisticos na Analise de Experimentos de
     Microarray. Masther's thesis, Instituto de Matematica e
     Estatistica - Universidade de Sao Paulo, 2003 (in portuguese).

_S_e_e _A_l_s_o:

     'knn.cv', 'classifyKNN', 'classifyLDAsc' and 'classifySVMsc'.

_E_x_a_m_p_l_e_s:

     ## Loading the dataset
     data(gastro)

     ## Doing KNN classifier with 2 genes for the 6th gene group comparing
     ## the 2 categories from 'Type' sample label.
     gastro.class = classifyKNNsc(gastro.summ, sLabelID="Type",
       gNameID="GeneName", nGenes=2, geneGrp=1, cliques=10)
     gastro.class

     ## To do classifier with 3 genes for the 6th gene group comparing
     ## normal vs adenocarcinomas from 'Tissue' sample label
     gastro.class = classifyKNNsc(gastro.summ, sLabelID="Tissue",
       gNameID="GeneName", nGenes=3, geneGrp=1, cliques=10,
       facToClass=list(Norm=c("Neso","Nest"), Ade=c("Aeso","Aest")))

