controlStatus             package:limma             R Documentation

_S_e_t _S_t_a_t_u_s _o_f _e_a_c_h _S_p_o_t _f_r_o_m _L_i_s_t _o_f _S_p_o_t _T_y_p_e_s

_D_e_s_c_r_i_p_t_i_o_n:

     Determine the type (or status) of each spot in the gene list.

_U_s_a_g_e:

     controlStatus(types, genes, spottypecol="SpotType", regexpcol, verbose=TRUE)

_A_r_g_u_m_e_n_t_s:

   types: dataframe containing spot type specifiers, usually input
          using 'readSpotTypes'

   genes: dataframe containing the microarray gene list, or an
          'RGList', 'MAList' or 'MArrayList' containing 'genes'

spottypecol: integer or name specifying column of 'types' containing
          spot type names

regexpcol: vector of integers or column names specifying columns of
          types containing regular expressions. Defaults to any column
          names in common between 'types' and 'genes'.

 verbose: logical, if 'TRUE' then progess on pattern matching is
          reported to the standard output channel

_D_e_t_a_i_l_s:

     This function constructs a vector of status codes by searching for
     patterns in the gene list. The data frame 'genes' contains gene
     IDs and should have as many rows as there are spots on the
     microarrays. Such a data frame is often read using 'readGAL'. The
     data frame 'types' has as many rows as you want to distinguish
     types of spots in the gene list. This data frame should contain a
     column or columns, the 'regexpcol' columns, which have the same
     names as columns in 'genes' and which contain patterns to match in
     the gene list. Another column, the 'spottypecol', contains the
     names of the spot types. Any other columns are assumed to contain
     plotting parameters, such as colors or symbols, to be associated
     with the spot types.

     The patterns in the 'regexpcol' columns are simplified regular
     expressions. For example, 'AA*' means any string starting with
     'AA', '*AA' means any code ending with 'AA', 'AA' means exactly
     these two letters, '*AA*' means any string containing 'AA', 'AA.'
     means 'AA' followed by exactly one other character and 'AA\.'
     means exactly 'AA' followed by a period and no other characters.
     Any other regular expressions are allowed but the codes '^' for
     beginning of string and '$' for end of string should not be
     included.

     Note that the patterns are matched sequentially from first to
     last, so more general patterns should be included first. For
     example, it is often a good idea to include a default spot-type as
     the first line in 'types' with pattern '*' for all 'regexpcol'
     columns and default plotting parameters.

_V_a_l_u_e:

     Character vector specifying the type (or status) of each spot on
     the array. Attributes contain plotting parameters associated with
     each spot type.

_A_u_t_h_o_r(_s):

     Gordon Smyth

_S_e_e _A_l_s_o:

     An overview of LIMMA functions for reading data is given in
     03.ReadingData.

_E_x_a_m_p_l_e_s:

     genes <- data.frame(ID=c("Control","Control","Control","Control","AA1","AA2","AA3","AA4"),
     Name=c("Ratio 1","Ratio 2","House keeping 1","House keeping 2","Gene 1","Gene 2","Gene 3","Gene 4"))
     types <- data.frame(SpotType=c("Gene","Ratio","Housekeeping"),ID=c("*","Control","Control"),Name=c("*","Ratio*","House keeping*"),col=c("black","red","blue"))
     status <- controlStatus(types,genes)

