IUPAC_CODE_MAP          package:Biostrings          R Documentation

_T_h_e _I_U_P_A_C _E_x_t_e_n_d_e_d _G_e_n_e_t_i_c _A_l_p_h_a_b_e_t

_D_e_s_c_r_i_p_t_i_o_n:

     The 'IUPAC_CODE_MAP' named character vector contains the mapping
     from the IUPAC nucleotide ambiguity codes to their meaning.

     The 'mergeIUPACLetters' function provides the reverse mapping.

_U_s_a_g_e:

         IUPAC_CODE_MAP
         mergeIUPACLetters(x)

_A_r_g_u_m_e_n_t_s:

       x: A vector of non-empty character strings made of IUPAC
          letters. 

_D_e_t_a_i_l_s:

     IUPAC nucleotide ambiguity codes are used for representing
     sequences of nucleotides where the exact nucleotides that occur at
     some given positions are not known with certainty.

_V_a_l_u_e:

     'IUPAC_CODE_MAP' is a named character vector where the names are
     the IUPAC nucleotide ambiguity codes and the values are their
     corresponding meanings. The meaning of each code is described by a
     string that enumarates the base letters ('"A"', '"C"', '"G"' or
     '"T"') associated with the code.

     The value returned by 'mergeIUPACLetters' is an unnamed character
     vector of the same length as its argument 'x' where each element
     is an IUPAC nucleotide ambiguity code.

_A_u_t_h_o_r(_s):

     H. Pages

_R_e_f_e_r_e_n_c_e_s:

     <URL: http://www.chick.manchester.ac.uk/SiteSeer/IUPAC_codes.html>

     IUPAC-IUB SYMBOLS FOR NUCLEOTIDE NOMENCLATURE: Cornish-Bowden
     (1985) _Nucl. Acids Res._ 13: 3021-3030.

_S_e_e _A_l_s_o:

     'DNAString', 'RNAString'

_E_x_a_m_p_l_e_s:

       IUPAC_CODE_MAP
       some_iupac_codes <- c("R", "M", "G", "N", "V")
       IUPAC_CODE_MAP[some_iupac_codes]
       mergeIUPACLetters(IUPAC_CODE_MAP[some_iupac_codes])

       mergeIUPACLetters(c("Ca", "Acc", "aA", "MAAmC", "gM", "AB", "bS", "mk"))

