Athaliana  package:BSgenome.Athaliana.TAIR.01222004  R Documentation

_A_r_a_b_i_d_o_p_s_i_s _t_h_a_l_i_a_n_a _f_u_l_l _g_e_n_o_m_e (_T_A_I_R _v_e_r_s_i_o_n _f_r_o_m _J_a_n_u_a_r_y _2_2, _2_0_0_4)

_D_e_s_c_r_i_p_t_i_o_n:

     Arabidopsis thaliana full genome as provided by TAIR (snapshot
     from January 22, 2004) and stored in Biostrings objects.

_N_o_t_e:

     This BSgenome data package was made from the following source data
     files:


     ATH1_chr1.1con.01222004
     ATH1_chr2.1con.01222004
     ATH1_chr3.1con.01222004
     ATH1_chr4.1con.01222004
     ATH1_chr5.1con.01222004
     mitochondrial_genomic_sequence
     ATH1_chloroplast.1con.01072002
     from
     ftp://ftp.arabidopsis.org/home/tair/Sequences/whole_chromosomes/OLD/

     See '?BSgenomeForge' and the BSgenomeForge vignette
     ('vignette("BSgenomeForge")') in the BSgenome software package for
     how to make a BSgenome data package.

_A_u_t_h_o_r(_s):

     H. Pages

_S_e_e _A_l_s_o:

     BSgenome-class, DNAString-class, 'available.genomes',
     BSgenomeForge

_E_x_a_m_p_l_e_s:

     Athaliana
     seqlengths(Athaliana)
     Athaliana$chr1  # same as Athaliana[["chr1"]]

     if ("AGAPS" %in% masknames(Athaliana)) {

       ## Check that the assembly gaps contain only Ns:
       checkOnlyNsInGaps <- function(seq)
       {
         ## Replace all masks by the inverted AGAPS mask
         masks(seq) <- gaps(masks(seq)["AGAPS"])
         af <- alphabetFrequency(seq)
         found_letters <- names(af)[af != 0]
         if (any(found_letters != "N"))
             stop("assembly gaps contain more than just Ns")
       }

       ## A message will be printed each time a sequence is removed
       ## from the cache:
       options(verbose=TRUE)

       for (seqname in seqnames(Athaliana)) {
         cat("Checking sequence", seqname, "... ")
         seq <- Athaliana[[seqname]]
         checkOnlyNsInGaps(seq)
         cat("OK\n")
       }
     }

     ## See the GenomeSearching vignette in the BSgenome software
     ## package for some examples of genome-wide motif searching using
     ## Biostrings and the BSgenome data packages:
     if (interactive())
         vignette("GenomeSearching", package="BSgenome")

