Mmusculus    package:BSgenome.Mmusculus.UCSC.mm8    R Documentation

_M_u_s _m_u_s_c_u_l_u_s _f_u_l_l _g_e_n_o_m_e (_U_C_S_C _v_e_r_s_i_o_n _m_m_8)

_D_e_s_c_r_i_p_t_i_o_n:

     Mus musculus full genome as provided by UCSC (mm8, Feb. 2006) and
     stored in Biostrings objects.

_N_o_t_e:

     This BSgenome data package was made from the following source data
     files:


     sequences: chromFa.tar.gz, upstream1000.fa.gz, upstream2000.fa.gz,
     upstream5000.fa.gz
     from http://hgdownload.cse.ucsc.edu/goldenPath/mm8/bigZips/
     AGAPS masks: all the chr*_gap.txt.gz files from
     ftp://hgdownload.cse.ucsc.edu/goldenPath/mm8/database/
     RM masks:
     http://hgdownload.cse.ucsc.edu/goldenPath/mm8/bigZips/chromOut.tar.gz
     TRF masks:
     http://hgdownload.cse.ucsc.edu/goldenPath/mm8/bigZips/chromTrf.tar.gz

     See '?BSgenomeForge' and the BSgenomeForge vignette
     ('vignette("BSgenomeForge")') in the BSgenome software package for
     how to make a BSgenome data package.

_A_u_t_h_o_r(_s):

     H. Pages

_S_e_e _A_l_s_o:

     BSgenome-class, DNAString-class, 'available.genomes',
     BSgenomeForge

_E_x_a_m_p_l_e_s:

     Mmusculus
     seqlengths(Mmusculus)
     Mmusculus$chr1  # same as Mmusculus[["chr1"]]

     if ("AGAPS" %in% masknames(Mmusculus)) {

       ## Check that the assembly gaps contain only Ns:
       checkOnlyNsInGaps <- function(seq)
       {
         ## Replace all masks by the inverted AGAPS mask
         masks(seq) <- gaps(masks(seq)["AGAPS"])
         af <- alphabetFrequency(seq)
         found_letters <- names(af)[af != 0]
         if (any(found_letters != "N"))
             stop("assembly gaps contain more than just Ns")
       }

       ## A message will be printed each time a sequence is removed
       ## from the cache:
       options(verbose=TRUE)

       for (seqname in seqnames(Mmusculus)) {
         cat("Checking sequence", seqname, "... ")
         seq <- Mmusculus[[seqname]]
         checkOnlyNsInGaps(seq)
         cat("OK\n")
       }
     }

     ## See the GenomeSearching vignette in the BSgenome software
     ## package for some examples of genome-wide motif searching using
     ## Biostrings and the BSgenome data packages:
     if (interactive())
         vignette("GenomeSearching", package="BSgenome")

