Dmelanogasterpackage:BSgenome.Dmelanogaster.UCSC.dm2R Documentation

_D_r_o_s_o_p_h_i_l_a _m_e_l_a_n_o_g_a_s_t_e_r _f_u_l_l _g_e_n_o_m_e (_U_C_S_C _v_e_r_s_i_o_n _d_m_2)

_D_e_s_c_r_i_p_t_i_o_n:

     Drosophila melanogaster full genome as provided by UCSC (dm2, Apr.
     2004) and stored in Biostrings objects.

_N_o_t_e:

     This BSgenome data package was made from the following source data
     files:


     sequences: chromFa.zip, upstream1000.fa.gz, upstream2000.fa.gz,
     upstream5000.fa.gz
     from http://hgdownload.cse.ucsc.edu/goldenPath/dm2/bigZips/
     masks: all the chr*_gap.txt.gz files from
     ftp://hgdownload.cse.ucsc.edu/goldenPath/dm2/database/
     + chromOut.zip and chromTrf.zip from
     http://hgdownload.cse.ucsc.edu/goldenPath/dm2/bigZips/

     See '?BSgenomeForge' and the BSgenomeForge vignette
     ('vignette("BSgenomeForge")') in the BSgenome software package for
     how to make a BSgenome data package.

_A_u_t_h_o_r(_s):

     H. Pages

_S_e_e _A_l_s_o:

     BSgenome-class, DNAString-class, 'available.genomes',
     BSgenomeForge

_E_x_a_m_p_l_e_s:

     Dmelanogaster
     seqlengths(Dmelanogaster)
     Dmelanogaster$chr2L  # same as Dmelanogaster[["chr2L"]]

     if ("AGAPS" %in% masknames(Dmelanogaster)) {

       ## Check that the assembly gaps contain only Ns:
       checkOnlyNsInGaps <- function(seq)
       {
         ## Replace all masks by the inverted AGAPS mask
         masks(seq) <- gaps(masks(seq)["AGAPS"])
         af <- alphabetFrequency(seq)
         found_letters <- names(af)[af != 0]
         if (any(found_letters != "N"))
             stop("assembly gaps contain more than just Ns")
       }

       ## A message will be printed each time a sequence is removed
       ## from the cache:
       options(verbose=TRUE)

       for (seqname in seqnames(Dmelanogaster)) {
         cat("Checking sequence", seqname, "... ")
         seq <- Dmelanogaster[[seqname]]
         checkOnlyNsInGaps(seq)
         cat("OK\n")
       }
     }

     ## See the GenomeSearching vignette in the BSgenome software
     ## package for some examples of genome-wide motif searching using
     ## Biostrings and the BSgenome data packages:
     if (interactive())
         vignette("GenomeSearching", package="BSgenome")

