hapmapchr22           package:GeneticsBase           R Documentation

_C_h_r_o_m_o_s_o_m_e _2_2 _g_e_n_o_t_y_p_e_s _f_r_o_m _I_n_t_e_r_n_a_t_i_o_n_a_l _H_a_p_M_a_p _p_r_o_j_e_c_t

_D_e_s_c_r_i_p_t_i_o_n:

     The sample data file, hapmapchr22, contains genotypes of hromosome
     22 from the International HapMap project. This file contains
     genotypes from 30 CEPH trios (90 subjects) for all Phase 1 HapMap
     (about 1 SNP per 3kb on average) - see <URL: http://hapmap.org>
     for the original data file, full details of samples, markers and
     methods. Note that the HapMap data files are deidentified and
     freely distributable without restriction.

_U_s_a_g_e:

     data(hapmapchr22)

_F_o_r_m_a_t:

     The format is: chr "hapmapchr22"

_D_e_t_a_i_l_s:

     The file was created by converting chromosome 22 bulk data
     download data  files from the HapMap file repository at  <URL:
     http://hapmap.org/genotypes/2005-06_16c_phaseI/full/non-redundant/>

     Downloaded HapMap files were converted by transposing the layout
     of the data from one row per marker to the pedigree file
     convention of two columns per marker. There are many variants of
     the pedigree format but these files are compatible with the
     popular family based analysis software packages FBAT (<URL:
     http://www.biostat.harvard.edu/~fbat/fbat.htm>) and PBAT (<URL:
     http://www.biostat.harvard.edu/~clange/default.htm>)

     The file has a header row containing only the marker names
     followed by one row per subject.  Data rows always start with 6
     fields - 'family_id', 'individual_id', 'father_id', 'mother_id',
     gender and affection status. Parents have zero for 'mother_id' and
     'father_id'. The remaining columns in each row contain two allele
     codes for each marker. Alleles are coded as $0$ for missing,
     $1=A$, $2=C$, $3=G$ and $4=T$.  All fields in a row are delimited
     by one or more spaces. Note that affection status was arbitrarily
     set to 2 (affected) for children and 1 (unaffected) for adults -
     although in reality, HapMap CEPH subjects were not ascertained for
     any disease.

_S_o_u_r_c_e:

     <URL:
     http://hapmap.org/genotypes/2005-06_16c_phaseI/full/non-redundant/>

_R_e_f_e_r_e_n_c_e_s:

     <URL: http://www.biostat.harvard.edu/~fbat/fbat.htm> <URL:
     http://www.biostat.harvard.edu/~clange/default.htm>

_E_x_a_m_p_l_e_s:

     data(hapmapchr22)

