estimateCCMErrorRates        package:ppiStats        R Documentation

_E_s_t_i_m_a_t_e _f_a_l_s_e _p_o_s_i_t_i_v_e _a_n_d _f_a_l_s_e _n_e_g_a_t_i_v_e _e_r_r_o_r _p_r_o_b_a_b_i_l_i_t_i_e_s

_D_e_s_c_r_i_p_t_i_o_n:

     Estimate false positive and false negative error probabilities for
     complex comembership edges using a protein complex interactome
     gold standard

_U_s_a_g_e:

     estimateCCMErrorRates(m,GS,filterSystematic=TRUE,       
              obsPropThresh=1,SystematicpThresh=.01)

_A_r_g_u_m_e_n_t_s:

       m: The bait to prey data adjacency matrix. Baits index the rows
          and prey index the columns.

      GS: A gold standard protein complex interaction incidence matrix.
          Proteins index the rows and protein complexes index the
          columns. 

filterSystematic: A logical. If TRUE, all baits with with highly uneven
          directed degree will be filtered out of the data.

obsPropThresh: A numeric between 0 and 1. The proportion of tested
          proteins found within a protein complex needed to keep that
          protein complex within the gold standard set.

SystematicpThresh: A numeric between 0 and 1. The p-value threshold by
          which systematic errors are filtered.

_D_e_t_a_i_l_s:

     The model is described in the manuscript _Estimating node degree
     in bait-prey graphs._ by D. Scholtens et al.

_V_a_l_u_e:

     A list:

globalpTP: A numeric between 0 and 1.  Estimate of pTP.

globalpTPSE: A numeric.  Estimate of standard error of globalpTP
          estimate.

globalpFP: A numeric between 0 and 1.  Estimate of pFP.

 pTP95CI: A vector of length 2.  95 percent confidence interval upper
          and lower bounds for globalpTP estimate.

 pFP95CI: A vector of length 2.  95 percent confidence interval upper
          and lower bounds for globalpFP estimate.

nEligComplexes: A numeric.  Number of complexes from GS that met
          obsPropThresh criteria.

nEligBaits: A numeric. Total number of eligible baits in GS set.

nEligEdges: A numeric.  Total number of eligible edges in GS set.

nBaitsInComplexes: A vector.  Number of baits in each eligible complex.

complexSizes: A vector. Size of each complex in GS set.

_A_u_t_h_o_r(_s):

     T. Chiang and D. Scholtens

_R_e_f_e_r_e_n_c_e_s:

     Scholtens D, Chiang T, Huber W, Gentleman R.  Estimating node
     degree in bait-prey graphs. _Bioinformatics_.  To appear.

_E_x_a_m_p_l_e_s:

     data(Ho2002BPGraph)
     data(ScISIC)
     Ho2002mat = as(Ho2002BPGraph,"matrix")
     estimateCCMErrorRates(Ho2002mat,ScISIC)$globalpTP

