estimateGLMTagwiseDisp {edgeR}R Documentation

Estimate Empirical Bayes Tagwise Dispersions for Negative Binomial GLMs

Description

Estimates the dispersion parameter for a DGE dataset for general experimental designs by using Cox-Reid approximate conditional inference for a negative binomial generalized linear model for each transcript (tag) with the unadjusted counts and design matrix provided.

Usage

## S3 method for class 'DGEList'
estimateGLMTagwiseDisp(y, design, offset=NULL, trend=!is.null(y$trended.dispersion), ...)
## Default S3 method:
estimateGLMTagwiseDisp(y, design, offset=NULL, dispersion, trend=TRUE, ...)

Arguments

y

an object that contains the raw counts for each library (the measure of expression level); it can either be a matrix of counts, or a DGEList object with (at least) elements counts (table of unadjusted counts) and samples (data frame containing information about experimental group, library size and normalization factor for the library size)

design

numeric matrix giving the design matrix for the GLM that is to be fit.

trend

logical, should an abundance trend be applied to the grid of dispersion values over which the tagwise dispersion estimation is done? Generally this should be TRUE if a trended dispersion has been estimated and FALSE otherwise.

offset

numeric scalar, vector or matrix giving the offset (in addition to the log of the effective library size) that is to be included in the NB GLM for the transcripts. If a scalar, then this value will be used as an offset for all transcripts and libraries. If a vector, it should be have length equal to the number of libraries, and the same vector of offsets will be used for each transcript. If a matrix, then each library for each transcript can have a unique offset, if desired. Default is NULL; if object is a DGEList and offset is NULL then offset will be calculated automatically from codey$samples.

dispersion

vector or scalar giving the dispersion value(s) to be used to set the grip of points for computation of the tagwise dispersion in dispCoxReidInterpolateTagwise.

...

other arguments are passed to lower-level functions. See dispCoxReidInterpolateTagwise for details.

Details

This generic function is essentially a wrapper for dispCoxReidInterpolateTagwise. To obtain estimates of the tagwise dispersion parameters for negative binomial GLMs we use Cox-Reid approximate conditional inference as implemented in dispCoxReidInterpolateTagwise. The approach is to maximize the adjusted profile likelihood over the dispersion value, for the tagwise models and use these values as the tagwise dispersion parameters for differential signal testing in downstream analysis.

Value

estimateGLMTagwiseDisp.DGEList produces a DGEList object, which contains the tagwise dispersion parameter estimate for each tag for the negative binomial model that maximizes the Cox-Reid adjusted profile likelihood. The tagwise dispersions are simply added to the DGEList object provided as the argument to the function.

estimateGLMTagwiseDisp.default returns a vector of the tagwise dispersion estimates.

Author(s)

Gordon Smyth, Davis McCarthy

References

Cox, DR, and Reid, N (1987). Parameter orthogonality and approximate conditional inference. Journal of the Royal Statistical Society Series B 49, 1-39.

See Also

estimateGLMCommonDisp for common dispersion and estimateGLMTrendedDisp for trended dispersion in the context of a generalized linear model.

estimateCommonDisp for common dispersion or estimateTagwiseDisp for tagwise dispersion in the context of a multiple group experiment (one-way layout).

Examples

y <- matrix(rnbinom(1000,mu=10,size=10),ncol=4)
d <- DGEList(counts=y,group=c(1,1,2,2),lib.size=c(1000:1003))
design <- model.matrix(~group, data=d$samples) # Define the design matrix for the full model
d <- estimateGLMTrendedDisp(d, design, min.n=10)
d <- estimateGLMTagwiseDisp(d, design)
summary(d$tagwise.dispersion)

[Package edgeR version 2.4.3 Index]