Genotyping arrays enable the direct measurement of an individuals
genotype at thousands of markers. 'plinkQC' facilitates genotype quality
control for genetic association studies as described by Anderson and
colleagues (2010)
plinkQC is a R/CRAN package for genotype quality control in genetic association studies. It makes PLINK basic statistics (e.g.missing genotyping rates per individual, allele frequencies per genetic marker) and relationship functions easily accessible from within R and allows for automatic evaluation of the results.
Full documentation is available at http://HannahVMeyer.github.io/plinkQC/.
plinkQC generates a per-individual and per-marker quality control report. A step-by-step guide on how to run these analyses can be found here.
Individuals and markers that fail the quality control can subsequently be removed with plinkQC to generate a new, clean dataset.
plinkQC facilitates an ancestry check for study individuals based on comparison to reference datasets. The processing of the reference datasets is documented in detail here.
Removal of individuals based on relationship status via plinkQC is optimised to retain as many individuals as possible in the study.
The current github version of plinkQC is: 0.2.1 and can be installed via
library(devtools)install_github("HannahVMeyer/plinkQC")
The current CRAN version of plinkQC is: 0.2.0 and can be installed via
install.packages("plinkQC")
A log of version changes can be found here.
NEWS.md
file to track changes to the package.