Imputed Stats - imputed-stats
¶
Performs statistical analysis on imputed data (either SKAT analysis, or linear, logistic or survival regression).
Available statistical models¶
Name |
Description |
---|---|
|
Cox’s proportional hazard model (survival regression). |
|
Linear regression (ordinary least squares). |
|
Logistic regression (GLM with binomial distribution). |
|
Linear mixed effect model (random intercept). |
|
SKAT analysis. |
Common options¶
General options¶
Option |
Description |
---|---|
|
Show this help message and exit. |
|
Show program’s version number and exit. |
|
Set the logging level to debug. |
Input files¶
Option |
Description |
---|---|
|
The output from IMPUTE2. |
|
The sample file (the order should be the same as in the IMPUTE2 files). |
|
The file containing phenotypes and co variables. |
|
A list of sites to extract for analysis (optional). |
Output options¶
Option |
Description |
---|---|
|
The prefix for the output files. [ |
General options¶
Option |
Description |
---|---|
|
The number of process to use. [ |
|
The number of line to read at a time.
[ |
|
The analysis is performed for the non
pseudo-autosomal region of the chromosome X
(male dosage will be divided by 2 to get
values [0, 0.5] instead of [0, 1]) (males are
coded as 1 and option ‘ |
|
The name of the gender column (use to exclude
samples with unknown gender (i.e. not 1,
male, or 2, female). If gender not available,
use ‘None’. [ |
Dosage options¶
Option |
Description |
---|---|
|
Scale dosage so that values are in [0, n] (possible
values are 1 (no scaling) or 2). [ |
|
The minimal probability for which a genotype should
be considered. [ |
|
Minor allele frequency threshold for which marker
will be skipped. [ |
Phenotype options¶
Option |
Description |
---|---|
|
The co variable names (in the phenotype file), separated by coma. |
|
The name of the variables that are categorical (note that the gender is always categorical). The variables are separated by coma. |
|
The missing value in the phenotype file. |
|
The name of the sample ID column (in the
phenotype file). [ |
|
Add an interaction between the genotype and this variable. |
Cox’s proportional hazard model options¶
Option |
Description |
---|---|
|
The time to event variable (in the pheno file). |
|
The event variable (1 if observed, 0 if not observed). |
Linear regression options¶
Option |
Description |
---|---|
|
The phenotype. |
Logistic regression options¶
Option |
Description |
---|---|
|
The phenotype. |
Linear mixed effects options¶
Option |
Description |
---|---|
|
The phenotype. |
|
Fit the standard likelihood using maximum likelihood (ML) estimation instead of REML (default is REML). |
|
The p-value threshold for which the real
MixedLM analysis will be performed.
[ |
SKAT options¶
Option |
Description |
---|---|
|
A file indicating a snp_set and an optional weight for every variant. |
|
The variable type for the outcome. This will be passed to SKAT. |
|
By default, the regular SKAT is used. Setting this flag will use the SKAT-O algorithm instead. |
|
The phenotype. |