Software by Faculty

CaTS

  • Power Calculator for Two Stage Association Studies.
  • Faculty: Goncalo AbecasisDownload: Website.
  • Reference: Skol, A.D., Scott, L.J., Abecasis, G.R. and Boehnke, M., 2006. Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies. Nature genetics, 38(2), p.209.

FUGUE

  • Construct haplotypes for the chromosome 22 and 19 linkage disequilibrium maps.
  • Faculty: Goncalo AbecasisDownload: Website.

GAS

  • Genetic Association Study (GAS) Power Calculator interface that can be used to compute statistical power for large one-stage genetic association studies.
  • Faculty: Goncalo AbecasisDownload: Website.
  • Reference: Johnson, J.L. and Abecasis, G.R., 2017. GAS Power Calculator: web-based power calculator for genetic association studies. bioRxiv, p.164343.

GOLD

  • Graphical Overview of Linkage Disequilibrium.
  • Faculty: Goncalo AbecasisDownload: Website.
  • Reference: Abecasis, G.R. and Cookson, W.O.C., 2000. GOLD—graphical overview of linkage disequilibrium. Bioinformatics, 16(2), pp.182-183.

GRR

  • GRR is a Windows-based application for detecting pedigree errors via graphically inspecting the distribution for marker allele sharing among pairs of family members or all pairs of individuals in a study.
  • Faculty: Goncalo AbecasisDownload: Website.

LAMP

  • LAMP is our software for Linkage and Association Modeling in Pedigrees.
  • Faculty: Goncalo AbecasisDownload: Website.
  • Reference: Li, M., Boehnke, M. and Abecasis, G.R., 2005. Joint modeling of linkage and association: identifying SNPs responsible for a linkage signal. The American Journal of Human Genetics, 76(6), pp.934-949.

MACH 1.0

  • MACH 1.0 is a Markov Chain based haplotyper that can resolve long haplotypes or infer missing genotypes in samples of unrelated individuals.
  • Faculty: Goncalo Abecasis. Download: Website.

Merlin

  • Fast pedigree analyses, including non-parametric linkage, error detection and haplotyping.
  • Faculty: Goncalo AbecasisDownload: Website.
  • Reference: Abecasis, G.R., Cherny, S.S., Cookson, W.O. and Cardon, L.R., 2001. Merlin—rapid analysis of dense genetic maps using sparse gene flow trees. Nature genetics, 30(1), p.97.

Metal

  • METAL software is designed to facilitate meta-analysis of large datasets (such as several whole genome scans) in a convenient, rapid and memory efficient manner.
  • Faculty: Goncalo AbecasisDownload: Website.
  • Reference: Willer, C.J., Li, Y. and Abecasis, G.R., 2010. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics, 26(17), pp.2190-2191.

PEDSTATS

  • PEDSTATS is a handy tool for quick validation and summary of any pair of pedigree (.ped) and data (.dat) files.
  • Faculty: Goncalo AbecasisDownload: Website.

PSEUDO


QTDT

  • Linkage Disequilibrium Analyses for Quantitative and Discrete Traits.
  • Faculty: Goncalo AbecasisDownload: Website.
  • Reference: Abecasis, G.R., Cardon, L.R. and Cookson, W.O.C., 2000. A general test of association for quantitative traits in nuclear families. The American Journal of Human Genetics, 66(1), pp.279-292.

SNP-HWE

  • Fast exact Hardy-Weinberg Equilibrium test for SNPs as described in Wigginton, et al. (2005).
  • Faculty: Goncalo AbecasisDownload: Website.
  • Reference: Wigginton, J.E., Cutler, D.J. and Abecasis, G.R., 2005. A note on exact tests of Hardy-Weinberg equilibrium. The American Journal of Human Genetics, 76(5), pp.887-893.

BEHAVIOUR


iBAG


MMCR


PRECISE

  • Proteomic based integrated subject-specific networks in cancer.
  • Faculty: Veera Baladandayuthapani. Download: Github, Website.
  • Reference: Ha, M.J., Banerjee, S., Akbani, R., Liang, H., Mills, G.B., Do, K.A. and Baladandayuthapani, V., 2018. Personalized Integrated Network Modeling of the Cancer Proteome Atlas. Scientific reports, 8(1), p.14924. <doi:10.1038/s41598-018-32682-x>.

 

 CaTS

  • Power Calculator for Two Stage Association Studies.
  • Faculty: Michael BoehnkeGoncalo AbecasisDownload: Website.
  • Reference: Skol, A.D., Scott, L.J., Abecasis, G.R. and Boehnke, M., 2006. Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies. Nature genetics, 38(2), p.209.

FTEC

  • Coalescent simulation program capable of modeling samples drawn from a population which has undergone faster than exponential growth.
  • Faculty: Michael BoehnkeDownload: Website.

RELPAIR

  • RELPAIR 2.0.1 is a FORTRAN 77 program that infers the relationships of pairs of individuals based on genetic marker data, either within families or across an entire sample.
  • Faculty: Michael BoehnkeDownload: Website.
  • Reference: Epstein MP, Duren WL and Boehnke M (2000) Improved inference of relationships for pairs of individuals. American Journal of Human Genetics 67:1219-1231.

RHMAP

  • RHMAP 3.0 (updated September 1996) is a statistical package for radiation hybrid mapping.
  • Faculty: Michael BoehnkeDownload: Website.
  • Reference: Boehnke M, Lunetta K, Hauser E, Lange K, Uro J, and VanderStoep J. RHMAP: Statistical Package for Multipoint Radiation Version 3.0, September 1996.

SIBMED

  • SIBMED 1.0 is a FORTRAN 77 program that identifies likely genotyping errors and mutations for a sib pair in the context of multipoint mapping.
  • Faculty: Michael Boehnke. Download: Website.
  • Reference: Douglas J.A. and Boehnke M. SIBMED: A Program that Identifies Likely Genotyping Errors and Mutations for a Sib Pair in the Context of Multipoint Mapping Version 1.0, April 18, 2000.

SIMLINK

  • SIMLINK 4.12 (updated April 1997) is a program for estimating the power of a proposed linkage study by computer simulation.
  • Faculty: Michael Boehnke. Download: Website.

verifyBamID

  • Verifies whether the reads in particular file match previously known genotypes for an individual (or group of individuals), and checks whether the reads are contaminated as a mixture of two samples.
  • Faculty: Michael Boehnke, Hyun Min Kang. Download: GithubWebsite.
  • Reference: G. Jun, M. Flickinger, K. N. Hetrick, Kurt, J. M. Romm, K. F. Doheny, G. Abecasis, M. Boehnke,and H. M. Kang, Detecting and Estimating Contamination of Human DNA Samples in Sequencing and Array-Based Genotype Data, American journal of human genetics doi:10.1016/j.ajhg.2012.09.004 (volume 91 issue 5 pp.839 - 848).

WINNER

  • WINNER 1.1 (updated Feb 2009) is a program for correcting the winner's curse effect in genetic associations studies.
  • Faculty: Michael BoehnkeDownload: Website.
  • Reference: Rui Xiao and Michael Boehnke 2009. Quantifying and Correcting in Genetic Association Studies. Genetic Epidemiology 33:453-462.

adaptBayes

  • This package contains R functions implementing the adaptive priors described in Boonstra and Barbaro (2018).
  • Language(s): R
  • Faculty: Philip S. Boonstra. Download: Github.
  • Reference: Boonstra, Philip S. and Barbaro, Ryan P., "Incorporating Historical Models with Adaptive Bayesian Updates" (2018) Biostatistics https://doi.org/10.1093/biostatistics/kxy053

RankModeling

  • Penalized multistage models for ordered data.
  • Language(s): R
  • Faculty: Philip S. Boonstra. Download: Github.
  • Reference: Boonstra, Philip S. and Krauss, John C., "Inferring a consensus problem list using penalized multistage models for ordered data" (October 2019) The University of Michigan Department of Biostatistics Working Paper Series. Working Paper 126.

 

 

IVEware

  • Imputations of missing values using the Sequential Regression (also known as Chained Equations) Method. Multiple imputation analyses for both descriptive and model-based analysis. Analysis that accounts for complex design features, weighting, clustering and stratification.
  • Faculty: Trivellore Raghunathan, Roderick Little, Michael Elliott. Download: Website.

lcra

  • A user-friendly interface for doing joint Bayesian latent class and regression analysis with binary and continuous outcomes.
  • Language(s): R
  • Faculty: Michael Elliott. Download: Github.
  • Reference: “Methods to account for uncertainty in latent class assignments when using latent classes as predictors in regression models, with application to acculturation strategy measures” (2020) In press at Epidemiology. doi:10.1097/EDE.0000000000001139

 PRSweb


 MultiRobust

  • Multiply robust estimation for population mean, regression analysis, and quantile regression.
  • Faculty: Peisong HanDownload: CRAN.
  • Reference: Multiply robust estimation for population mean (Han and Wang 2013) <doi:10.1093/biomet/ass087>, regression analysis (Han 2014) <doi:10.1080/01621459.2014.880058> (Han 2016) <doi:10.1111/sjos.12177> and quantile regression (Han et al. 2019) <doi:10.1111/rssb.12309>.

adalasso

  • A computationally efficient adaptive penalization algorithm for high-dimensional survival analysis.
  • Faculty: Kevin He. Download: Github.

FEprovideR

  • A structured profile likelihood algorithm for the logistic fixed effects model and an approximate expectation maximization (EM) algorithm for the logistic mixed effects model.
  • Faculty: Kevin He, Jack D. Kalbfleisch, Yi Li. Download: Github, CRAN.

screening

  • Covariance-insured screening.
  • Faculty: Kevin He, Yanming Li, Yi LiDownload: Website.
  • Reference: He, K., Kang, J., Hong, H.G., Zhu, J., Li, Y., Lin, H., Xu, H. and Li, Y., 2018. Covariance-insured screening. arXiv preprint arXiv:1805.06595.

SurvBoost

  • A new gradient boosting method for high-dimensional variable selection with censored outcomes using the stratified proportional hazards (PH) model.
  • Faculty: Kevin He, Yanming Li, Yi Li, Jian Kang. Download: Github, CRAN.

daarem

  • Implements the DAAREM method for accelerating the convergence of slow, monotone sequences from smooth, fixed-point iterations such as the EM algorithm.
  • Faculty: Nicholas Henderson. Download: Github, CRAN.

CisGenome Browser

  • A flexible stand-alone tool for genomic data visualization.
  • Faculty: Hui JiangDownload: Website.
  • Reference: Jiang, H., Wang, F., Dyer, N.P., Wong, W.H. (2010) CisGenome Browser: A Flexible Tool For Genomic Data Visualization, Bioinformatics, 26 (14).

CisGenome

  • An integrated tool for tiling array, genome and cis-regulatory element analysis, working together with CisGenome Browser. 
  • Faculty: Hui JiangDownload: Website.
  • Reference: Hongkai Ji, Hui Jiang, Wenxiu Ma, David S. Johnson, Richard M. Myers and Wing H. Wong (2008) An integrated software system for analyzing ChIP-chip and ChIP-seq data. Nature Biotechnology, 26: 1293-1300. doi:10.1038/nbt.1505.

fast-opt

  • Package for fast computation of the Optional Polya Tree (OPT).
  • Faculty: Hui JiangDownload: Website.

Glmnet for MATLAB

  • A matlab wrapper for glmnet, a solver for fitting Lasso (L1) and elastic-net regularized generalized linear models.
  • Faculty: Hui Jiang. Download: Website.

mseq

  • An R package for modeling non-uniformity in short-read rates in RNA-Seq data.
  • Faculty: Hui JiangDownload: CRAN Archive.

rSeqNP

  • A non-parametric approach for detecting differential expression and splicing from RNA-Seq data.
  • Faculty: Hui Jiang. Download: Website.
  • Reference: Shi, Y., Chinnaiyan, A. M., Jiang, H. (2015) rSeqNP: A non-parametric approach for detecting differential ex-pression and splicing from RNA-Seq data Bioinformatics, in press.

rSeqDiff

  • Detecting differential isoform expression from RNA-seq data.
  • Faculty: Hui JiangDownload: Website.
  • Reference: Shi, Y., Jiang, H. (2013). rSeqDiff: Detecting differential isoform expression from RNA-Seq data using hierarchical likelihood ratio test, PLoS One, 8 (11): e79448.

rSeq

  • rSeq is a set of tools for RNA-Seq data analysis. It consists of programs that deal with many aspects of RNA-Seq data analysis, such as read quality assessment, reference sequence generation, sequence mapping, gene and isoform expressions (RPKMs) estimation, etc.
  • Faculty: Hui JiangDownload: Website.
  • References: [1] Jiang, H., Wong, W.H. (2009) Statistical Inferences for Isoform Expression in RNA-Seq, Bioinformatics, 25(8), 1026–1032. [2] Salzman, J., Jiang, H., Wong, W. H. (2011) Statistical Modeling of RNA-Seq Data, Statistical Science, 26 (1): 62-83.

SeqAlto

  • Fast and accurate read alignment for resequencing.
  • Faculty: Hui JiangDownload: Website.
  • References: John C. Mu, Hui Jiang, Amirhossein Kiani, Marghoob Mohiyuddin, Narges Bani Asadi and Wing H. Wong, Fast and Accurate Read Alignment for Resequencing, Bioinformatics, 2012.

SeqMap

  • A tool for mapping millions of short sequences to the genome.
  • Faculty: Hui Jiang. Download: Website.
  • References: Jiang, H., Wong, W.H. (2008) SeqMap: Mapping Massive Amount of Oligonucleotides to the Genome, Bioinformatics, 24(20).

SpliceMap

  • SpliceMap is a de novo splice junction discovery and alignment tool. It offers high sensitivity and support for arbitrary RNA-seq read lengths.
  • Faculty: Hui JiangDownload: Website.
  • Reference: Kin Fai Au, Hui Jiang, Lan Lin, Yi Xing, and Wing Hung Wong. Detection of splice junctions from paired-end RNA-seq data by SpliceMap. Nucleic Acids Research, Advance access published on April 5, 2010.

 

FEprovideR

  • A structured profile likelihood algorithm for the logistic fixed effects model and an approximate expectation maximization (EM) algorithm for the logistic mixed effects model.
  • Faculty: Kevin He, Jack D. Kalbfleisch, Yi Li. Download: Github, CRAN.

CoxClusterProcess

  • Bayesian meta-analysis of functional neuroimaging data.
  • Faculty: Jian KangDownload: Website.

GeneNetwork

  • Gene sub-network analysis via Bayesian nonparametric methods.
  • Faculty: Jian KangDownload: Website.

GeoCopula

  • Unified modeling framework for analysis of spatial-clustered continuous and binary data.
  • Faculty: Jian KangPeter X.K. Song. Download: Website.
  • Reference: Bai, Y., Kang, J., & Song, P.X.K. (2014). Efficient pairwise composite likelihood estimation for spatial‐clustered data. Biometrics, 70(3), 661-670.

Poisson Graphical Model

  • Uses the EM algorithm to find the point estimates of the intensity parameters for the Poisson Graphical Model.
  • Faculty: Jian KangDownload: Website.
  • Reference: References: Xue, W., Kang, J., Bowman F.D., Wager, T.D., Guo, J. (2014) Identifying Functional Co-activation Patterns in Neuroimaging Studies via Poisson Graphical Models, Biometrics , In press.

ReverseInference

  • A Bayesian spatial point pattern classifier.
  • Faculty: Jian KangDownload: Website.

STGP

  • This package focus on spatial variable selection for scalar-on-image regression. It uses a new class of Bayesian nonparametric models, soft-thresholded Gaussian processes and the developed efficient posterior computation algorithms.
  • Faculty: Jian KangDownload: Website.
  • References: Kang, J., Reich, B.J. and Staicu, A.M., 2018. Scalar-on-image regression via the soft-thresholded Gaussian process. Biometrika, 105(1), pp.165-184.

SurvBoost

  • A new gradient boosting method for high-dimensional variable selection with censored outcomes using the stratified proportional hazards (PH) model.
  • Faculty: Kevin He, Yanming Li, Yi Li, Jian Kang. Download: Github, CRAN.

TGLG

  • This package implements a novel prior model for Bayesian network marker selection in the generalized linear model (GLM) framework.
  • Faculty: Jian KangDownload: Website.
  • References: Cai, Q., Kang, J. and Yu, T., 2018. Bayesian network marker selection via the thresholded graph Laplacian Gaussian prior. Bayesian Analysis.

apigenome

  • Libraries and command-line utilities for big data genomic analysis.
  • Faculty: Hyun Min KangDownload: Github.

cleancall


cramore

  • A collection of C++ tools to manipulate SAM/BAM/CRAM and BCF/VCF files in various contexts of sequence analysis.
  • Faculty: Hyun Min Kang. Download: Github.

demuxlet

  • Genetic multiplexing of barcoded single cell RNA-seq.
  • Faculty: Hyun Min Kang. Download: Github.
  • Reference: Kang, H.M., Subramaniam, M., Targ, S., Nguyen, M., Maliskova, L., McCarthy, E., Wan, E., Wong, S., Byrnes, L., Lanata, C.M. and Gate, R.E., 2018. Multiplexed droplet single-cell RNA-sequencing using natural genetic variation. Nature biotechnology, 36(1), p.89.

EPACTS

  • Efficient and Parallelizable Association Container Toolbox. Perform various statistical tests for identifying genome-wide association from sequence data through a user-friendly interface.
  • Faculty: Hyun Min Kang. Download: Github.

EMMA

  • Statistical test for model organisms association mapping correcting for the confounding from population structure and genetic relatedness.
  • Faculty: Hyun Min Kang. Download: Website.
  • Reference: Kang, H.M., Zaitlen, N.A., Wade, C.M., Kirby, A., Heckerman, D., Daly, M.J. and Eskin, E., 2008. Efficient control of population structure in model organism association mapping. Genetics, 178(3), pp.1709-1723.

EMMAX

  • Statistical test for large scale human or model organism association mapping accounting for the sample structure.
  • Faculty: Hyun Min Kang. Download: Wiki.
  • Reference: Kang HM, Sul JH, Service SK, Zaitlen NA, Kong SY, Freimer NB, Sabatti C, Eskin E. (2010) Variance component model to account for sample structure in genome-wide association studies. Nat. Genet. 42:348-54.

GotCloud

  • Genomes on the Cloud, Mapping & Variant Calling Pipelines.
  • Faculty: Hyun Min Kang. Download: Github, Wiki.
  • Reference: Jun, Goo, et al. "An efficient and scalable analysis framework for variant extraction and refinement from population scale DNA sequence data." Genome research (2015): gr-176552.

popscle (freemuxlet)

  • A suite of population scale analysis tools for single-cell genomics data including implementation of Demuxlet/Freemuxlet methods and auxilary tools.
  • Faculty: Hyun Min Kang. Download: Github.

RUTH


topmed_variant_calling

  • A collection of software tools used for producing TOPMed variant calls and genotypes with a comprehensive documentation that allows investigators to understand the methods and reproduce the variant calls from the same set of aligned sequence reads.
  • Faculty: Hyun Min Kang. Download: Github.

verifyBamID

  • Verifies whether the reads in particular file match previously known genotypes for an individual (or group of individuals), and checks whether the reads are contaminated as a mixture of two samples.
  • Faculty: Michael Boehnke, Hyun Min Kang. Download: GithubWebsite.
  • Reference: G. Jun, M. Flickinger, K. N. Hetrick, Kurt, J. M. Romm, K. F. Doheny, G. Abecasis, M. Boehnke,and H. M. Kang, Detecting and Estimating Contamination of Human DNA Samples in Sequencing and Array-Based Genotype Data, American journal of human genetics doi:10.1016/j.ajhg.2012.09.004 (volume 91 issue 5 pp.839 - 848).

verifyBamID2

  • A robust tool for DNA contamination estimation from sequence reads using ancestry-agnostic method.
  • Faculty: Hyun Min Kang. Download: Github.
  • Reference: Zhang F., Flickinger M., InPSYght Psychiatric Genetics Consortium, Abecasis G., Boehnke M., Kang H.M.(8 November 2018)."Ancestry-agnostic estimation of DNA sample contamination from sequence reads".bioRxiv 466268; doi: https://doi.org/10.1101/466268.

VT

  • A tool set for short variant discovery in genetic sequence data.
  • Faculty: Hyun Min Kang. Download: Github.
  • Reference: Adrian Tan, Gonçalo R. Abecasis and Hyun Min Kang. Unified Representation of Genetic Variants. Bioinformatics (2015) 31(13): 2202-2204.

SMART Sample Size Calculator

  • Sample size calculator applet for SMART studies.
  • Faculty: Kelley Kidwell. Download: Shiny Application.
  • Reference: Oetting, A., Levy, J., Weiss, R. and Murphy, S. (2007), "Statistical methodology for a SMART design in the development of adaptive treatment strategies ," in Causality and Psychopathology: Finding the Determinants of Disorders and their Cures (American Psychopathological Association), Arlington, VA: American Psychiatric Publishing, Inc., pp. 179-205.

snSMART 

  • Small n Sequential, Multiple Assignment, Randomized Trial (snSMART) calculation applet.
  • Faculty: Kelley KidwellDownload: Website.
  • Reference: Wei, B., Braun, T.M., Tamura, R.N. and Kidwell, K.M., 2018. A Bayesian analysis of small n sequential multiple assignment randomized trials (snSMARTs). Statistics in medicine, 37(26), pp.3723-3732.

LGEWIS

  • Functions for genome-wide association studies (GWAS)/gene-environment-wide interaction studies (GEWIS) with longitudinal outcomes and exposures.
  • Faculty: Seunggeun Shawn LeeBhramar Mukherjee, Min Zhang. Download: CRAN.
  • References: He et al. (2017) "Set-Based Tests for Gene-Environment Interaction in Longitudinal Studies" and He et al. (2017) "Rare-variant association tests in longitudinal studies, with an application to the Multi-Ethnic Study of Atherosclerosis (MESA)".

Lodi

  • Impute observed values below the limit of detection (LOD) via censored likelihood multiple imputation (CLMI) in single-pollutant models, developed by Boss et al (2019) <doi:10.1097/EDE.0000000000001052>.
  • Faculty: Seunggeun Shawn Lee, Bhramar Mukherjee, Min Zhang. Download: CRAN.
  • References: Boss, J., Mukherjee, B., Ferguson, K.K., Aker, A., Alshawabkeh, A.N., Cordero, J.F., Meeker, J.D. and Kim, S., 2019. Estimating outcome-exposure associations when exposure biomarker detection limits vary across batches. Epidemiology, 30(5), pp.746-755.

SAIGE

  • SAIGE is an R-package for testing for associations between genetic variants and binary phenotypes with adjusting for sample relatedness and case-control imbalance.
  • Faculty: Seunggeun Shawn Lee. Download: Website, Github.

SKAT

  • SKAT is an R-package for rare variant association analysis. It can carry out burden test, SKAT, SKAT-O, and combined test of common and rare variants with adjusting for covariates and kinship. For binary traits, it can calculate p-values using resampling and asymptotic based adjustment methods. It also has functions for sample size and power calculations.
  • Faculty: Seunggeun Shawn Lee. Download: Website, GithubCRAN.

MetaSKAT

  • MetaSKAT is an R package for gene-based meta-analysis across studies. It can carry out a meta-analysis of SKAT, SKAT-O and burden tests with individual-level genotype data or gene-level summary statistics.
  • Faculty: Seunggeun Shawn Lee. Download: Website, GithubCRAN.

iECAT

  • iECAT is an R-package to test for single variant and gene/region-based associations using external control samples.
  • Faculty: Seunggeun Shawn Lee. Download: Website, GithubCRAN.

SPAtest

  • SPAtest is an R-package to perform score test for associations between genetic variants and binary traits using saddlepoint approximation. The methods implemented in the package (FastSPA) can accurately calculate p-values even when the case-control ratio is extremely unbalanced. 
  • Faculty: Seunggeun Shawn Lee. Download: WebsiteCRAN.

JointScoreTest

  • JointScoreTest is an R-package to perform a joint test of fixed and random effects in the Generalized linear mixed model framework.
  • Faculty: Seunggeun Shawn Lee. Download: Website.

dSVA


TransMeta & TransMetaRare

  • TransMeta is an R-package to compute single SNP p-values of trans-ethnic meta-analysis using a kernel-based random effect model. This is an early version, and we will keep updating it. We have recently extended it to gene-based rare-variant test (Transmeta-rare). The packages can be downloaded from the following github.
  • Faculty: Seunggeun Shawn Lee. Download: Website, TransMetaRare Github.

EigenCorr

  • EigenCorr is an R-package to compute p-values of principal components (PCs) based on EigenCorr1, EigenCorr2 and Tracy-Widom methods. You need PCs, outcome phenotypes and all eigenvalues to run EigenCorr.
  • Faculty: Seunggeun Shawn Lee. Download: Website.

clikcorr

  • A profile likelihood based method of estimation and inference on the correlation coefficient of bivariate data with different types of censoring and missingness.
  • Faculty: Yanming Li.  Download: CRAN.

MSGLasso

  • Fit multivariate response and multiple predictor linear regression with an arbitrary group structure assigned on the regression coefficients matrix, using the multivariate sparse group lasso and the mixed coordinate descent algorithm.
  • Faculty: Yanming Li. Download: CRAN.

mLDA

  • The mLDA package implements the multi-class linear discriminant analysis method for classifications with ultrahigh-dimensional data. The method can select both marginally and jointly informative features that are informative for classifications.
  • Faculty: Yanming Li. Download: Github.
  • Reference: Li, Yanming and Hong, Hyokyoung and Li, Yi (2018) Multiclass Linear Discriminant Analysis with Ultrahigh-Dimensional Features. Under revision.

SurvBoost

  • A new gradient boosting method for high-dimensional variable selection with censored outcomes using the stratified proportional hazards (PH) model.
  • Faculty: Kevin He, Yanming Li, Yi Li, Jian Kang. Download: Github, CRAN.

FEprovideR

  • A structured profile likelihood algorithm for the logistic fixed effects model and an approximate expectation maximization (EM) algorithm for the logistic mixed effects model.
  • Faculty: Kevin He, Jack D. Kalbfleisch, Yi Li. Download: Github, CRAN.

plac

  • A semi-parametric estimation method for the Cox model with left-truncated data using augmented information from the marginal of truncation times.
  • Faculty: Yi Li. Download: CRAN.
  • Reference: Wu, F., Kim, S., Qin, J., Saran, R. and Li, Y., 2018. A pairwise likelihood augmented Cox estimator for left‐truncated data. Biometrics, 74(1), pp.100-108.

screening

  • Covariance-insured screening.
  • Faculty: Kevin He, Yanming Li, Yi LiDownload: Website.
  • Reference: He, K., Kang, J., Hong, H.G., Zhu, J., Li, Y., Lin, H., Xu, H. and Li, Y., 2018. Covariance-insured screening. arXiv preprint arXiv:1805.06595.

SPARES

  • Estimation and inference for high-dimensional linear models.
  • Faculty: Yi LiDownload: Github.
  • Reference: Fei, Z., Zhu, J., Banerjee, M. and Li, Y., 2018. Drawing inferences for high‐dimensional linear models: A selection‐assisted partial regression and smoothing approach. Biometrics.

SurvBoost

  • A new gradient boosting method for high-dimensional variable selection with censored outcomes using the stratified proportional hazards (PH) model.
  • Faculty: Kevin He, Yanming Li, Yi Li, Jian Kang. Download: Github, CRAN.

IVEware

  • Imputations of missing values using the Sequential Regression (also known as Chained Equations) Method. Multiple imputation analyses for both descriptive and model-based analysis. Analysis that accounts for complex design features, weighting, clustering and stratification.
  • Faculty: Trivellore Raghunathan, Roderick Little, Michael Elliott. Download: Website.

Bama

  • Mediation analysis in the presence of high-dimensional mediators based on the potential outcome framework. Bayesian Mediation Analysis (BAMA), developed by Song et al (2018) <doi:10.1101/467399>.
  • Faculty: Bhramar Mukherjee, Min Zhang, Xiang ZhouDownload: CRAN.
  • Song, Y., Zhou, X., Zhang, M., Zhao, W., Liu, Y., Kardia, S., Roux, A.D., Needham, B., Smith, J.A. and Mukherjee, B., 2018. Bayesian Shrinkage Estimation of High Dimensional Causal Mediation Effects in Omics Studies. bioRxiv, p.467399.

gigg

  • This package implements a Gibbs sampler corresponding to a Group Inverse-Gamma Gamma (GIGG) regression model with adjustment covariates. Hyperparameters in the GIGG prior specification can either be fixed by the user or can be estimated via Marginal Maximum Likelihood Estimation. <arXiv:2102.10670>.
  • Faculty: Bhramar Mukherjee. Download: CRAN.
  • Boss, Jonathan, et al. "Group Inverse-Gamma Gamma Shrinkage for Sparse Regression with Block-Correlated Predictors." arXiv preprint arXiv:2102.10670 (2021).

higlasso

  • Hierarchical integrative group least absolute shrinkage
    and selection operator (HiGLASSO), developed by Boss et al (2020)
    <arXiv:2003.12844>, is a general framework to identify noteworthy nonlinear main and interaction effects in the presence of group structures among a set of exposures.

  • Faculty: Bhramar Mukherjee. Download: CRAN.
  • Boss, J., Rix, A., Chen, Y.H., Narisetty, N.N., Wu, Z., Ferguson, K.K., McElrath, T.F., Meeker, J.D. and Mukherjee, B., 2020. A hierarchical integrative group lasso (higlasso) framework for analyzing environmental mixtures. arXiv preprint arXiv:2003.12844.

LGEWIS

  • Functions for genome-wide association studies (GWAS)/gene-environment-wide interaction studies (GEWIS) with longitudinal outcomes and exposures.
  • Faculty: Seunggeun Shawn Lee, Bhramar Mukherjee, Min Zhang. Download: CRAN.
  • References: He et al. (2017) "Set-Based Tests for Gene-Environment Interaction in Longitudinal Studies" and He et al. (2017) "Rare-variant association tests in longitudinal studies, with an application to the Multi-Ethnic Study of Atherosclerosis (MESA)". 

Lodi

  • Impute observed values below the limit of detection (LOD) via censored likelihood multiple imputation (CLMI) in single-pollutant models, developed by Boss et al (2019) <doi:10.1097/EDE.0000000000001052>.
  • Faculty: Seunggeun Shawn Lee, Bhramar Mukherjee, Min Zhang. Download: CRAN.
  • References: Boss, J., Mukherjee, B., Ferguson, K.K., Aker, A., Alshawabkeh, A.N., Cordero, J.F., Meeker, J.D. and Kim, S., 2019. Estimating outcome-exposure associations when exposure biomarker detection limits vary across batches. Epidemiology, 30(5), pp.746-755.

MetaIntegration

  • An ensemble meta-inference framework to integrate multiple regression models into a current study. Gu, T., Taylor, J.M.G. and Mukherjee, B. (2021) <arXiv:2010.09971>.
  • Faculty: Bhramar Mukherjee. Download: CRAN.
  • References: Du, Jiacong, et al. "Variable selection with multiply-imputed datasets: choosing between stacked and grouped methods." arXiv preprint arXiv:2003.07398 (2020).

miselect

  • Variable selection with multiply-imputed datasets: choosing between stacked and grouped methods. Presents Stacked Adaptive Elastic Net (saenet) and Grouped Adaptive LASSO (galasso) for continuous and binary outcomes.
  • Faculty: Bhramar Mukherjee. Download: CRAN.
  • References: Gu, T., Taylor, J. M., & Mukherjee, B. (2020). A meta-inference framework to integrate multiple external models into a current study. arXiv preprint arXiv:2010.09971.
    Chicago

PRSweb


SAMBA

  • Misclassification of EHR (Electronic Health Record)-derived disease status and lack of representativeness of the study sample can result in substantial bias in effect estimates and can impact power and type I error for association tests. 'SAMBA' implements several methods for obtaining bias-corrected point estimates along with valid standard errors as proposed in Beesley and Mukherjee (2020) <doi:10.1101/2019.12.26.19015859>, currently under review.
  • Faculty: Bhramar Mukherjee. Download: CRAN, Github.
  • References: Beesley, L.J. and Mukherjee, B., 2019. Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification. medRxiv.

SEIRfansy

  • Extended Susceptible-Exposed-Infected-Recovery Model for handling high false negative rate and symptom based administration of diagnostic tests. <doi:10.1101/2020.09.24.20200238>.

  • Faculty: Bhramar Mukherjee. Download: CRAN, Github.
  • References: Bhaduri, R., Kundu, R., Purkayastha, S., Kleinsasser, M., Beesley, L. J., & Mukherjee, B. (2020). Extending the susceptible-exposed-infected-removed (SEIR) model to handle the high false negative rate and symptom-based administration of Covid-19 diagnostic tests: SEIR-fansy. Medrxiv.

subgxe

  • R package that implements p-value assisted subset testing for association (pASTA), a method developed by Yu et al. (2019) <doi:10.1159/000496867>.
  • Faculty: Bhramar Mukherjee, Xiang Zhou, Seunggeun Shawn Lee. Download: CRAN.
  • References: Yu, Y., Xia, L., Lee, S., Zhou, X., Stringham, H.M., Boehnke, M. and Mukherjee, B., 2018. Subset-Based Analysis using Gene-Environment Interactions for Discovery of Genetic Associations across Multiple Studies or Phenotypes. Human heredity, 83(6), pp.283-314.

corrsurv

  • Collection of two-sample tests for treatment effects with paired censored survival data and recurrent events survival data.
  • Faculty: Susan Murray. Download: Github.
  • References: 1. Murray, Susan. Nonparametric Rank-Based Methods for Group Sequential Monitoring of Paired Censored Survival Data. 2000. Biometrics, 56, pp. 984-990. 2. Tayob, N. and Murray, S., 2014. Nonparametric tests of treatment effect based on combined endpoints for mortality and recurrent events. Biostatistics, 16(1), pp.73-83.

IVEware

  • Imputations of missing values using the Sequential Regression (also known as Chained Equations) Method. Multiple imputation analyses for both descriptive and model-based analysis. Analysis that accounts for complex design features, weighting, clustering and stratification.
  • Faculty: Trivellore Raghunathan, Roderick Little, Michael Elliott. Download: Website.

metaboplot

  • Shiny interface for exploring metabolite plots based on attributes.
  • Faculty: Laura Scott. Download: Github.

accelerometer


BivPPL

  • Bivariate frailty models for clustered events via penalized partial likelihood methods.
  • Language(s): R
  • Faculty: Peter X.K. Song. Author: Lili Wang.
  • Download: Github.

coxphGPLE

  • Fit cox model with multiple functional covariate-environment interactions, where covariate effects can be modified nonlinearly by mixtures of exposed toxicants.
  • Faculty: Peter X.K. Song. Download: Website.

eSIR

  • Extended state-space SIR epidemiological models.
  • Language(s): R
  • Faculty: Peter X.K. Song. Author: Lili Wang.
  • Download: Github.

FLAPO

  • Fused lasso with the adaptation of parameter ordering in combining multiple studies with repeated measures.
  • Faculty: Peter X.K. Song. Download: Website.
  • Reference: Wang, F., Wang, L., & Song, P.X.K. (2016). Fused lasso with the adaptation of parameter ordering in combining multiple studies with repeated measurements. Biometrics, DOI: 10.1111/biom.12496.

GDEP

  • Gene network construction based on time course microarray data.
  • Faculty: Peter X.K. Song. Download: Website.
  • Reference: Gao, X., Pu, DQ., & Song, P.X.K. (2009). Transition dependency: a gene-gene interactionmeasure for times seriesmicroarray data. EURASIP Journal on Bioinformatics and Systems Biology, 2009, 2.

GeoCopula

  • Unified modeling framework for analysis of spatial-clustered continuous and binary data.
  • Faculty: Jian KangPeter X.K. Song. Download: Website.
  • Reference: Bai, Y., Kang, J., & Song, P.X.K. (2014). Efficient pairwise composite likelihood estimation for spatial‐clustered data. Biometrics, 70(3), 661-670.

GSMC

  • A simulation-free group sequential design with max-combo tests in the presence of non-proportional hazards.
  • Language(s): R
  • Faculty: Peter X.K. Song. Author: Lili Wang.
  • Download: Github.

IAfrac

  • Calculate sample sizes and information fractions (IF) for Fleming-Harrington class weighted log-rank tests (FH-WLRT) in interim analysis (IA).
  • Language(s): R
  • Faculty: Peter X.K. Song. Author: Lili Wang.
  • Download: Github.

metaFuse

  • Fused lasso approach in regression coefficient clustering.
  • Faculty: Peter X.K. Song. Download: Website.
  • Reference: Tang, L., & Song, P.X.K. (2016). Fused Lasso Approach in Regression Coefficients Clustering -- Learning Parameter Heterogeneity in Data Integration. Journal of Machine Learning Research, 17(113):1−23.

MODAC

  • Method of divide-and-combine in regularized generalized linear models for big data.
  • Faculty: Peter X.K. Song. Download: Website.
  • Reference: Tang, L., Zhou, L., and Song, P.X.K. (2016). Method of Divide-and-Combine in Regularised Generalised Linear Models for Big Data. arXiv preprint arXiv:1611.06208.

NGM

  • Bayesian semi-parametric stochastic velocity model with Ornstein-Uhlenbeck process.
  • Faculty: Peter X.K. Song. Download: Website.

HDDesign

  • Determine the sample size for high dimensional classification studies.
  • Faculty: Peter X.K. Song. Download: Website.
  • Reference: Sanchez, B.N., Wu, M., Song, P.X.K., and Wang W. (2016). Study design in high-dimensional classification analysis. Biostatistics, doi: 10.1093/biostatistics/kxw018.

qif

  • Estimation of regression coefficients in longitudinal marginal models using quadratic inference functions.
  • Faculty: Peter X.K. Song. Download: Github, Website, CRAN.
  • Reference: Bai, Y., Kang, J., & Song, P.X.K. (2014). Reference: Qu A, Lindsay BG, Li B. Improving generalized estimating equations using quadratic inference functions. Biometrika 2000, 87 823-836.

RCD


Tensor

  • CP and non-negative tensor decomposition/factorizations. Also, a Shiny application to study convergence and clustering properties of decomposition methods.
  • Faculty: Peter X.K. Song. Download: Github, Website.

 

 

nltm

  • Non-linear transformation models (nltm) for analyzing survival data.
  • Faculty: Alexander Tsodikov. Download: Github.
  • Reference: Tsodikov, A., 2003. Semiparametric models: a generalized self‐consistency approach. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 65(3), pp.759-774.

 

DAP

  • Integrative genetic association analysis using deterministic approximation of posteriors.
  • Faculty: Xiaoquan William Wen. Download: Github.
  • Reference: Wen, X., Lee, Y., Luca, F., Pique-Regi, R. Efficient Integrative Multi-SNP Association Analysis using Deterministic Approximation of Posteriors. The American Journal of Human Genetics, 98(6), 1114-1129.

fmeqtl


integrative

  • Enrichment estimation aided colocalization analysis.
  • Faculty: Xiaoquan William Wen. Download: Github.
  • Reference: Wen, X., Pique-Regi, R., Luca, F. Integrating Molecular QTL Data into Genome-wide Genetic Association Analysis: Probabilistic Assessment of Enrichment and Colocalization. PLOS Genetics. 2017 Mar 13(3): e1006646.

IRLS

  • Implementation of iteratively re-weighted least squares algorithm (IRLS) algorithm for generalized linear model in C++.
  • Faculty: Xiaoquan William Wen. Download: Github.

TORUS

  • QTL discovery utilizing genomic annotations. Computational procedure for discovering molecular QLTs incorporating genomic annotations.
  • Faculty: Xiaoquan William Wen. Download: Github.
  • Reference: Wen, X. Effective QTL Discovery Incorporating Genomic Annotations. bioRxiv doi:10.1101/032003.

sbams

  • Bayesian model selection in complex linear systems.
  • Faculty: Xiaoquan William Wen. Download: Github.
  • Reference: Wen, X. "Bayesian Model Selection in Complex Linear Systems, as Illustrated in Genetic Association Studies", submit to Biometrics.

baker

  • Bayesian Analytic Kit for Etiology Research.
  • Faculty: Zhenke Wu. Download: Github, Website.
  • Reference: Wu, Z., Deloria-Knoll, M. and Zeger, S.L., 2016. Nested partially latent class models for dependent binary data; estimating disease etiology. Biostatistics, 18(2), pp.200-213. <doi:10.1093/biostatistics/kxw037>.

mpcr

  • Package for estimating treatment effects in matched-pair cluster randomized trials (MPCR) using covariate calibration.
  • Faculty: Zhenke Wu. Download: Github, Website.
  • Reference: Wu, Z., Frangakis, C.E., Louis, T.A. and Scharfstein, D.O., 2014. Estimation of treatment effects in matched‐pair cluster randomized trials by calibrating covariate imbalance between clusters. Biometrics, 70(4), pp.1014-1022. <doi:10.1111/biom.12214>.

rewind

  • Package for fitting Bayesian restricted latent class models.
  • Faculty: Zhenke Wu. Download: Github.
  • Reference: Wu, Z., Casciola-Rosen, L., Rosen, A. and Zeger, S.L., 2018. A Bayesian approach to restricted latent class models for scientifically-structured clustering of multivariate binary outcomes. arXiv preprint arXiv:1808.08326. <doi:10.1101/400192>.

spotgear

  • Package for fitting Bayesian two-dimensional image dewarping models and estimating disease subsets and signatures.
  • Faculty: Zhenke Wu. Download: Github, Website.
  • Reference: Wu, Z., Casciola-Rosen, L., Shah, A.A., Rosen, A. and Zeger, S.L., 2017. Estimating autoantibody signatures to detect autoimmune disease patient subsets. Biostatistics, 20(1), pp.30-47. <doi:10.1093/biostatistics/kxx061>.

 

CopyMap

  • CopyMap is based on a hidden Markov Model (HMM), predicting the location of CNVs and their allele frequencies using data from a set of CGH experiments.
  • Faculty: Sebastian Zöllner. Download: Website.

FTEC

  • A coalescent simulator capable of modeling faster than exponential population growth.
  • Faculty: Sebastian Zöllner. Download: Github.
  • References: Reppell, M., Boehnke, M. and Zöllner, S., 2012. FTEC: a coalescent simulator for modeling faster than exponential growth. Bioinformatics, 28(9), pp.1282-1283.

TRAFIC

  • TRAFIC (Test for Rare-variant Association using Family-based Internal Controls) tests for rare variant associations in affected sibpairs by comparing the allele count of rare variants on chromosome regions shared identical by descent (IBD) to the allele count of rare variants on non-shared chromosome regions.
  • Faculty: Sebastian Zöllner. Download: Github.
  • References: Lin, K.H. and Zöllner, S., 2015. Robust and powerful affected sibpair test for rare variant association. Genetic epidemiology, 39(5), pp.325-333.

 

Bama

  • Mediation analysis in the presence of high-dimensional mediators based on the potential outcome framework. Bayesian Mediation Analysis (BAMA), developed by Song et al (2018) <doi:10.1101/467399>.
  • Faculty: Bhramar Mukherjee, Min Zhang, Xiang ZhouDownload: CRAN.
  • Reference: Song, Y., Zhou, X., Zhang, M., Zhao, W., Liu, Y., Kardia, S., Roux, A.D., Needham, B., Smith, J.A. and Mukherjee, B., 2018. Bayesian Shrinkage Estimation of High Dimensional Causal Mediation Effects in Omics Studies. bioRxiv, p.467399.

LGEWIS

  • Functions for genome-wide association studies (GWAS)/gene-environment-wide interaction studies (GEWIS) with longitudinal outcomes and exposures.
  • Faculty: Seunggeun Shawn Lee, Bhramar Mukherjee, Min Zhang. Download: CRAN.
  • References: He et al. (2017) "Set-Based Tests for Gene-Environment Interaction in Longitudinal Studies" and He et al. (2017) "Rare-variant association tests in longitudinal studies, with an application to the Multi-Ethnic Study of Atherosclerosis (MESA)". 

Lodi

  • Impute observed values below the limit of detection (LOD) via censored likelihood multiple imputation (CLMI) in single-pollutant models, developed by Boss et al (2019) <doi:10.1097/EDE.0000000000001052>.
  • Faculty: Seunggeun Shawn Lee, Bhramar Mukherjee, Min Zhang. Download: CRAN.
  • References: Boss, J., Mukherjee, B., Ferguson, K.K., Aker, A., Alshawabkeh, A.N., Cordero, J.F., Meeker, J.D. and Kim, S., 2019. Estimating outcome-exposure associations when exposure biomarker detection limits vary across batches. Epidemiology, 30(5), pp.746-755.

Tnseq

  • Identification of conditionally essential genes using high-throughput sequencing data from transposon mutant libraries.
  • Faculty: Lili Zhao. Download: CRAN.
  • Reference: Zhao, L., Anderson, M.T., Wu, W., Mobley, H.L. and Bachman, M.A., 2017. TnseqDiff: identification of conditionally essential genes in transposon sequencing studies. BMC bioinformatics, 18(1), p.326.

Bama

  • Mediation analysis in the presence of high-dimensional mediators based on the potential outcome framework. Bayesian Mediation Analysis (BAMA), developed by Song et al (2018) <doi:10.1101/467399>.
  • Faculty: Bhramar Mukherjee, Min Zhang, Xiang ZhouDownload: CRAN.
  • Reference: Song, Y., Zhou, X., Zhang, M., Zhao, W., Liu, Y., Kardia, S., Roux, A.D., Needham, B., Smith, J.A. and Mukherjee, B., 2018. Bayesian Shrinkage Estimation of High Dimensional Causal Mediation Effects in Omics Studies. bioRxiv, p.467399.

DPR

  • DPR is a software package implementing the latent Dirichlet process regression method for genetic prediction of complex traits.
  • Faculty: Xiang Zhou. Download: Github, Website.
  • Reference: Ping Zeng and Xiang Zhou (2017). Non-parametric genetic prediction of complex traits with latent Dirichlet process regression models. Nature Communications. 8: 456.

GEMMA

  • GEMMA is the software implementing the Genome-wide Efficient Mixed Model Association algorithm for a standard linear mixed model and some of its close relatives for genome-wide association studies (GWAS).
  • Faculty: Xiang Zhou. Download: Github, Website.
  • Reference: Xiang Zhou and Matthew Stephens (2012). Genome-wide efficient mixed-model analysis for association studies. Nature Genetics. 44: 821–824.

iMAP

  • iMAP is a method which performs integrative mapping of pleiotropic association and functional annotations using penalized Gaussian mixture models. 
  • Faculty: Xiang Zhou. Download: Github, Website.
  • Reference: Ping Zeng, Xingjie Hao and Xiang Zhou. Pleiotropic Mapping and Annotation Selection in Genome-wide Association Studies with Penalized Gaussian Mixture Models. bioRxiv 2018. Doi: 10.1101/256461.

IMAGE

  • IMAGE is a method that performs methylation quantitative trait locus (mQTL) mapping in bisulfite sequencing studies.
  • Faculty: Xiang Zhou. Download: CRAN, Github, Website.
  • Reference: Yue Fan, Tauras P. Vilgalys, Shiquan Sun, Qinke Peng, Jenny Tung and Xiang Zhou (2019). High-powered detection of genetic effects on DNA methylation using integrated methylation QTL mapping and allele-specific analysis. bioRxiv.

MACAU

  •  MACAU is the software implementing the Mixed model Association for Count data via data AUgmentation algorithm.
  • Faculty: Xiang Zhou. Download: Website.
  • Reference: Amanda J. Lea, Jenny Tung and Xiang Zhou (2015). A flexible, effcient binomial mixed model for identifying differential DNA methylation in bisulfite sequencing data. PLoS Genetics. 11: e1005650.

PMR-Egger

  • PMR-Egger is a method that fits probabilistic Mendelian randomization with an Egger regression assumption on horizontal pleiotropy for transcriptome-wide association studies (TWASs). 
  • Faculty: Xiang Zhou. Download: Github, Website.
  • Reference: Zhongshang Yuan, Huanhuan Zhu, Ping Zeng, Sheng Yang, Shiquan Sun, Can Yang, Jin Liu and Xiang Zhou (2019). Testing and controlling for horizontal pleiotropy with the probabilistic Mendelian randomization in transcriptome-wide association studies.

PQLseq

  • PQLseq is a method that fits generalized linear mixed models for analyzing RNA sequencing and bisulfite sequencing data.
  • Faculty: Xiang Zhou. Download: Github, Website.
  • Reference: Shiquan Sun*, Jiaqiang Zhu*, Sahar Mozaffari, Carole Ober, Mengjie Chen and Xiang Zhou (2018). Heritability estimation and differential analysis with generalized linear mixed models in genomic sequencing studies. Bioinformatics. in press.

SMART

  • SMART is a software implementing the Scalable Multiple Annotation integration for trait-Relevant Tissue identification and usage.
  • Faculty: Xiang Zhou. Download: Website.
  • Reference: Xingjie Hao, Ping Zeng, Shujun Zhang and Xiang Zhou (2018). Identifying and exploiting trait-relevant tissues with multiple functional annotations in genome-wide association studies. PLoS Genetics. e1007186.

SPARK

  • SPARK is a method for detecting genes with spatial expression patterns in spatially resolved transcriptomic studies.
  • Faculty: Xiang Zhou. Download: Github, Website.
  • Reference: Shiquan Sun*, Jiaqiang Zhu* and Xiang Zhou (2019). Statistical analysis of spatial expression pattern for spatially resolved transcriptomic studies.

subgxe

  • R package that implements p-value assisted subset testing for association (pASTA), a method developed by Yu et al. (2019) <doi:10.1159/000496867>.
  • Faculty: Bhramar Mukherjee, Xiang Zhou, Seunggeun Shawn Lee. Download: CRAN.
  • References: Yu, Y., Xia, L., Lee, S., Zhou, X., Stringham, H.M., Boehnke, M. and Mukherjee, B., 2018. Subset-Based Analysis using Gene-Environment Interactions for Discovery of Genetic Associations across Multiple Studies or Phenotypes. Human heredity, 83(6), pp.283-314.

VIPER

  • VIPER is a method that performs Variability Preserving ImPutation for Expression Recovery in single cell RNA sequencing studies.
  • Faculty: Xiang Zhou. Download: Github, Website.
  • Reference: Mengjie Chen and Xiang Zhou (2018). VIPER: variability-preserving imputation foraccurate gene expression recovery insingle-cell RNA sequencing studies. Genome Biology. 19:196.

WHODAD

  • WHODAD is a software package implementing the WHODAD method for paternity inference from low-coverage sequencing data. 
  • Faculty: Xiang Zhou. Download: Website.
  • Reference: Noah Snyder-Mackler, William H Majoros, Michael L Yuan, Amanda O Shaver, Jacob B Gordon, Gisela H Kopp, Stephen A Schlebusch, Jeffrey D Wall, Susan C Alberts, Sayan Mukherjee, Xiang Zhou and Jenny Tung (2016). Efficient genome-wide sequencing and low-coverage pedigree analysis from non-invasively collected samples. Genetics. 203: 699-714.