I am a Ph.D. student in the Biostatistics department of Univerity of Michigan - Ann Arbor. Under the advisory of Goncalo Abecasis, my research focuses on statistical genetics. I have speicalties in next generation sequencing studies (NGS).
Collaborating with excellent colleagues, I have research experiences in sequencing reads alignment, variant callings, quality checks, and genetics association studies. Applying statistical genetics knowledge, I have been studying age-related macular degeneration disease using ~3,300 sequencing samples.
Ph.D. | Biostatistics University of Michigan, Ann Arbor, MI Advisor: Gonçalo R. Abecasis Thesis: Statistical methods and analysis in next generation sequencing. |
2013 (Expected) |
M.S. | Mathematics and Statistics University of Minnesota Duluth, Duluth, MN Advisor: Kang L. James |
2007 |
B.E. | Automation Tsinghua University, Beijing, China Advisor: Chongrong Li |
2005 |
Published
In revision
Submitted
In preparation
* Equal contribution first author
1. | Interuniversity Consortium for Political and Social Research (ICPSR) Summer Program in Quantitative Methods of Social Research Teaching Assistant - Longitudinal Analysis (taught by Dr. Michael Berbaum) |
Jul 2012 |
2. | Department of Mathematics and Statistics, University of Minnesota-Duluth, Duluth, MN Teaching Assistant - Differential Equations (taught by Dr. Marshall Hampton) |
Sep 2005 - Dec 2005 |
3. | Department of Mathematics and Statistics, University of Minnesota-Duluth, Duluth, MN Teaching Assistant - Calculus III (taught by Chad Pierson) |
Sep 2005 - Dec 2005 |
I can efficiently process large data sets in C++, Python, R and bash.
rvtests | Fast, handy association test software for family/unrelated studies and quantitaive/binary trait. Single variant test (score test, Wald test), burden tests (CMC, SKAT, VT) are implemented. |
checkVCF.py | Validation software for VCF file (used in lipid consortium) |
TabAnno | Fast annotation software; also integrated in EPACTS |
seqMiner | Efficiently load and process VCF files (preferrably use with TabAnno) |
KARMA | Illumina and ABI SOLiD sequence reads aligner |
QPLOT | Quality check and visualize sequence alignment files |
Polymutt | Family-based genotype caller |
ThunderVCF | Low-coverage genotype calling using Hidden Markov-chain Model (HMM) |
LASER | Ancestry inference using off-target reads (suitable for target/exome sequencing) |
Xiaowei Zhan