From: Don Armstrong Date: Fri, 13 Apr 2018 22:46:21 +0000 (-0700) Subject: update resume for landolakes X-Git-Url: https://git.donarmstrong.com/?a=commitdiff_plain;h=refs%2Fheads%2Flandolakes_data_analyst;p=resume.git update resume for landolakes --- diff --git a/don_armstrong_resume.org b/don_armstrong_resume.org index f7e8d1e..c31d14a 100644 --- a/don_armstrong_resume.org +++ b/don_armstrong_resume.org @@ -65,8 +65,28 @@ ** Batchelor of Science (BS) in Biology \hfill UC Riverside * Skills +** Genomics and Epigenomics ++ NGS and array-based Genomics and Epigenomics of complex human + diseases using RNA-seq, targeted DNA sequencing, RRBS, Illumina + bead arrays, and Affymetrix microarrays from sample collection to + publication. ++ Reproducible, scalable bioinformatics analysis using make, + nextflow, and cwl based workflows on cloud- and cluster-based + systems on terabyte-scale datasets ++ Alignment, annotation, and variant calling using existing and custom + software, including GATK, bwa, STAR, and kallisto. ++ Assembly of non-model transcriptomes and genomes using trinity, + cortex_var, and soapdenovo. ++ Linkage and association-based mapping of complex phenotypes using + next-generation sequencing and arrays ++ Alignment, annotation, and variant calling using existing and custom + software ++ Correcting for and experimental design to overcome multiple + testing, confounders, and batch effects using Bayesian and + frequentist methods approaches ++ Using evolutionary genomics to identify causal human variants ** Data Science -+ Reproducible, scalable analyses using *R*, *perl*, and python with ++ Reproducible, scalable analyses using R, perl, and python with workflows on cloud- and cluster-based systems on terabyte-scale datasets + Experimental design and correction to overcome multiple testing, @@ -78,7 +98,7 @@ time series, and machine learning in very large (> 1TB) datasets) + Data mining, cleaning, processing and quality assurance of data sources and products using tidydata formalisms -+ Visualization using *R*, ggplot, Shiny, and custom written routines. ++ Visualization using R, ggplot, Shiny, and custom written routines. ** Software Development + Languages: perl, R, C, C++, python, groovy, sh, make + Collaborative Development: git, travis, continuous integration, @@ -87,20 +107,6 @@ + Databases: Postgresql (PL/SQL), SQLite, Mysql, NoSQL + Office Software: Gnumeric, Libreoffice, \LaTeX, Word, Excel, Powerpoint -** Genomics and Epigenomics -+ NGS and array-based Genomics and Epigenomics of complex human - diseases using RNA-seq, targeted DNA sequencing, RRBS, Illumina - bead arrays, and Affymetrix microarrays from sample collection to - publication. -+ Reproducible, scalable bioinformatics analysis using make, - nextflow, and cwl based workflows on cloud- and cluster-based - systems on terabyte-scale datasets -+ Alignment, annotation, and variant calling using existing and custom - software, including GATK, bwa, STAR, and kallisto. -+ Correcting for and experimental design to overcome multiple - testing, confounders, and batch effects using Bayesian and - frequentist methods approaches -+ Using evolutionary genomics to identify causal human variants ** Statistics + Statistical modeling (regression, inference, prediction, and learning in very large (> 1TB) datasets) @@ -111,11 +117,6 @@ + Inter-process communication: MPI, OpenMP + Filestorage: Gluster, CEFS, GPFS, Lustre + Linux system administration -** Genomics and Epigenomics -+ Linkage and association-based mapping of complex phenotypes using - next-generation sequencing and arrays -+ Alignment, annotation, and variant calling using existing and custom - software ** Mentoring and Leadership + Mentored graduate students and Outreachy and Google Summer of Code interns