4 #+AUTHOR: Don L. Armstrong
6 #+LATEX_CLASS: dlaresume
7 # #+LATEX_CLASS_OPTIONS: [10pt,breaklinks]
10 ** Team Lead Data Engineering at Bayer Crop Science \hfill 2018--Present
11 + Hired, managed, and developed team of 5+ Data Engineers, Systems
12 Administrators, and Business Analysts working within the Biologics
13 R&D unit of Bayer Crop Science enabling data capture, data
14 integration, and operationalization of data analysis pipelines
15 + Developed and supervised implementation of data capture,
16 integration, and analysis strategies to increase the value of
17 genomics, metabolomics, transcriptomics, spectroscopic, phenotypic
18 (/in vitro/ and /in planta/), and fermentation/formulation process
19 data for discovery and development
20 + Lead the development of multiple tools in python and R while
21 coaching, mentoring, and developing multiple developers and
23 + Served as a key collaborator on multiple cross-function and
24 cross-divisional projects, including leading the architecture of a
25 life science collaboration using serverless architecture to provide
26 machine-learning estimates of critical parameters from
27 spectrographic measurements
28 ** Debian Developer \hfill 2004--Present
29 + Maintained, managed configurations, and resolved issues in multiple
30 packages written in R, perl, python, scheme, C++, and C.
31 + Resolved technical conflicts, developed technical standards, and
32 provided leadership as the elected chair of the Technical Committee.
33 + Developer of [[https://bugs.debian.org][Debbugs]], a perl and SQL-based issue-tracker with ≥ 100
34 million entries with web, REST, and SOAP interfaces.
35 + Provided vendor-level support for complex systems integration issues
36 on Debian GNU/Linux systems.
37 ** Research Scientist at UIUC \hfill 2015--2017
38 + Primarily responsible for the planning, design, organization,
39 execution, and analysis of multiple complex epidemiological studies
40 involving epigenomics, transcriptomics, and genomics of diseases of
41 pregnancy and post-traumatic stress disorder.
42 + Published results in scientific publications and presented results
43 orally at major scientific conferences.
44 + Wrote and completed grants, including budgeting, scientific
45 direction, project management, and reporting.
46 + Mentored graduate students and collaborated with internal and
48 + Performed literature review, training, and applied new techniques to
49 maintain abreast of current scientific literature, principles of
50 scientific research, and modern statistical methodology.
51 + Wrote software and designed relational databases using R, perl, C,
52 SQL, make, and very large computational systems ([[https://bluewaters.ncsa.illinois.edu/][Blue Waters]])
53 ** Postdoctoral Researcher at USC \hfill 2013--2015
54 + Primarily responsible for the design, execution, and analysis of an
55 epidemiological study to identify genomic variants associated with
56 systemic lupus erythematosus using targeted deep sequencing.
57 + Designed, budgeted, configured, maintained, and supported a secure
58 linux analysis cluster (MPI/torque) with a shared filesystem (NFS
59 over gluster) for statistical analyses.
60 + Wrote multiple pieces of software to reproducibly analyze and
61 archive large datasets resulting from genomic sequencing.
62 + Coordinated with clinicians, molecular biologists, and biologists to
63 produce analyses and major reports.
64 ** Postdoctoral Researcher at UCR \hfill 2010--2012
65 + Primarily responsible for the execution and analysis of an
66 epidemiological study to identify genomic variants associated with
67 systemic lupus erythematosus using prior information and array based
68 approaches in a trio and cross sectional study of individuals from
69 the Los Angeles and greater United States.
70 + Wrote and maintained multiple software components to reproducibly
72 ** Independent Systems Administrator \hfill 2004--2018
73 + Researched, recommended, budgeted, designed, deployed, configured,
74 operated, and monitored highly-available high-performance enterprise
75 hardware and software for web applications, authentication, backup,
77 + Full life-cycle support of medium and small business networking
78 infrastructure, including VPN, network security, wireless networks,
79 routing, DNS, DHCP, and authentication.
81 ** Doctor of Philosophy (PhD) in Cell, Molecular and Developmental Biology \hfill UC Riverside
82 ** Batchelor of Science (BS) in Biology \hfill UC Riverside
85 ** Leadership and Mentoring
87 + Mentored graduate students and Outreachy and Google Summer of Code
89 + Former chair of Debian's Technical Committee
90 + Head developer behind https://bugs.debian.org
91 ** Software Development
92 + Languages: python, R, perl, C, C++, python, groovy, sh (bash, POSIX,
94 + Collaborative Development: git, Jira, github actions, Aha!, travis,
95 continuous integration, automated testing, continuous deployment
96 + Web, Mobile: Shiny, jQuery, JavaScript
97 + Databases: Postgresql (PL/SQL), SQLite, Mysql, NoSQL
99 + Parallel and Cloud Computing (slurm, torque, AWS, OpenStack, Azure)
100 + Inter-process communication: MPI, OpenMP
101 + Filestorage: Gluster, CEFS, GPFS, Lustre
102 + Linux system administration
103 ** Applications and Daemons
104 + Web: apache, ngix, varnish (load balancing/caching), REST, SOAP,
106 + SQL servers: PostgreSQL, MySQL, SQLite, oracle
107 + Build Tools: GNU make, cmake
108 + Continuous Integration/Deployment: codebuild, travis, jenkins,
110 + Virtualization: libvirt, KVM, qemu, VMware, docker
112 + VCS: git, mercurial, subversion
113 + Mail: postfix, exim, sendmail, spamassassin
114 + Configuration Infrastructure: puppet, hiera, etckeeper, git
115 + Documentation: \LaTeX, confluence, emacs, MarkDown, MediaWiki, ikiwiki, trac
116 + Monitoring: munin, nagios, icinga, prometheus
117 + Issue Tracking: Debbugs, Request Tracker, Trac, JIRA
118 + Office Software: Gnumeric, Libreoffice, \LaTeX, Word, Excel,
121 + Hardware, Linux routing and firewall experience, ferm, DHCP,
122 openvpn, bonding, NAT, DNHS, SNMP, IPv4, and IPv6.
124 + GNU/Linux (Debian, Ubuntu, Red Hat)
128 + Strong written communication skills as evidenced by publication
130 + Strong verbal and presentation skills as evidenced by presentation
132 ** Genomics and Epigenomics
133 + NGS and array-based Genomics and Epigenomics of complex human
134 diseases using RNA-seq, targeted DNA sequencing, RRBS, Illumina
135 bead arrays, and Affymetrix microarrays from sample collection to
137 + Reproducible, scalable bioinformatics analysis using make,
138 nextflow, and cwl based workflows on cloud- and cluster-based
139 systems on terabyte-scale datasets
140 + Alignment, annotation, and variant calling using existing and custom
141 software, including GATK, bwa, STAR, and kallisto.
142 + Correcting for and experimental design to overcome multiple
143 testing, confounders, and batch effects using Bayesian and
144 frequentist methods approaches
145 + Using evolutionary genomics to identify causal human variants
147 + Statistical modeling (regression, inference, prediction, and
148 learning in very large (> 1TB) datasets) using R and SAS.
149 + Addressing confounders and batch effects
150 + Reproducible research
151 * Authored Open Source Software
152 + *[[http://bugs.debian.org][Debbugs]]*: Bug tracking software for the Debian GNU/Linux
153 distribution. [[https://bugs.debian.org]]
154 + *[[http://git.donarmstrong.com/r/CairoHacks.git][CairoHacks]]*: Bookmarks and Raster images for large PDF plots in R.
155 + *[[http://rzlab.ucr.edu/function2gene/][Function2Gene]]*: Gene selection tool based on literature mining which
156 enables Bayesian approaches to significance testing.
157 + *[[http://rzlab.ucr.edu/scripts/wheel/wheel.cgi?sequence=ABCDEFGHIJLKMNOP&submit=Submit][Helical Wheel Projections]]*: Web-based tool to draw helical wheel
159 * Publications and Presentations
160 + 24 peer-reviewed publications cited over 3000 times:
162 + Publication record in GWAS, expression analysis of microarrays, SLE,
163 GBM, epigenetics, comparative evolution of mammals, and lipid
166 + Multiple presentations on EWAS of PTSD, genetics of SLE, and Open
167 Source: https://dla2.us/pres
171 + 2017 R Consortium: *[[https://www.r-consortium.org/blog/2017/04/03/q1-2017-isc-grants][Adding Linux Binary Builders to R-Hub]]* Role:
173 + 2015 Blue Waters Allocation Grant: *Making ancestral trees using Bayesian
174 inference to identify disease-causing genetic variants* Role:
176 + *Tracking placenta and uterine funciton using urinary extracellular vesicles* (R21
177 RFA-HD-16-037) Role: Key Personnel
178 + *NIAMS* R01-AR045650-04 *Genetics of Childhood Onset SLE* to Chaim O. Jacob.
180 ** Scholarships and Fellowships
181 + 2001--2003: University of California, Riverside Doctoral Fellowship
182 + 1997--2001: Regents of the University of California Scholarship.