complex, highly regulated environments.
+ Seeking opportunities to grow, lead, and transform organizations
with a larger scope and greater impact.
+
## Director of Data Science and Analytics at Ginkgo Bioworks 2022--Present
+ Directed a geographically distributed team of managers who lead
teams of data engineers, system administrators, statisticians,
+ Extensive collaborations with scientific, business, and customer
leaders attest to my excellent communication and interpersonal
skills.
+
## Team Lead Data Engineering at Bayer Crop Science 2018--2022
+ Hired, managed, and developed team of 5+ Data Engineers, Systems
Administrators, and Business Analysts working within the Biologics
life science collaboration using serverless architecture to provide
machine-learning estimates of critical parameters from
spectrographic measurements.
+
## Debian Developer 2004--Present
+ Maintained, managed configurations, and resolved issues in multiple
packages written in R, perl, python, scheme, C++, and C.
million entries with web, REST, and SOAP interfaces.
+ Provided vendor-level support for complex systems integration issues
on Debian GNU/Linux systems.
+
## Research Scientist at UIUC 2015--2017
+ Architected and engineered systems to store, retrieve, and analyze
complex R&D data including behavioral healthcare data (PTSD),
scientific research, and modern statistical methodology.
+ Wrote software and designed relational databases using R, perl, C,
SQL, make, and very large computational systems ([[https://bluewaters.ncsa.illinois.edu/][Blue Waters]])
+
## Postdoctoral Researcher at USC 2013--2015
+ Design, execution, and analysis of an epidemiological study to
identify genomic variants associated with systemic lupus
study of individuals from the Los Angeles and greater United States.
+ Wrote and maintained multiple software components to reproducibly
perform the analyses.
+
# Education
+ Doctor of Philosophy (PhD) in Cell, Molecular and Developmental Biology at UC Riverside
+ Batchelor of Science (BS) in Biology at UC Riverside
their potential and thrive.
+ Chair or lead of multiple initiatives and committees, including
aligning highly cross-functional and diverse stakeholders.
+
## Data Governance/Management/Engineering
+ Leadership and implementation of data governance and management
programs across multiple functions within Ginkgo and Bayer.
+ Establishment of Metadata and master data management standards and
frameworks in life science and business domains.
+ Snowflake, dbt, Airflow
+
## Bioinformatics, Genomics, and Epigenomics
+ NGS and array-based Genomics and Epigenomics of complex human
diseases using RNA-seq, targeted DNA sequencing, RRBS, Illumina
+ Alignment, annotation, and variant calling using existing and custom
software, including GATK, bwa, STAR, and kallisto
+ Using evolutionary genomics to identify causal human variants
+
## Statistics
+ Statistical modeling (regression, inference, prediction, and machine
learning in very large (> 1TB) datasets) using R and python.
+ Correcting & experimental design to overcome multiple testing,
confounders, and batch effects (both Bayesian and frequentist)
+ Reproducible research
+
## Software Development
+ Languages: python, R, perl, C, C++, groovy, sh (bash, POSIX,
and zsh), make
+ Cloud: AWS, Azure, GCP, OpenStack
+ Infrastructure as Code: AWS Cloudformation, Terraform, puppet,
etckeeper, hieara
+
## Big Data
+ Parallel and Cloud Computing (slurm, torque, AWS, OpenStack, Azure)
+ Inter-process communication: MPI, OpenMP
+ GNU/Linux (Debian, Ubuntu, Red Hat)
+ Windows
+ MacOS
+
## Communication
+ Strong written communication skills as evidenced by publication
record.
teams and stakeholders at all organizational levels.
+ Strong verbal and presentation skills as evidenced by presentation,
leadership, and teaching record
-* Authored Open Source Software
+
+# Authored Open Source Software
+ *[Debbugs](http://bugs.debian.org)*: Bug tracking software for the Debian GNU/Linux
distribution.
+ *[CairoHacks](http://git.donarmstrong.com/r/CairoHacks.git)*: Bookmarks and Raster images for large PDF plots in R.
enables Bayesian approaches to significance testing.
+ *[Helical Wheel Projections](http://rzlab.ucr.edu/scripts/wheel/wheel.cgi?sequence=ABCDEFGHIJLKMNOP&submit=Submit)*: Web-based tool to draw helical wheel
protein projections.
-* Publications and Presentations
+
+# Publications and Presentations
+ 24 peer-reviewed publications cited over 4000 times:
https://dla2.us/pubs
+ Publication record in GWAS, transcriptomics, SLE, GBM, epigenetics,