cluster-based systems on terabyte-scale datasets
+ Using evolutionary genomics to identify causal human variants
** Statistics
-+ Statistical modeling
++ Statistical modeling (regression, inference, Bayesian and
+ frequentist approaches in very large (> 1TB) datasets)
+ Addressing confounders and batch effects
+ Reproducible research
** Big Data
+ Collaborative Development: git, travis, continuous integration,
automated testing
+ Databases: Postgresql (PL/SQL), SQLite, Mysql
++ Office Software: Gnumeric, Libreoffice, \LaTeX, Word, Excel,
+ Powerpoint
** Communication
+ Strong written communication skills as evidenced by publication
record