Epidemiologist II/Data Architect-CDC Contract (Remote). Filled

The position will create and maintain informatics infrastructure within the Division for data management and analysis of viral genome sequences. The candidate will collaborate with a growing team of virologists, bioinformaticians, informaticians, and epidemiologists to integrate genomic and phenotypic data for use in surveillance, outbreak investigation, and virological studies.

Responsibilities

  • Build and maintain data systems and pipelines for organizing and analyzing genomic, phenotypic and other data.

  • Assure operation of data analysis systems by implementing rigorous practices for standardization, software upgrades, documentation, and training of users.

  • Organize, document, and manage code and data to facilitate appropriate access and use.

  • Monitor data for quality control through coding checks and visualization dashboards.

  • Investigate data issues in the context of the data processing and analysis pipeline to assure efficient data flow, storage, and access.

  • Engage with team members and other data producers and consumers, including public sequence data repositories and collaborators around the world, to resolve data quality concerns in a timely manner.

Qualifications

Bachelor's, Master’s, or PhD degree in Bioinformatics, Computational Biology, Computer Science, Mathematics, or related field with 5 years of work experience or post-graduate studies using programming languages such Scala, Java, Python, R, and Perl.

Experience managing data in a data lake (Hadoop, Cloud, etc.) and/or data warehouse, with preference given to those who have managed disparate data sets of different types and sizes.

Knowledge of database design principles is necessary, including fluency in advanced SQL statements required for ETL and analytics.

Knowledge of Apache Spark framework is necessary and experience optimizing Spark jobs is greatly preferred.

Fluency with Linux/Unix and core command line utilities, including BASH scripting.

Knowledge of concepts related to access control and permissions, use of TLS certificates for data transfer, and basic firewall considerations.

Knowledge of basic genetics, microbiology, and molecular biology.

If you have additional questions, please submit form below.

Thank you,

PeopleTruss (a subsidiary of Ailm Medical)

Previous
Previous

Nutrition and Fitness Coach-Orlando, FL-Filled.