***The Data Engineer role is 100% remote, working East Coast hours.***
***Unfortunately, we are not able to provide any type of sponsorship now or at any time in the future.***
The Data Engineer, Translational Data Services, will support the Translational Data Services team in the implementation, maintenance, and monitoring of data standards across Precision for Medicine’s laboratory services unit. This position plays a key role in implementing and monitoring standards and data transfer/QC processes that help Precision’s laboratory sciences team deliver high-quality data to Precision’s clients.
Essential functions of the job include but are not limited to:
- Create production-level source code for use in biomarker data management and biomarker data delivery
- Work closely with our lab team to develop, execute, and maintain custom pipelines for data processing and quality control (QC) of biomarker data such as flow cytometry, cell-based assays (e.g., MSD portfolio), next-generation sequencing, gene expression, IHC, and similar data types
- Build data pipelines that clean, transform, aggregate, or visualize data from disparate sources
- Perform source code validation and application testing
- Collaborate with members of other Precision divisions and externally with third-party clients and vendors
- Other duties as assigned
Qualifications:
Minimum Required:
- Bachelor’s degree in data management, bioinformatics, biotechnology, computer science, information technology, engineering, mathematics, physics, data science, or a related discipline
- 2 years of relevant working experience
- Proven experience with programming languages such as R or Python and data visualization/exploration tools
- Proven experience analyzing biological data, querying databases, writing API calls, and using the AWS ecosystem
- Ability to work under tight deadlines on a dynamic, fast-paced team and to handle multiple projects at the same time
- Ability to leverage existing technologies, internal and external, to address unique challenges
- Ability to identify problems and work with internal stakeholders toward a resolution
- Excellent communication and interpersonal skills
- Team player who contributes to a positive, collaborative working environment
- Must be able to read, write, speak, and comprehend English fluently
Preferred:
- Master’s degree in data management, bioinformatics, data science, biostatistics, biology, or a related field
- Experience with source code management systems such as Git
- Ability to work efficiently in a Unix/Linux environment
- Experience interfacing with laboratory information management systems (LIMS)
- Experience with cloud computing infrastructure (e.g. AWS)
- Experience with relational databases such as MySQL