Job Overview
We are seeking a skilled Data Engineer/ETL Specialist to build and maintain robust data pipelines for our drug discovery team. You will integrate critical datasets from bioinformatics sources into our AWS infrastructure and ensure that the data is high quality and accessible to our scientists.
Key Responsibilities
- Design, develop, and maintain automated ETL pipelines.
- Integrate data from the Protein Data Bank (PDB) and other bioinformatics sources.
- Ensure data quality and consistency across all pipelines.
- Optimize data pipelines for performance and scalability.
- Collaborate with data scientists and other stakeholders to understand data requirements.
Required Skills
- Proficiency in Python and SQL.
- Experience with AWS data services (e.g., S3, Glue, Lambda).
- Strong understanding of data warehousing concepts.
- Experience with data visualization tools.