Detailed Job Description:
- Extensive experience leveraging Python to build data transformation pipelines (ETL) and experience with libraries such as pandas and numpy.
- Extensive experience with Spark framework (PySpark) for writing complex data transformation logic using AWS EMR Jupyter notebooks.
- Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
- Experience using AWS Glue and AWS EMR to construct data pipelines.
- Experience with AWS cloud services: EC2, EMR, Athena, StepFunctions, CloudWa...