DesiRecruiter - Ultimate Job Board for Desi US IT Recruiters!

Detailed Job Description:

Extensive experience leveraging Python to build data transformation pipelines (ETL) and experience with libraries such as pandas and numpy.
Extensive experience with Spark framework (PySpark) for writing complex data transformation logic using AWS EMR Jupyter notebooks.
Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
Experience using AWS Glue and AWS EMR to construct data pipelines.
Experience with AWS cloud services: EC2, EMR, Athena, StepFunctions, CloudWa...