ETL Developer with Pyspark

Job Detail

  • Job ID 8093
  • Career Level  Others
  • Experience  4-6 Years
  • Industry  IT & BPM
  • Qualifications  Degree Bachelor

Job Description

About The Job:

  • Role: ETL Developer with Pyspark
  • Location: Mumbai / Bangalore / Gurugram / Kolkata
  • Years of Experience: 4 – 10 Yrs

Full job description:

  • Databricks Engineers with 2-3 years of hand-on experience in Pyspark, Cloud Platform and SQL.
  • Proven experience in implementing data solutions on the Databricks platform with hands on experience in setting up
  • Databricks cluster, working in Databricks modules, data pipelines for ingesting and transforming data from various sources into Databricks.
  • Spark-based ETL workflows and data pipelines for data preparation and transformation within Databricks.
  • Bachelor’s or Master’s degree with 4-6 years of professional experience.
  • Proven experience in implementing data solutions on the Databricks platform.
  • Solid understanding of Databricks fundamentals and have hands on experience in setting up Databricks cluster, working in Databricks modules (Data Engineering, ML and SQL warehouse).
  • Configure, set up, and manage Databricks clusters, workspaces, and notebooks to ensure optimal performance, scalability, and resource allocation.
  • Implement data pipelines for ingesting, cleansing, and transforming data from various sources into Databricks, ensuring data quality, consistency, and reliability. Develop ETL (Extract, Transform, Load) processes as needed.
  • Develop, optimize, and maintain Spark-based ETL workflows and data pipelines for data preparation and transformation within Databricks.

Skills:

  • Knowledge of Performance Optimization, Monitoring and Automation would be a plus.
  • Understanding of data governance, compliance, and security best practices.
  • Strong Proficiency in Pyspark, Databricks Notebooks, SQL, Python, Scala, or Java.
  • Proficiency in SQL for data querying and manipulation.
  • Experience with data modeling, ETL processes, and data warehousing concepts.
  • Strong problem-solving and troubleshooting skills.
  • Excellent communication and collaboration skills.
  • Certifications in Databricks would be beneficial.

Required skills