ETL Developer with Pyspark
- Full time
- Bangalore IN
- @FirstMile IT Services Inc
Job Detail
-
Job ID 8093
-
Career Level Others
-
Experience 4-6 Years
-
Industry IT & BPM
-
Qualifications Degree Bachelor
Job Description
About The Job:
- Role: ETL Developer with Pyspark
- Location: Mumbai / Bangalore / Gurugram / Kolkata
- Years of Experience: 4 – 10 Yrs
Full job description:
- Databricks Engineers with 2-3 years of hand-on experience in Pyspark, Cloud Platform and SQL.
- Proven experience in implementing data solutions on the Databricks platform with hands on experience in setting up
- Databricks cluster, working in Databricks modules, data pipelines for ingesting and transforming data from various sources into Databricks.
- Spark-based ETL workflows and data pipelines for data preparation and transformation within Databricks.
- Bachelor’s or Master’s degree with 4-6 years of professional experience.
- Proven experience in implementing data solutions on the Databricks platform.
- Solid understanding of Databricks fundamentals and have hands on experience in setting up Databricks cluster, working in Databricks modules (Data Engineering, ML and SQL warehouse).
- Configure, set up, and manage Databricks clusters, workspaces, and notebooks to ensure optimal performance, scalability, and resource allocation.
- Implement data pipelines for ingesting, cleansing, and transforming data from various sources into Databricks, ensuring data quality, consistency, and reliability. Develop ETL (Extract, Transform, Load) processes as needed.
- Develop, optimize, and maintain Spark-based ETL workflows and data pipelines for data preparation and transformation within Databricks.
Skills:
- Knowledge of Performance Optimization, Monitoring and Automation would be a plus.
- Understanding of data governance, compliance, and security best practices.
- Strong Proficiency in Pyspark, Databricks Notebooks, SQL, Python, Scala, or Java.
- Proficiency in SQL for data querying and manipulation.
- Experience with data modeling, ETL processes, and data warehousing concepts.
- Strong problem-solving and troubleshooting skills.
- Excellent communication and collaboration skills.
- Certifications in Databricks would be beneficial.