menajobs
  • Resume Tools
  • ATS Checker
  • Offer Checker
  • Features
  • Pricing
  • FAQ
LoginGet Started — Free
Home/Jobs/Data Engineer (PySpark)
GSSTech Group logo
GSSTech Group

Data Engineer (PySpark)

🇦🇪 Dubai, UAE🏢 On-site
PySparkCloudera Data PlatformCDPETLELTBig DataSQLAirflow
WhatsAppLinkedInX

Stand Out

  • Get to the top of GSSTech Group's applicant pile
  • Get AI-rewritten bullet points
  • Download Gulf-ready CV
Optimize My CV

60 seconds. $3.99 one-time.

GSSTech Group logo
GSSTech Group
employees

We are seeking a highly skilled Data Engineer with strong expertise in PySpark and the Cloudera Data Platform (CDP). The ideal candidate will design, develop, and maintain scalable data pipelines while ensuring high data quality, performance, and availability across the organisation.

This role requires hands-on experience in big data ecosystems, cloud-native technologies, and advanced data processing frameworks. You will collaborate with cross-functional teams to build reliable and high-performance data solutions that drive business insights.

Key Responsibilities

1. Data Pipeline Development

• Design, develop, and maintain scalable ETL/ELT pipelines using PySpark on CDP
• Ensure data integrity, reliability, and performance optimisation

2. Data Ingestion

• Develop ingestion frameworks to collect data from relational databases, APIs, streaming sources, and file systems
• Load structured and unstructured data into Data Lake/Data Warehouse environments

3. Data Transformation & Processing

• Process, cleanse, and transform large-scale datasets using PySpark
• Build reusable data processing components

4. Performance Optimisation

• Tune Spark jobs and Cloudera components for optimal performance
• Optimise memory, partitioning, and execution plans
• Reduce ETL runtime and improve cluster efficiency

5. Data Quality & Validation

• Implement data validation checks and monitoring mechanisms
• Ensure end-to-end data quality and governance standards

6. Automation & Orchestration

• Automate workflows using tools such as Apache Oozie, Apache Airflow, or similar orchestration frameworks
• Maintain CI/CD integration for data pipelines

7. Monitoring & Support

• Monitor pipeline health and troubleshoot failures
• Provide production support and continuous improvements

Required Skills & Qualifications

• 5+ years of experience in Data Engineering
• Strong hands-on experience in PySpark
• Experience working on Cloudera Data Platform (CDP)
• Strong knowledge of Hadoop ecosystem (HDFS, Hive, Impala, YARN)
• Proficiency in SQL and data modelling concepts
• Experience with workflow orchestration tools (Airflow, Oozie, etc.)
• Good understanding of data warehousing concepts
• Experience with performance tuning and optimisation

Good to Have

• Experience with cloud platforms (AWS, Azure, GCP)
• Knowledge of streaming tools (Kafka, Spark Streaming)
• Exposure to DevOps practices and CI/CD pipelines
• Banking/Financial Services domain experience

Requirements

  • •5+ years of experience in Data Engineering
  • •Strong hands-on experience in PySpark
  • •Experience working on Cloudera Data Platform (CDP)
  • •Strong knowledge of Hadoop ecosystem
  • •Proficiency in SQL and data modelling concepts
  • •Experience with workflow orchestration tools (Airflow, Oozie, etc.)
  • •Good understanding of data warehousing concepts
  • •Experience with performance tuning and optimisation

Nice to Have

  • •Experience with cloud platforms (AWS, Azure, GCP)
  • •Knowledge of streaming tools (Kafka, Spark Streaming)
  • •Exposure to DevOps practices and CI/CD pipelines
  • •Banking/Financial Services domain experience

Responsibilities

  • •Design, develop, and maintain scalable ETL/ELT pipelines using PySpark on CDP
  • •Develop ingestion frameworks to collect data from various sources
  • •Process, cleanse, and transform large-scale datasets using PySpark
  • •Tune Spark jobs and Cloudera components for optimal performance
  • •Implement data validation checks and monitoring mechanisms
  • •Automate workflows using tools such as Apache Oozie, Apache Airflow, or similar
  • •Monitor pipeline health and troubleshoot failures
  • •Provide production support and continuous improvements

Related Jobs

AECOM logo
Engineer - Smart City
AECOM · 🇸🇦 Makkah
Foodics logo
Expansion Executive
Foodics · 🇸🇦 Jeddah
MLabs logo
Head of Ecosystem
MLabs · 🇦🇪 Dubai
Nuvei logo
Business Development Manager
Nuvei · 🇦🇪 Dubai
Back to all jobs
75% Get Rejected
  • See if your CV passes GSSTech Group's ATS filters
  • Get AI-rewritten bullet points
  • Download Gulf-ready CV
Check My Resume

60 seconds. $3.99 one-time.

GCC Info
Company
GSSTech Group logo
GSSTech Group
employees

Visit WebsiteView all jobs
Share
WhatsAppLinkedInX
menajobs

AI-powered resume optimization for the Gulf job market.

Serving:

UAESaudi ArabiaQatarKuwaitBahrainOman

Product

  • Resume Tools
  • Features
  • Pricing
  • FAQ

Resources

  • Resume Examples
  • CV Format Guides
  • Skills Guides
  • Salary Guides
  • ATS Keywords
  • Job Descriptions
  • Career Paths
  • Interview Questions
  • Achievement Examples
  • Resume Mistakes
  • Cover Letters
  • Resume Summaries

Country Guides

  • Jobs by Country
  • Visa Guides
  • Cost of Living
  • Expat Guides
  • Work Culture

Free Tools

  • ATS Checker
  • Offer Evaluator
  • Salary Guides
  • All Tools

Company

  • About
  • Contact Us
  • Privacy Policy
  • Terms of Service
  • Refund Policy
  • Shipping & Delivery
  • Sitemap

Browse by Location

  • Jobs in UAE
  • Jobs in Saudi Arabia
  • Jobs in Qatar
  • Jobs in Dubai
  • Jobs in Riyadh
  • Jobs in Abu Dhabi

Browse by Category

  • Technology Jobs
  • Healthcare Jobs
  • Finance Jobs
  • Construction Jobs
  • Oil & Gas Jobs
  • Marketing Jobs

Popular Searches

  • Tech Jobs in Dubai
  • Healthcare in Saudi Arabia
  • Engineering in UAE
  • Finance in Qatar
  • IT Jobs in Riyadh
  • Oil & Gas in Abu Dhabi

© 2026 MenaJobs. All rights reserved.