Data Engineer (Python)
What's Your Score?
- See the score Qode's ATS gives your resume
- Get AI-rewritten bullet points
- Download Gulf-ready CV
60 seconds. $3.99 one-time.
Data Engineer (Python)
Experience: 5–6 years | Location: Dhahran/Khobar, KSA — onsite only | Duration: 3 months (extension possible) | Availability: Immediate
Role Overview
Build the data backbone of the MRO Inventory Optimization solution — ingestion, cleansing, transformation, and the optimization logic that turns raw SAP material master and inventory data into actionable outputs. You'll own pipelines from source through to the analytics and application layers.
Must-Have — technical depth expected
- Python: Production-grade code, modular design, packaging, logging, config management, unit testing (pytest); strong grasp of data structures and performance.
- Pandas / NumPy: Vectorized transformations, joins/merges, groupby/aggregation, handling large datasets, deduplication, type coercion, working with messy real-world MRO/master data.
- Airflow: Authoring DAGs, operators/sensors, scheduling and backfills, task dependencies, retries/SLAs, idempotent pipeline design, parameterization.
- BigQuery: Writing performant SQL, partitioning/clustering, cost-aware querying, loading/exporting data, working with nested/repeated fields.
- SQL: Advanced joins, window functions, CTEs, aggregation, query optimization across relational and warehouse engines.
- API development: Building and consuming REST APIs (FastAPI/Flask), request validation, pagination, integration with upstream systems (e.g., SAP-sourced data via CPI/OData).Good-to-Have
PySpark (distributed transforms), ML basics (forecasting/classification relevant to inventory optimization — EOQ, demand forecasting, slow-moving/obsolete stock detection), data quality frameworks (Great Expectations or similar), Docker, CI/CD.
Scope of Work
- Data ingestion from SAP material master and inventory feeds (via API/OData) and other sources into the warehouse.
- Data cleansing and master data processing — standardizing material descriptions, deduplication, classification, handling incomplete records.
- Build and orchestrate ETL pipelines (Airflow → BigQuery), ensuring reliability, idempotency, and data lineage.
- Implement inventory optimization logic (reorder points, safety stock, EOQ, criticality/ABC analysis, obsolescence flags).
- Develop backend services / APIs exposing processed data to the UI and BI layers.
Requirements
- •5-6 years of experience as a Data Engineer
- •Production-grade Python coding (modular design, packaging, logging, unit testing)
- •Proficiency with Pandas/NumPy for data manipulation
- •Experience authoring Airflow DAGs and managing pipelines
- •Strong SQL skills for BigQuery, including partitioning and cost-aware querying
- •Experience with REST API development (FastAPI/Flask)
- •Experience with SAP data sources (via API/OData)
- •Onsite work required in Dhahran/Khobar, KSA
Nice to Have
- •PySpark for distributed transformations
- •Basic ML knowledge (forecasting, classification)
- •Experience with data quality frameworks (e.g., Great Expectations)
- •Docker and CI/CD experience
Responsibilities
- •Build data pipelines for ingestion, cleansing, and transformation
- •Implement inventory optimization logic
- •Develop backend services/APIs for data exposure
- •Ensure pipeline reliability, idempotency, and data lineage
- •Process SAP material master and inventory data
- •Standardize material descriptions and handle incomplete records
Related Jobs
Browse Similar
- See the score Qode's ATS gives your resume
- Get AI-rewritten bullet points
- Download Gulf-ready CV
60 seconds. $3.99 one-time.
Qode is a digital transformation consultancy focused on helping businesses in the GCC region leverage technology to achieve their goals. They offer strategic advice and implementation services.
Visit WebsiteView all jobs