menajobs
  • Companies
  • Resume Tools
  • ATS Checker
  • Offer Checker
  • Features
  • Pricing
  • FAQ
Post a Job
LoginGet Started — Free
Home/Jobs/Quality Assurance Engineer
AI71 logo
AI71

Quality Assurance Engineer

🇦🇪 Abu Dhabi, UAE🏢 On-site
QA AutomationAI TestingLLMRAGPythonDockerKubernetesSAP
🛂 Visa Sponsored
WhatsAppLinkedInX

Get Noticed

  • Make sure AI71 actually reads your resume
  • Get AI-rewritten bullet points
  • Download Gulf-ready CV
Start Free Scan

60 seconds. $3.99 one-time.

💰Gratuity
🛂 Visa Sponsored
AI71 logo
AI71

<div id="model-response-message-contentr_106b7d3aed644636" class="markdown markdown-main-panel stronger enable-updated-hr-color">
<h3 data-path-to-node="4"><strong data-path-to-node="4" data-index-in-node="0">The Mission</strong></h3>
<p data-path-to-node="5">AI71 is seeking a <strong data-path-to-node="5" data-index-in-node="18">Senior QA Automation Engineer</strong> to lead the validation and verification strategies for ai71's AI transformation. In this role, you will define "what good looks like" for non-deterministic AI systems, ensuring that Large Language Models (LLMs) and predictive engines meet the strict reliability standards required for the defense and enterprise sectors.</p>
<p data-path-to-node="6">You will act as the bridge between Agile development and formal Systems Engineering. Your mandate is to build automated testing frameworks that validate AI behaviors against "Ground Truth" datasets and ensure our AI agents pass rigorous <strong data-path-to-node="6" data-index-in-node="237">Test Readiness Reviews (TRR)</strong> and <strong data-path-to-node="6" data-index-in-node="270">Functional Configuration Audits (FCA)</strong></p>
<h3 data-path-to-node="8"><strong data-path-to-node="8" data-index-in-node="0">Key Responsibilities</strong></h3>
<h4 data-path-to-node="9"><strong data-path-to-node="9" data-index-in-node="0">1. AI & LLM Validation </strong></h4>
<ul data-path-to-node="10">
<li>
<p data-path-to-node="10,0,0"><strong data-path-to-node="10,0,0" data-index-in-node="0">Non-Deterministic Testing:</strong> Architect automated frameworks to evaluate Generative AI outputs for hallucination, consistency, and factual accuracy against "Gold Standard" datasets.</p>
</li>
<li>
<p data-path-to-node="10,1,0"><strong data-path-to-node="10,1,0" data-index-in-node="0">RAG Evaluation:</strong> Implement automated metrics (e.g., <strong data-path-to-node="10,1,0" data-index-in-node="51">RAGAS</strong>, faithfulness, answer relevance) to verify that Retrieval-Augmented Generation pipelines accurately cite technical and regulatory documentation.</p>
</li>
<li>
<p data-path-to-node="10,2,0"><strong data-path-to-node="10,2,0" data-index-in-node="0">Prompt Regression:</strong> Design regression suites to monitor "prompt drift," ensuring model updates do not degrade the quality of AI-generated engineering documents.</p>
</li>
</ul>
<h4 data-path-to-node="11"><strong data-path-to-node="11" data-index-in-node="0">2. Integration & System Verification</strong></h4>
<ul data-path-to-node="12">
<li>
<p data-path-to-node="12,0,0"><strong data-path-to-node="12,0,0" data-index-in-node="0">Enterprise Integration:</strong> Build robust tests to validate data consistency between AI agents and critical systems (e.g., <strong data-path-to-node="12,0,0" data-index-in-node="118">SAP S/4HANA</strong>, Ariba), ensuring the integrity of Bill of Materials (BOM) and financial data.</p>
</li>
<li>
<p data-path-to-node="12,1,0"><strong data-path-to-node="12,1,0" data-index-in-node="0">Performance Benchmarking:</strong> Design tests to validate latency and throughput for forecasting models and risk-scoring engines using tools like <strong data-path-to-node="12,1,0" data-index-in-node="139">Locust, JMeter, or K6</strong>.</p>
</li>
<li>
<p data-path-to-node="12,2,0"><strong data-path-to-node="12,2,0" data-index-in-node="0">API & Security Validation:</strong> Automate testing of secure API gateways, verifying <strong data-path-to-node="12,2,0" data-index-in-node="78">Role-Based Access Control (RBAC)</strong> and PII redaction logic before data reaches AI models.</p>
</li>
</ul>
<h4 data-path-to-node="13"><strong data-path-to-node="13" data-index-in-node="0">3. Governance & Traceability</strong></h4>
<ul data-path-to-node="14">
<li>
<p data-path-to-node="14,0,0"><strong data-path-to-node="14,0,0" data-index-in-node="0">V-Model Alignment:</strong> Map automated test cases to "System Requirements" to create digital evidence for formal <strong data-path-to-node="14,0,0" data-index-in-node="107">Verification and Validation (V&V)</strong> reports.</p>
</li>
<li>
<p data-path-to-node="14,1,0"><strong data-path-to-node="14,1,0" data-index-in-node="0">Stage Gate Compliance:</strong> Prepare "Test Readiness" packages for formal reviews, providing quantitative evidence that systems are stable enough to move from MVP to Production.</p>
</li>
<li>
<p data-path-to-node="14,2,0"><strong data-path-to-node="14,2,0" data-index-in-node="0">Defect Lifecycle Management:</strong> Manage the feedback loop between Requirements Quality Assistants and development teams, tracing AI logic defects back to specific model versions.</p>
</li>
</ul>
<strong data-path-to-node="16" data-index-in-node="0">What You’ll Bring</strong>
<h4 data-path-to-node="17"><strong data-path-to-node="17" data-index-in-node="0">Technical Requirements</strong></h4>
<ul data-path-to-node="18">
<li>
<p data-path-to-node="18,0,0"><strong data-path-to-node="18,0,0" data-index-in-node="0">Core Automation:</strong> Expert proficiency in <strong data-path-to-node="18,0,0" data-index-in-node="39">Python (Pytest)</strong> and standard libraries (<strong data-path-to-node="18,0,0" data-index-in-node="79">Selenium/Playwright</strong>, Requests).</p>
</li>
<li>
<p data-path-to-node="18,1,0"><strong data-path-to-node="18,1,0" data-index-in-node="0">AI Evaluation:</strong> Hands-on experience with LLM evaluation frameworks (e.g., <strong data-path-to-node="18,1,0" data-index-in-node="73">DeepEval, TruLens</strong>) and "Ground Truth" dataset management.</p>
</li>
<li>
<p data-path-to-node="18,2,0"><strong data-path-to-node="18,2,0" data-index-in-node="0">Performance Engineering:</strong> Proficiency in crafting Performance Test Plans and implementations (Locust, K6, etc.).</p>
</li>
<li>
<p data-path-to-node="18,3,0"><strong data-path-to-node="18,3,0" data-index-in-node="0">Data Validation:</strong> Expertise in <strong data-path-to-node="18,3,0" data-index-in-node="30">SQL</strong> and data quality tools (e.g., <strong data-path-to-node="18,3,0" data-index-in-node="64">Great Expectations</strong>) for Data Lakehouses and Vector Databases.</p>
</li>
<li>
<p data-path-to-node="18,4,0"><strong data-path-to-node="18,4,0" data-index-in-node="0">CI/CD & DevOps:</strong> Strong experience integrating quality gates into <strong data-path-to-node="18,4,0" data-index-in-node="65">GitLab CI/CD</strong> pipelines.</p>
</li>
<li>
<p data-path-to-node="18,5,0"><strong data-path-to-node="18,5,0" data-index-in-node="0">Engineering Practices:</strong> Deep understanding of modern QE practices, including <strong data-path-to-node="18,5,0" data-index-in-node="76">Shift Left</strong>, Test Pyramid, and Mono-repo architectures.</p>
</li>
</ul>
<h4 data-path-to-node="19"><strong data-path-to-node="19" data-index-in-node="0">Professional Qualifications</strong></h4>
<ul data-path-to-node="20">
<li>
<p data-path-to-node="20,0,0"><strong data-path-to-node="20,0,0" data-index-in-node="0">Experience:</strong> 5+ years in QA Automation, with 2+ years focused on complex data-driven applications, ML models, or AI agents.</p>
</li>
<li>
<p data-path-to-node="20,1,0"><strong data-path-to-node="20,1,0" data-index-in-node="0">Domain Expertise:</strong> Background in <strong data-path-to-node="20,1,0" data-index-in-node="32">Defense, Aerospace</strong>, or highly regulated industries is a strong plus. Familiarity with <strong data-path-to-node="20,1,0" data-index-in-node="118">IV&V</strong> processes is highly desirable.</p>
</li>
<li>
<p data-path-to-node="20,2,0"><strong data-path-to-node="20,2,0" data-index-in-node="0">Analytical Mindset:</strong> Ability to define pass/fail criteria for probabilistic systems and communicate "Confidence Levels" to engineering leadership.</p>
</li>
</ul>
<h3 data-path-to-node="22"><strong data-path-to-node="22" data-index-in-node="0">Why AI71?</strong></h3>
<ul data-path-to-node="23">
<li>
<p data-path-to-node="23,0,0"><strong data-path-to-node="23,0,0" data-index-in-node="0">Final Line of Defense:</strong> You define whether an AI agent is "trusted" to negotiate a contract or design a critical system component in a safety-critical environment.</p>
</li>
<li>
<p data-path-to-node="23,1,0"><strong data-path-to-node="23,1,0" data-index-in-node="0">Tax-Free Compensation:</strong> A market-leading, 100% tax-free salary and benefits package.</p>
</li>
<li>
<p data-path-to-node="23,2,0"><strong data-path-to-node="23,2,0" data-index-in-node="0">Full Relocation Support:</strong> Comprehensive assistance including flights, housing support, and visa sponsorship for you and your family to Abu Dhabi.</p>
</li>
<li>
<p data-path-to-node="23,3,0"><strong data-path-to-node="23,3,0" data-index-in-node="0">Pioneering Work:</strong> Set the global standard for how defense organizations validate intelligent systems using world-leading models.</p>
</li>
</ul>
</div>

Requirements

  • •Lead validation and verification strategies for AI transformation
  • •Define standards for non-deterministic AI systems
  • •Ensure LLMs and predictive engines meet reliability standards
  • •Build automated testing frameworks
  • •Evaluate AI outputs for hallucination, consistency, and accuracy
  • •Verify Retrieval-Augmented Generation pipelines
  • •Design regression suites for prompt drift
  • •Validate data consistency with enterprise systems (SAP S/4HANA, Ariba)

Nice to Have

  • •Experience with RAGAS framework
  • •Experience with AI testing for defense and enterprise sectors
  • •Familiarity with Bill of Materials (BOM) and financial data integrity

Responsibilities

  • •Architect automated frameworks for Generative AI outputs
  • •Implement automated metrics for RAG evaluation
  • •Design regression suites to monitor prompt drift
  • •Build tests to validate data consistency between AI agents and critical systems
  • •Design tests to validate latency and throughput for forecasting models
  • •Ensure AI agents pass Test Readiness Reviews (TRR) and Functional Configuration Audits (FCA)

Related Jobs

Stranger Soccer logo
Operator & License Owner, Oman
Stranger Soccer · 🇴🇲 Muscat
Stranger Soccer logo
Operator & License Owner, Kuwait City
Stranger Soccer · 🇰🇼 Kuwait City
Stranger Soccer logo
Operator & License Owner, Jeddah
Stranger Soccer · 🇸🇦 Jeddah
Stranger Soccer logo
Operator & License Owner, Riyadh
Stranger Soccer · 🇸🇦 Riyadh

Browse Similar

Technology jobs in Abu DhabiJobs in Abu DhabiJobs in UAETechnology jobsJobs at AI71
Back to all jobs
Get Noticed
  • Make sure AI71 actually reads your resume
  • Get AI-rewritten bullet points
  • Download Gulf-ready CV
Start Free Scan

60 seconds. $3.99 one-time.

Benefits Package
🏠Housing
✈️Flights
🏥Medical
🎓Education
🚗Transport
💰Gratuity
🎯Bonus
📦Relocation
GCC Info
🛂 Visa Sponsored
Company
AI71 logo
AI71

AI71 offers a platform for creating and deploying advanced AI models. It serves businesses and developers seeking to integrate sophisticated artificial intelligence into their products.

Visit WebsiteView all jobs
Share
WhatsAppLinkedInX
menajobs

AI-powered GCC job board with resume optimization tools.

Serving:

UAESaudi ArabiaQatarKuwaitBahrainOman

Product

  • Resume Tools
  • Features
  • Pricing
  • FAQ

Resources

  • Resume Examples
  • CV Format Guides
  • Skills Guides
  • Salary Guides
  • ATS Keywords
  • Job Descriptions
  • Career Paths
  • Interview Questions
  • Achievement Examples
  • Resume Mistakes
  • Cover Letters
  • Resume Summaries
  • Resume Templates
  • ATS Resume Guide
  • Fresher Resumes
  • Career Change
  • Industry Guides

Country Guides

  • Jobs by Country
  • Visa Guides
  • Cost of Living
  • Expat Guides
  • Work Culture

Free Tools

  • ATS Checker
  • Offer Evaluator
  • Salary Guides
  • All Tools

Company

  • About
  • Contact Us
  • Privacy Policy
  • Terms of Service
  • Refund Policy
  • Shipping & Delivery
  • Sitemap

Browse by Location

  • Jobs in UAE
  • Jobs in Saudi Arabia
  • Jobs in Qatar
  • Jobs in Dubai
  • Jobs in Riyadh
  • Jobs in Abu Dhabi

Browse by Category

  • Technology Jobs
  • Healthcare Jobs
  • Finance Jobs
  • Construction Jobs
  • Oil & Gas Jobs
  • Marketing Jobs

Popular Searches

  • Tech Jobs in Dubai
  • Healthcare Jobs in Dubai
  • Finance Jobs in Dubai
  • Engineering Jobs in Dubai
  • Marketing Jobs in Dubai
  • Oil & Gas Jobs in Dubai
  • Tech Jobs in Riyadh
  • Healthcare Jobs in Riyadh
  • Finance Jobs in Riyadh
  • Engineering Jobs in Riyadh
  • Marketing Jobs in Riyadh
  • Oil & Gas Jobs in Riyadh
  • Tech Jobs in Abu Dhabi
  • Healthcare Jobs in Abu Dhabi
  • Finance Jobs in Abu Dhabi
  • Engineering Jobs in Abu Dhabi
  • Marketing Jobs in Abu Dhabi
  • Oil & Gas Jobs in Abu Dhabi
  • Tech Jobs in Doha
  • Healthcare Jobs in Doha
  • Finance Jobs in Doha
  • Engineering Jobs in Doha
  • Marketing Jobs in Doha
  • Oil & Gas Jobs in Doha
  • Tech Jobs in Kuwait City
  • Healthcare Jobs in Kuwait City
  • Finance Jobs in Kuwait City
  • Engineering Jobs in Kuwait City
  • Marketing Jobs in Kuwait City
  • Oil & Gas Jobs in Kuwait City

Browse by Nationality

  • UAE Jobs for Indians
  • UAE Jobs for Filipinos
  • Saudi Jobs for Indians
  • Saudi Jobs for Pakistanis
  • Qatar Jobs for Nepalis
  • Qatar Jobs for Filipinos
  • Kuwait Jobs for Egyptians
  • Bahrain Jobs for Indians
  • Oman Jobs for Bangladeshis
  • UAE Jobs for Pakistanis
Featured on Better LaunchFeatured on neeed.directoryFeatured on Aura++

© 2026 MenaJobs. All rights reserved.

Go to appSS