Quality Assurance Engineer

🇦🇪 Abu Dhabi, UAE · 🏢 On-site

QA Automation · AI Testing · LLM · RAG · Python · Docker · Kubernetes · SAP

🛂 Visa Sponsored

AI71

<div id="model-response-message-contentr_106b7d3aed644636" class="markdown markdown-main-panel stronger enable-updated-hr-color">
<h3 data-path-to-node="4">The Mission</h3>
AI71 is seeking a Senior QA Automation Engineer to lead the validation and verification strategies for AI71's AI transformation. In this role, you will define "what good looks like" for non-deterministic AI systems, ensuring that Large Language Models (LLMs) and predictive engines meet the strict reliability standards required for the defense and enterprise sectors.
You will act as the bridge between Agile development and formal Systems Engineering. Your mandate is to build automated testing frameworks that validate AI behaviors against "Ground Truth" datasets and ensure our AI agents pass rigorous Test Readiness Reviews (TRR) and Functional Configuration Audits (FCA).
<h3 data-path-to-node="8">Key Responsibilities</h3>
<h4 data-path-to-node="9">1. AI & LLM Validation</h4>
<ul data-path-to-node="10">
<li>
Non-Deterministic Testing: Architect automated frameworks to evaluate Generative AI outputs for hallucination, consistency, and factual accuracy against "Gold Standard" datasets.
</li>
<li>
RAG Evaluation: Implement automated metrics (e.g., faithfulness and answer relevance, using frameworks such as RAGAS) to verify that Retrieval-Augmented Generation pipelines accurately cite technical and regulatory documentation.
</li>
<li>
Prompt Regression: Design regression suites to monitor "prompt drift," ensuring model updates do not degrade the quality of AI-generated engineering documents.
</li>
</ul>
<h4 data-path-to-node="11">2. Integration & System Verification</h4>
<ul data-path-to-node="12">
<li>
Enterprise Integration: Build robust tests to validate data consistency between AI agents and critical systems (e.g., SAP S/4HANA, Ariba), ensuring the integrity of Bill of Materials (BOM) and financial data.
</li>
<li>
Performance Benchmarking: Design tests to validate latency and throughput for forecasting models and risk-scoring engines using tools like Locust, JMeter, or k6.
</li>
<li>
API & Security Validation: Automate testing of secure API gateways, verifying Role-Based Access Control (RBAC) and PII redaction logic before data reaches AI models.
</li>
</ul>
<h4 data-path-to-node="13">3. Governance & Traceability</h4>
<ul data-path-to-node="14">
<li>
V-Model Alignment: Map automated test cases to "System Requirements" to create digital evidence for formal Verification and Validation (V&V) reports.
</li>
<li>
Stage Gate Compliance: Prepare "Test Readiness" packages for formal reviews, providing quantitative evidence that systems are stable enough to move from MVP to Production.
</li>
<li>
Defect Lifecycle Management: Manage the feedback loop between Requirements Quality Assistants and development teams, tracing AI logic defects back to specific model versions.
</li>
</ul>
<h3>What You’ll Bring</h3>
<h4 data-path-to-node="17">Technical Requirements</h4>
<ul data-path-to-node="18">
<li>
Core Automation: Expert proficiency in Python (Pytest) and widely used automation libraries (Selenium/Playwright, Requests).
</li>
<li>
AI Evaluation: Hands-on experience with LLM evaluation frameworks (e.g., DeepEval, TruLens) and "Ground Truth" dataset management.
</li>
<li>
Performance Engineering: Proficiency in writing performance test plans and implementing them (Locust, k6, etc.).
</li>
<li>
Data Validation: Expertise in SQL and data quality tools (e.g., Great Expectations) for Data Lakehouses and Vector Databases.
</li>
<li>
CI/CD & DevOps: Strong experience integrating quality gates into GitLab CI/CD pipelines.
</li>
<li>
Engineering Practices: Deep understanding of modern QE practices, including Shift Left, the Test Pyramid, and monorepo architectures.
</li>
</ul>
<h4 data-path-to-node="19">Professional Qualifications</h4>
<ul data-path-to-node="20">
<li>
Experience: 5+ years in QA Automation, with 2+ years focused on complex data-driven applications, ML models, or AI agents.
</li>
<li>
Domain Expertise: Background in Defense, Aerospace, or other highly regulated industries is a strong plus. Familiarity with Independent Verification and Validation (IV&V) processes is highly desirable.
</li>
<li>
Analytical Mindset: Ability to define pass/fail criteria for probabilistic systems and communicate "Confidence Levels" to engineering leadership.
</li>
</ul>
<h3 data-path-to-node="22">Why AI71?</h3>
<ul data-path-to-node="23">
<li>
Final Line of Defense: You define whether an AI agent is "trusted" to negotiate a contract or design a critical system component in a safety-critical environment.
</li>
<li>
Tax-Free Compensation: A market-leading, 100% tax-free salary and benefits package.
</li>
<li>
Full Relocation Support: Comprehensive assistance with your move to Abu Dhabi, including flights, housing support, and visa sponsorship for you and your family.
</li>
<li>
Pioneering Work: Set the global standard for how defense organizations validate intelligent systems using world-leading models.
</li>
</ul>
</div>

Requirements

• Lead validation and verification strategies for AI transformation
• Define standards for non-deterministic AI systems
• Ensure LLMs and predictive engines meet reliability standards
• Build automated testing frameworks
• Evaluate AI outputs for hallucination, consistency, and accuracy
• Verify Retrieval-Augmented Generation pipelines
• Design regression suites for prompt drift
• Validate data consistency with enterprise systems (SAP S/4HANA, Ariba)

Nice to Have

• Experience with RAGAS framework
• Experience with AI testing for defense and enterprise sectors
• Familiarity with Bill of Materials (BOM) and financial data integrity

Responsibilities

• Architect automated frameworks for Generative AI outputs
• Implement automated metrics for RAG evaluation
• Design regression suites to monitor prompt drift
• Build tests to validate data consistency between AI agents and critical systems
• Design tests to validate latency and throughput for forecasting models
• Ensure AI agents pass Test Readiness Reviews (TRR) and Functional Configuration Audits (FCA)

Benefits Package

🏠 Housing

✈️ Flights

🏥 Medical

🎓 Education

🚗 Transport

💰 Gratuity

🎯 Bonus

📦 Relocation

Company

AI71

AI71 offers a platform for creating and deploying advanced AI models. It serves businesses and developers seeking to integrate sophisticated artificial intelligence into their products.
