AI Systems Architect - LLM & Vector Infrastructure
Wait — Check First
- Check if your CV is ATS-ready for Star Company
- Get AI-rewritten bullet points
- Download Gulf-ready CV
60 seconds. $3.99 one-time.
We are seeking a senior AI Systems Architect to design and implement AI-native application cores where Large Language Models (LLMs), vector databases, retrieval systems, and agent frameworks form the primary computational layer of our web and mobile applications.
This role is responsible for architecting scalable AI pipelines, retrieval-augmented generation (RAG) systems, memory architectures, AI agents, and orchestration workflows integrated with our development stack (Web, Mobile, n8n automation, and AI services).
The ideal candidate understands that AI is not a feature, it is the operating system of the product.
Key Responsibilities
1. AI Core Architecture Design
• Design AI-first system architecture for web and mobile applications
• Architect RAG pipelines using vector databases
• Define long-term memory, short-term memory, and contextual state systems
• Implement multi-agent AI systems
• Design AI orchestration layers2. Vector Database & Embedding Systems
• Select and implement vector databases such as:
• Pinecone
• Weaviate
• Qdrant
• Milvus
• Supabase (pgvector)
• Optimize embedding strategies
• Implement hybrid search (semantic + keyword)
• Design scalable indexing pipelines3. LLM Integration & Optimization
• Work with models such as:
• OpenAI APIs
• Anthropic
• Meta (LLaMA)
• DeepSeek
• Alibaba (Qwen)
• Implement structured output pipelines
• Design evaluation and prompt testing frameworks
• Optimize cost-performance ratio4. AI Agent Systems & Orchestration
• Build autonomous AI agents
• Design tool-calling systems
• Integrate with:
• n8n
• LangGraph / LangChain style agent flows
• Implement memory-aware agents5. Production AI Engineering
• Build monitoring systems for hallucination detection
• Design guardrails and validation layers
• Implement evaluation datasets and benchmarking
• Ensure security of AI pipelines
• Build scalable infrastructure (Docker, Kubernetes, GPU optimization)Requirements
Technical Expertise
• 5+ years software engineering experience
• 2+ years building production AI systems
• Deep knowledge of:
• Vector embeddings & similarity search
• RAG architectures
• Tokenization and context window optimization
• Fine-tuning & LoRA concepts
• Prompt evaluation frameworks
• Experience with Python (mandatory)
• Experience with FastAPI / backend services
• Experience designing scalable APIsArchitecture Experience
• Designing distributed systems
• Microservices & event-driven architecture
• Experience with PostgreSQL + pgvector
• Experience deploying LLM systems in production
Requirements
- •5+ years software engineering experience
- •2+ years building production AI systems
- •Deep knowledge of Vector embeddings & similarity search
- •Deep knowledge of RAG architectures
- •Deep knowledge of Tokenization & context window optimization
- •Experience with Python (mandatory)
- •Experience with FastAPI / backend services
- •Experience designing scalable APIs
Nice to Have
- •Experience with n8n
- •Experience with LangGraph / LangChain
- •Microservices event-driven architecture
- •Experience with PostgreSQL + pgvector
- •GPU optimization
Responsibilities
- •Design AI-first system architecture
- •Architect RAG pipelines using vector databases
- •Implement multi-agent AI systems
- •Design AI orchestration layers
- •Select and implement vector databases (Pinecone, Weaviate, Qdrant, Milvus, Supabase)
- •Optimize embedding strategies
- •Work with LLMs (OpenAI, Anthropic, Meta, DeepSeek, Alibaba)
- •Build monitoring systems for hallucination detection
Related Jobs
- Check if Star Company will actually see your resume
- Get AI-rewritten bullet points
- Download Gulf-ready CV
60 seconds. $3.99 one-time.