
Linguistic Data Analyst / AI Linguistic Specialist- Diplomatic Domain (STT AI)
What's Your Score?
- See the score Master-Works's ATS gives your resume
- Get AI-rewritten bullet points
- Download Gulf-ready CV
60 seconds. $3.99 one-time.
Role Purpose
The Linguistic Data Analyst is responsible for collecting, analyzing, organizing, and cleaning multilingual conversational data, with a strong focus on diplomatic and formal terminology, to prepare high-quality datasets for training Speech-to-Text (STT) AI models.
This role is critical to ensuring linguistic accuracy, terminology consistency, and data readiness for AI model development, particularly in government, diplomatic, and formal communication domains.
Key Responsibilities
Linguistic Data Collection:
- Collect and curate conversational audio and text data (meetings, interviews, speeches).
- Work with multilingual datasets, primarily Arabic and English.
- Ensure compliance with privacy and data governance standards.
Data Cleaning & Structuring:
- Clean datasets by removing noise, duplication, and inconsistencies.
- Normalize formal and semi-formal language usage.
- Organize data by speaker, context, and formality.
Linguistic & Terminology Analysis:
- Extract and standardize diplomatic and official terminology.
- Build and maintain a diplomatic glossary.
AI Training Data Preparation:
- Prepare AI-ready datasets with timestamps and metadata.
- Support annotation teams with linguistic guidelines.
Collaboration & Documentation:
- Work with AI Engineers, Data Scientists and PMs.
- Document standards and methodologies.
Requirements
Required Qualifications
Education:
- Bachelorās degree in Linguistics, Translation, Arabic/English Studies, or related field.
Core Skills:
- Strong linguistic analysis skills.
- Experience with conversational or textual datasets.
- High attention to detail.
Technical Skills (Preferred):
- Familiarity with STT and NLP concepts.
- Experience with data annotation workflows.
Languages:
- Arabic: Fluent (mandatory)
- English: Fluent (mandatory)
- Additional languages are a plus.
Requirements
- ā¢Bachelorās degree in Linguistics, Translation, Arabic/English Studies, or related field
- ā¢Strong linguistic analysis skills
- ā¢Experience with conversational or textual datasets
- ā¢High attention to detail
- ā¢Fluent Arabic (mandatory)
- ā¢Fluent English (mandatory)
Nice to Have
- ā¢Familiarity with STT and NLP concepts
- ā¢Experience with data annotation workflows
- ā¢Additional languages are a plus
Responsibilities
- ā¢Collect and curate conversational audio and text data
- ā¢Work with multilingual datasets (Arabic and English)
- ā¢Clean datasets by removing noise, duplication, and inconsistencies
- ā¢Normalize formal and semi-formal language usage
- ā¢Organize data by speaker, context, and formality
- ā¢Extract and standardize diplomatic and official terminology
- ā¢Prepare AI-ready datasets with timestamps and metadata
- ā¢Document standards and methodologies
Related Jobs
- See what Master-Works's hiring system sees in your CV
- Get AI-rewritten bullet points
- Download Gulf-ready CV
60 seconds. $3.99 one-time.



