# Senior AI Engineer

> Makro PRO · Nawamin Road, Thailand (Hybrid) · Full-time · Posted 2026-07-02

**Workplace:** hybrid

**Department:** Technology

## Description

The Senior AI Engineer owns the enterprise LLM platform substrate that powers every generative-AI consumer across the organisation. This role designs, builds, and operates the LLM Gateway, the evaluation framework, and the AI best-practices playbook that engineers, business teams, and product squads depend on to ship trustworthy AI features at scale.

### **Key Responsibilities**

-   Design, build, and operate the enterprise LLM Gateway — provider abstraction, authentication, rate limiting, per-use-case cost tracking, prompt logging for audit, and model routing across major providers (Azure OpenAI, AWS Bedrock, Anthropic, Vertex AI).
-   Operationalise an evaluation framework (Langfuse or equivalent) — tracing, eval scores, human feedback loops — as a platform service consumed by every production GenAI consumer.
-   Define and enforce evaluation rubrics (accuracy, groundedness, hallucination rate, latency, cost per inference) and embed regression gates in CI so no GenAI consumer ships without a working eval harness.
-   Ship and operate the GenAI cost dashboard with per-use-case attribution and quarterly forecasts to leadership; drive cost-optimisation initiatives (caching, prompt compression, model routing).
-   Partner with Data Governance on AI-model governance evidence (audit-log schema, PII redaction proofs, model routing controls) to support legal and compliance approvals for GenAI at scale (Thailand PDPA, regional AI frameworks).
-   Partner with platform engineering on Vector Search and embedding model selection, retrieval relevance tuning, chunking strategies, and reranker layers.
-   Author and publish the AI Best Practices Playbook — the standard engineering teams across the company use when shipping LLM features; mentor and review.
-   Own GenAI platform service-level objectives — availability, latency, cost ceilings — and lead incident response for production GenAI consumers.

## Requirements

-   Bachelor's or Master's degree in Computer Science, AI/ML, Data Science, or a related discipline.
-   6+ years of software engineering with at least 2+ years shipping production LLM platform systems (gateways, evaluations, cost metering, multi-model routing).
-   Strong Python production-service development (FastAPI, async, observability, tests).
-   Hands-on production experience with at least one major LLM provider (Azure OpenAI, AWS Bedrock, Anthropic, Vertex AI), including cost and latency optimisation.
-   Eval-driven LLM development discipline — golden sets, LLM-as-judge, regression gates in CI, multi-step conversation replay.
-   Solid grounding in prompt-injection defence, data-leakage prevention, and PII handling for LLM systems.
-   Cloud production experience (Azure preferred; AWS/GCP transferable) and Git-based CI/CD; comfortable owning service contracts and SLOs.
-   Excellent written and verbal communication; able to author technical design documents and influence engineering peers and business stakeholders.

**Preferred Qualifications**

-   Experience with LLM gateways (Portkey, LiteLLM, Kong AI) or having built one in-house.
-   Fine-tuning experience (LoRA / QLoRA) and open-weight model deployment (vLLM, TGI).
-   Thai-language NLP exposure (PyThaiNLP, WangchanBERTa, SEA-LION, Typhoon) and retail / commerce data context (POS, catalog, CRM, loyalty).
-   Vendor or industry certifications such as Databricks Generative AI Engineer, Azure AI Engineer Associate, or comparable.

## Apply

[Apply at Makro PRO](https://apply.workable.com/joinmakropro/j/80CF89BC1F/apply)

---
Powered by [Workable](https://www.workable.com)
