TalentAQ

TalentAQ

DevOps/MLOps Engineer

EngineeringFull TimeAbu dhabi

Required Skills
14 skills

Jenkins
GitHub Actions
Terraform
Kubernetes
Docker
Bash
Python
MLflow
Azure ML
Azure
AWS
GCP
Prometheus
Grafana

Job Description

<p>Looking for a DevOps/MLOps Engineer to build and manage scalable, automated infrastructure for our LLM- powered GenAI platform. You’ll enable fast iteration and reliable deployment of models and services through robust CI/CD pipelines, container orchestration, and ML lifecycle tooling.</p><h3>Key Responsibilities:</h3><ul><li>Design and maintain CI/CD pipelines using Jenkins, GitHub Actions, or similar.</li><li>Automate infrastructure provisioning using Terraform and manage services with Kubernetes.</li><li>Write and maintain Bash/Python scripts for automation and operational tooling.</li><li>Implement and monitor MLOps workflows using tools like MLflow, Azure ML, or similar.</li><li>Support deployment and monitoring of LLM-based models and APIs in production.</li></ul><h3>Required Skills:</h3><ul><li>Hands-on experience with Jenkins, GitHub Actions, or equivalent CI/CD tools.</li><li>Proficiency with Terraform, Kubernetes, Docker, and cloud-native practices.</li><li>Strong scripting skills in Bash and Python.</li><li>Experience with ML model tracking, versioning, and deployment using MLflow or similar.</li><li>Familiarity with cloud platforms (e.g., Azure, AWS, or GCP).</li></ul><h3>Nice to Have:</h3><ul><li>Exposure to LLM/GenAI deployment workflows.</li><li>Experience with model performance monitoring and observability tools (Prometheus, Grafana, etc.).</li><li>Security and cost optimization best practices for ML infrastructure.</li></ul>

Looking for a DevOps/MLOps Engineer to build and manage scalable, automated infrastructure for our LLM- powered GenAI platform. You’ll enable fast iteration and reliable deployment of models and services through robust CI/CD pipelines, container orchestration, and ML lifecycle tooling.

Key Responsibilities:

  • Design and maintain CI/CD pipelines using Jenkins, GitHub Actions, or similar.
  • Automate infrastructure provisioning using Terraform and manage services with Kubernetes.
  • Write and maintain Bash/Python scripts for automation and operational tooling.
  • Implement and monitor MLOps workflows using tools like MLflow, Azure ML, or similar.
  • Support deployment and monitoring of LLM-based models and APIs in production.

Required Skills:

  • Hands-on experience with Jenkins, GitHub Actions, or equivalent CI/CD tools.
  • Proficiency with Terraform, Kubernetes, Docker, and cloud-native practices.
  • Strong scripting skills in Bash and Python.
  • Experience with ML model tracking, versioning, and deployment using MLflow or similar.
  • Familiarity with cloud platforms (e.g., Azure, AWS, or GCP).

Nice to Have:

  • Exposure to LLM/GenAI deployment workflows.
  • Experience with model performance monitoring and observability tools (Prometheus, Grafana, etc.).
  • Security and cost optimization best practices for ML infrastructure.

Similar Jobs

10000 jobs available

TVK Technologies
Engineering
Angular
DevOps
Jenkins
+11 more
Engineering10+ years
CI/CD
Cloud Infrastructure
Kubernetes
+9 more
Engineering4-15 years
DevOps
SRE
Kubernetes
+17 more
EngineeringFull-time1 years
DNS
AWS
S3
+13 more
Engineering4-6 yearsRemote
AWS
Azure
GCP
+13 more
AIBC SOLUTIONS LLC
Information Technology
devops
devscops
AWSCloud
+8 more