TalentAQ

DevOps/MLOps Engineer

Engineering, Full Time, Abu Dhabi

Required Skills
14 skills

Jenkins
GitHub Actions
Terraform
Kubernetes
Docker
Bash
Python
MLflow
Azure ML
Azure
AWS
GCP
Prometheus
Grafana

Job Description

Looking for a DevOps/MLOps Engineer to build and manage scalable, automated infrastructure for our LLM-powered GenAI platform. You’ll enable fast iteration and reliable deployment of models and services through robust CI/CD pipelines, container orchestration, and ML lifecycle tooling.

Key Responsibilities:

  • Design and maintain CI/CD pipelines using Jenkins, GitHub Actions, or similar.
  • Automate infrastructure provisioning using Terraform and manage services with Kubernetes.
  • Write and maintain Bash/Python scripts for automation and operational tooling.
  • Implement and monitor MLOps workflows using tools like MLflow, Azure ML, or similar.
  • Support deployment and monitoring of LLM-based models and APIs in production.

Required Skills:

  • Hands-on experience with Jenkins, GitHub Actions, or equivalent CI/CD tools.
  • Proficiency with Terraform, Kubernetes, Docker, and cloud-native practices.
  • Strong scripting skills in Bash and Python.
  • Experience with ML model tracking, versioning, and deployment using MLflow or similar.
  • Familiarity with cloud platforms (e.g., Azure, AWS, or GCP).

Nice to Have:

  • Exposure to LLM/GenAI deployment workflows.
  • Experience with model performance monitoring and observability tools (Prometheus, Grafana, etc.).
  • Knowledge of security and cost-optimization best practices for ML infrastructure.
