TalentAQ

TalentAQ

ML Opns Lead Engineer

EngineeringContract9+ years

Required Skills
18 skills

Azure
AWS
Azure OpenAI
Bedrock
Anthropic Claude
OpenAI API
Terraform
ARM
Bicep
CloudFormation
AWS CDK
Docker
Kubernetes
Grafana
Prometheus
App Insights
Log Analytics
Azure Monitor

Job Description

<ul><li>✨ Design, deploy, and manage cloud-based infrastructures leveraging Azure and AWS services.</li><li>✨ Implement and optimize AI/ML pipelines, including training, deployment, retraining, monitoring, and drift detection.</li><li>✨ Work with Generative AI platforms (Azure OpenAI, Bedrock, Anthropic Claude, OpenAI API, LlamaCloud, LangChain).</li><li>✨ Apply AI red teaming, prompt security scans, jailbreak risk mitigation, and token usage optimization.</li><li>✨ Set up and maintain CI/CD pipelines using Azure DevOps or AWS CodePipeline.</li><li>✨ Implement infrastructure-as-code (IaC) using Terraform, ARM/Bicep, CloudFormation, AWS CDK.</li><li>✨ Build and manage containerized applications using Docker and Kubernetes.</li><li>✨ Design and configure networking, DNS, VPNs, VNets, and load balancing.</li><li>✨ Apply advanced security, IAM, RBAC, Azure Policies, AWS SCPs, audit logging.</li><li>✨ Support database services across OLTP/OLAP including Cosmos DB, DynamoDB, RDS, Aurora, Redshift, SQL.</li><li>✨ Integrate observability tools such as Grafana, Prometheus, App Insights, Log Analytics, Azure Monitor.</li><li>✨ Work with stakeholders to ensure business-aligned AI/ML deployments.</li><li>✨ Conduct unit testing and integration testing as part of CI/CD.</li></ul>
  • ✨ Design, deploy, and manage cloud-based infrastructures leveraging Azure and AWS services.
  • ✨ Implement and optimize AI/ML pipelines, including training, deployment, retraining, monitoring, and drift detection.
  • ✨ Work with Generative AI platforms (Azure OpenAI, Bedrock, Anthropic Claude, OpenAI API, LlamaCloud, LangChain).
  • ✨ Apply AI red teaming, prompt security scans, jailbreak risk mitigation, and token usage optimization.
  • ✨ Set up and maintain CI/CD pipelines using Azure DevOps or AWS CodePipeline.
  • ✨ Implement infrastructure-as-code (IaC) using Terraform, ARM/Bicep, CloudFormation, AWS CDK.
  • ✨ Build and manage containerized applications using Docker and Kubernetes.
  • ✨ Design and configure networking, DNS, VPNs, VNets, and load balancing.
  • ✨ Apply advanced security, IAM, RBAC, Azure Policies, AWS SCPs, audit logging.
  • ✨ Support database services across OLTP/OLAP including Cosmos DB, DynamoDB, RDS, Aurora, Redshift, SQL.
  • ✨ Integrate observability tools such as Grafana, Prometheus, App Insights, Log Analytics, Azure Monitor.
  • ✨ Work with stakeholders to ensure business-aligned AI/ML deployments.
  • ✨ Conduct unit testing and integration testing as part of CI/CD.

Similar Jobs

10000 jobs available

Engineering4-6 yearsRemote
AWS
Azure
GCP
+13 more
Engineering4-6 yearsRemote
AWS
Azure
GCP
+13 more
EngineeringContract5-10 yearsRemote
Azure
AWS
Infrastructure as Code
+10 more
AI/ML11+ yearsRemote
Generative AI
LLMs
GPT
+34 more
AI/ML11+ yearsRemote
Generative AI
LLMs
GPT
+34 more
EngineeringFull Time
Ansible
Chef
Puppet
+15 more