TalentAQ

Senior Engineer, Observability & Real-Time Monitoring

Engineering | Full Time | 5+ years | Irvine, Alberta

Required Skills
12 skills

OpenTelemetry
AWS OpenSearch
QuickSight
Kinesis Data Streams
Kinesis Data Analytics (KDA)
Node.js
Python
TypeScript
GraphQL APIs
AWS AppSync
Jira
Confluence

Job Description

About the Role

We are seeking an experienced Observability Engineer to build and enhance real-time monitoring and logging capabilities, starting with the Surveys and Marketplace Catalog-air services. The engineer will implement OpenTelemetry-based observability, design and extend SDKs for consistent instrumentation, build streaming data pipelines into AWS-native tools, and create real-time health dashboards. This role is part of a high-visibility initiative to ensure application health, operational transparency, and proactive alerting for mission-critical, customer-facing systems.

Responsibilities

  • Instrument backend services using OpenTelemetry SDKs for logs, traces, and metrics.
  • Develop and extend observability SDKs/libraries for consistent instrumentation across services.
  • Integrate observability data pipelines with the FOCUS framework.
  • Configure and manage AWS OpenSearch, QuickSight, and Kinesis Data Streams/KDA.
  • Build and deploy QuickSight dashboards for service health monitoring.
  • Implement near real-time alerting and automated escalation mechanisms.
  • Extend monitoring to additional services (e.g., Catalog-air).
  • Define performance baselines and set up anomaly detection rules.
  • Collaborate with backend and DevOps teams to ensure secure and scalable observability pipelines.
  • Document runbooks, observability architecture, and onboarding guides.
