[Remote] Staff Production Operations Engineer

100% Remote Full-time Open now

Note: This is a remote position open to candidates in the USA. Redpanda Data is pioneering the Agentic Data Plane (ADP), AI infrastructure that connects AI agents with enterprise data. The company is seeking a Staff Production Operations Engineer to enhance its reliability operations program, working with various teams to improve operational excellence and automate processes using AI agents.

Responsibilities

  • Drive process improvements across the incident lifecycle: severity models, triage enforcement, alert noise reduction, and follow-up completion rates
  • Coordinate the on-call program across multiple geographies: manage schedules and shadow rotations, onboard new engineers, and ensure consistent coverage
  • Select incidents for blameless post-incident review, facilitate the reviews, document findings, and track follow-up completion. Contribute to addressing incident follow-ups where possible, either by fixing issues directly or by prototyping solutions
  • Build AI agents to automate operational toil, including on-call automation, incident summarization, post-incident review prep, follow-up tracking, and on-call analytics
  • Maintain runbooks, playbooks, and incident process documentation, and keep them current as processes evolve

Skills

  • 5+ years of experience in site reliability engineering, DevOps, or production operations in large-scale, highly reliable environments
  • A track record of leading initiatives end-to-end, from design and planning to execution and production operation
  • Hands-on experience with incident management tooling (incident.io, PagerDuty, or similar) and observability stacks (Datadog, Grafana, Sentry, CloudWatch, or equivalent)
  • Strong fluency with reliability concepts: MTTD, MTTR, MTTA, error budgets, SLOs
  • Experience building automation and tooling to reduce operational toil
  • Proficiency in Go (or comparable systems language with willingness to ramp)
  • Experience with AI-assisted software development workflows including tools like Claude Code
  • Working knowledge of at least one of AWS / Azure / GCP, including infrastructure as code for system and network infrastructure
  • Strong written communication; ability to drive alignment across engineering teams without direct authority
  • Hands-on experience building agents or automations using LLMs
  • Familiarity with Redpanda, Apache Kafka, or other streaming infrastructure
  • Prior experience in a fast-growing B2B infrastructure or developer tools company

Benefits

  • Join Redpanda if you'd enjoy being part of a fast-moving, diverse, people-first organization with team members around the globe and a culture based on trust, transparency, communication, and kindness.
  • You'll dive into a nimble, high-impact team with the latest AI tools and the budget to actually use them.

Company Overview

  • Redpanda is pioneering the agentic data plane — AI infrastructure to connect agents with enterprise data and systems simply and securely. It was founded in 2019, and is headquartered in San Francisco, California, USA, with a workforce of 51-200 employees. Its website is https://redpanda.com.

Company H1B Sponsorship

  • Redpanda Data has a track record of offering H1B sponsorships, with 1 in 2025, 2 in 2023, and 2 in 2022. Please note that this does not guarantee sponsorship for this specific role.