Observability DevOps III

LivePerson is a transformational force in how brands and consumers communicate. With over 18,000 brands, including HSBC, Disney, Verizon, and Home Depot, we are on a mission to make life easier for people and brands everywhere through trusted Conversational AI. We believe in a future where conversations are the norm for getting your intentions fulfilled - whatever they are.

We are an innovative, intent-driven company that believes in building the future and we are looking for growth minded, unconventional thinkers, developers and builders to join the team.

About this Role: The Observability Platform team is building a state of the art system for logging, motoring, and tracing across cloud and on-prem data centers. We’re looking for an experienced Principal DevOps Lead to head our Logging and Monitoring, ensuring robust, scalable solutions within our Google Cloud Platform. In this role, you will be helping to bring systems to life that give superpowers to an entire organization of software developers. Managing a technology stack that includes Elastic Cloud, Loki, GrafanaLab, and the ELK stack for logging, as well as Zabbix, Captain Hook, and Anodot for anomaly detection and metrics scraping.

Responsibilities:

  • Lead the design, implementation, operation, and continuous improvement of LivePerson’s observability platforms across logs, metrics, traces, alerts, and synthetic monitoring.
  • Own large-scale observability pipelines processing massive volumes of telemetry data daily, including Filebeat, Kafka, Logstash, ElasticCloud, Prometheus, OpenTelemetry, Grafana Labs, Zabbix, Anodot, and related technologies.
  • Design, build, and optimize scalable Kubernetes-based observability services using Helm charts, CI/CD pipelines, GCP, GKE, Docker, and cloud-native best practices.
  • Define observability standards, dashboards, alerting frameworks, best practices, and onboarding materials for hundreds of engineering users.
  • Collaborate closely with DevOps, SRE, Engineering, NOC, Security, and vendor teams to deliver reliable, scalable, and actionable observability solutions.
  • Evaluate new observability technologies and guide the team in adopting modern practices around OpenTelemetry, distributed tracing, anomaly detection, and proactive monitoring.
  • Design and build the full end-to-end Synthetic Monitoring platform, running hundreds of daily synthetic tests on GCP Spot machines to support engineering teams with proactive service validation.

Benefits:

  • Health: medical, dental, and vision
  • Time away: 28 days paid holiday
  • Food vouchers
  • Development: Generous tuition reimbursement and access to internal professional development resources
  • Equal opportunity employer
  • #LI-Remote

Why you’ll love working here:

As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. And, we're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace.

Belonging at LivePerson:

We are proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.

We are committed to the accessibility needs of applicants and employees. We provide reasonable accommodations to job applicants with physical or mental disabilities. Applicants with a disability who require reasonable accommodation for any part of the application or hiring process should inform their recruiting contact upon initial connection.

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or related work experience.
  • 5 years of experience as a software engineer or DevOps engineer, with experience in application development and cloud engineering.
  • Proficient in Kubernetes and containerization technologies (Docker, etc.)
  • Extensive experience with observability tools such as GrafanaLab, CaptainHook, Zabbix, FluentD, ELK, Kafka, and Prometheus.
  • Familiarity with infrastructure as code (IaC) tools like Terraform, Ansible, or CloudFormation.
  • Experience with cloud platforms (AWS, Azure, GCP) and their services related to computing, storage, and networking.
  • Strong programming skills in one or more languages (JavaScript, Java, Go, etc.).
  • The ideal candidate will have experience with OpenTelemetry Collector and Grafana Agent.
LivePerson

LivePerson