What are the responsibilities and job description for the Principle Kubernetes Platform Engineer position at LexisNexis?
About our Business
LexisNexis Legal & Professional, which serves customers in more than 150 countries with 11,300 employees worldwide, is part of RELX, a global provider of information-based analytics and decision tools for professional and business customers.
About the Team
We are forming a new team that will focus on observability engineering and core strategy for companywide adoption. This team will be responsible for designing the vision, standards, tools, and policies to drive resilience through observability. We are looking for people who are passionate about observability and resilience automation and can think creatively about the evolution of observability (think AI).
About the Role
This position will be highly influential in the overall design and evolution of our next generation observability strategy and will be a hands-on contributor to the development of the solution as well as a primary architect. This is an advanced professional level role for a Platform Engineer. Individuals may be responsible for one or more complex reliability and toil reduction projects. At this level, SREs operate as a subject matter expert in the discipline and will provide guidance to others including product and development teams to define and improve reliability within a product group. A Consulting/Principle SRE has a deeper understanding of system and application code and will make data-driven recommendations which balance customer, development, and operational needs. They are champions for shared services, platforms, and architectural standards. Individuals in this role train and/or mentor junior staff.
Responsibilities
Be a subject matter expert on all thing's observability – concepts, tooling, key metrics, logging, and log parsing
Define standards for logging and metrics adoption for all services and applications
Build and deploy alerts, monitors, and dashboards in our primary observability platform
Document standards, requirements, and architectural designs
Make recommendations for developments standards improvements to drive better observability
Educate, evangelize, and influence observability standards and best practices throughout their organization
Develop AWS code (Terraform, Python, etc.)
Build platform libraries for logging, metrics and alerts for adoption into Java, .NET and Python applications and services
Look for opportunities to drive improvement across the entire spectrum of development processes
Support observability on internal applications and tools that are critical to the overall deployment process
Requirements
AWS Certified Solutions Architect Professional Level certification
10+ years as a software development engineer working with large scale distributed systems
Experience with platform engineering standards for logging, alerting, monitoring, and automation
Deep experience with a leading observability tool like Datadog, Splunk, New Relic etc.
Expertise with AWS observability solutions like CloudWatch
Experience with open-source observability tools like OpenTelemetry, Prometheus, Grafana, ELK etc.
Demonstrated experience deploying and managing applications at scale on Kubernetes systems
Excellent written and verbal communication skills
Demonstrated ability as a cross functional collaborator and influencer
Work in a way that works for you
We promote a healthy work/life balance across the organization. We offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals. Working flexible hours - flexing the times when you work in the day to help you fit everything in and work when you are the most productive.
Working for you
We know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer:
- Health Benefits: Comprehensive, multi-carrier program for medical, dental and vision benefits
- Retirement Benefits: 401(k) with match and an Employee Share Purchase Plan - Wellbeing: Wellness platform with incentives, Headspace app subscription, Employee Assistance and Time-off Programs
- Short-and-Long Term Disability, Life and Accidental Death Insurance, Critical Illness, and Hospital Indemnity - Family Benefits, including bonding and family care leaves, adoption and surrogacy benefits
- Health Savings, Health Care, Dependent Care and Commuter Spending Accounts
- Up to two days of paid leave each to participate in Employee Resource Groups and to volunteer with your charity of choice
About the Business
LexisNexis Legal & Professional® provides legal, regulatory, and business information and analytics that help customers increase their productivity, improve decision-making, achieve better outcomes, and advance the rule of law around the world. As a digital pioneer, the company was the first to bring legal and business information online with its Lexis® and Nexis® services.