Do you like collaborating across teams to solve complex problems?

Do you enjoy solving large scale distributed content delivery challenges?

Join our critical Platform and Reliability Engineering Team!

Our team is responsible for defining, measuring, & optimizing the key performance indicators of delivery customers. Your expertise in software engineering and systems administration will be instrumental in building robust and resilient infrastructure.

As an SRE you will play a pivotal role in shaping the future of our products. You'll collaborate closely with the cross functional teams ensuring the reliability, scalability, and performance of our systems. You'll define key performance indicators (KPIs). Advance the state of monitoring, alerting and operational responses, and investigate complex performance issues.

As a Senior Site Reliability Engineer, you will be responsible for:

  • Working on Internet technologies to improve the performance, availability, and scalability of large distributed content delivery systems.
  • Engaging in collaborative efforts with cross-functional teams. defining and establishing measurable SLI' & SLO's.
  • Monitoring platform availability and performance, debug issues by leveraging data analysis skills and implement corrective actions to avoid recurrence.
  • Developing and implement automation solutions to improve operational efficiency and reduce toil.
  • Participating in design reviews and providing technical guidance to ensure designs meet requirements for scalability, performance, and robustness.
  • Staying current with the latest advancements in cloud computing, DevOps, and SRE best practices.

To be successful in this role you will:

  • Have relevant experience as an SRE or Dev Ops or Data Analysis and/Computer Networking/Software Development
  • Have proficiency in Scripting languages (Python, bash, JavaScript etc), SQL and working in a UNIX/Linux environment.
  • Experience with monitoring and alerting systems (e.g., Prometheus, Grafana, ADBMS, Datadog), including metric collection, alerting, dashboarding, and troubleshooting.
  • Be a self-starter and have drive for continuous learning and a commitment to operational excellence through tooling/automation.
  • Have the ability to work effectively with cross-functional teams and have good communication and collaboration skills.

FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know flexible working is important to many of the incredible people considering joining Akamai. FlexBase, gives 95% of employees the choice to work from their home, their office, or both (in the country advertised). This permanent workplace flexibility program is consistent and fair globally, to help us find incredible talent, virtually anywhere.

At Akamai, we will provide you with opportunities to grow, flourish, and achieve great things. Our benefit options are designed to meet your individual needs for today and in the future. We provide benefits surrounding all aspects of your life:

  • Your health
  • Your finances
  • Your family
  • Your time at work
  • Your time pursuing other endeavors

Akamai powers and protects life online. Leading companies worldwide choose Akamai to build, deliver, and secure their digital experiences helping billions of people live, work, and play every day. With the world's most distributed compute platform from cloud to edge we make it easy for customers to develop and run applications, while we keep experiences closer to users and threats farther away.

Are you seeking an opportunity to learn and make a real impact in a global technology company? Come join us and learn with a team of people who will energize and inspire you!

Akamai Technologies

Akamai Technologies