Skip to content

Lead Cloud Site Reliability Engineer

At Northwestern Mutual, we are strong, innovative and growing. We invest in our people. We care and make a positive difference.

Northwestern Mutual is looking for a talented engineer to join our growing Cloud Platform team. The Cloud Platform team develops, maintains, loves, and appreciates all things cloud and containers.

We enable hundreds of developers to harness the power of Kubernetes and the cloud to deploy world class apps and infrastructure on their own 1000s of times a day. This is a fun, fast-paced experience that exposes you to cutting-edge cloud technologies and encourages personal learning and development.

What's the role?

As a Site Reliability Engineer, you will be responsible for leading efforts to implement stability, and observability improvements to our Kubernetes container and Cloud platforms. Key to the role will be your ability to mentor and educate other engineers and establish strong relationships with application development customers. You will be focused on SLI development, Automation, TOIL elimination, incident response, root cause analysis and monitoring enhancements. You should have the aptitude and enthusiasm for building and servicing highly distributed, scalable, and mission-critical systems. You should have a passion for automation and creating self-service mechanisms for customers.

Experience/Skills:

  • Bachelor’s Degree or equivalent experience
  • 6+ years experience with networking, Linux based platforms, and modern programming and scripting languages (Python, Go, JavaScript)
  • 6+ years experience performance tuning and operations of application stacks, OSs, DBs, etc.
  • 3+ years experience with AWS Cloud Services (AWS Certified Preferred), and containerized applications and container orchestration (Docker, Kubernetes – CKA Preferred)
  • 3+ years experience in DevOps or SRE roles
  • Strong experience with monitoring and performance management/tuning of systems
  • Strong experience with Prometheus, Dynatrace, New Relic, or other APM solutions with a focus on observability and alerting.
  • Experience with Infrastructure-as-Code frameworks (Terraform, CloudFormation)
  • Experience working with DevOps, CICD, GitOps, Agile methodologies.
  • Experience with CI/CD pipelines and automation and how to apply it with services such as Gitlab CI, Jenkins, CodePipeline, or Circle CI.
  • Strong written and verbal communication skills.
  • Problem solver who enjoys learning on the job and thinking outside of the box.

It's understandable if you do not meet every one of these criteria! Everyone has a different background and you will be working with a diverse team that can help grow your career.

This job is not covered by the existing Collective Bargaining Agreement.

Required Certifications:

Grow your career with a best-in-class company that puts our client’s interests at the center of all we do. Get started now! 

We are an equal opportunity/affirmative action employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender identity or expression, sexual orientation, national origin, disability, age or status as a protected veteran, or any other characteristic protected by law.

If you work in Colorado or work remotely, please click here for information pertaining to compensation and benefits.


FIND YOUR FUTURE

We’re excited about the potential people bring to Northwestern Mutual. You can grow your career here while enjoying first-class perks, benefits, and commitment to diversity and inclusion.