3 roles - Site Reliability Engineer (SRE)

Milwaukee, WI 53223

Posted: 08/03/2022 Industry: IT Job Number: 9021

Job Description

 
Staff Software Engineer – SRE position, you’ ll work with a team that focuses on critical operational aspects such as production monitoring, performance and capacity planning, deployments, incident management and automation. You play an important role to bridge development and operations to improve our data platforms observability, response plans, and continuous integration.

 

Competencies & Skills Needed:

  • PostgreSQL and MSSQL troubleshooting, performance optimization and tuning

  • Proficient in programming languages such as .NET, SQL, CI/CD

  • Intermediate skills at coding Python or similar scripting language

  • Strong analytical and problem-solving skills

  • Demonstrated written and verbal communication skills including a strong ability to communicate with engineering teams and leaders in lines of business, while driving incident response and post mortems

  • Conduct root cause analysis to identify causal factors, prepare and recommend solutions

?

What You’ ll Do and Impact: 

  • Develop, deploy and operate our systems built on AWS cloud services.

  • Ensure our systems are highly available, resilient, and compliant.

  • Continuously improve system observability, such as metrics, logging, tracing and alerting to see the full picture of performance and health of our systems.

  • Consult on and design infrastructure, system and application architecture.

  • Own production system monitoring, alerting, and performance.

  • Collaborate with the line of business engineering teams to establish SLOs for critical applications.

  • Troubleshoot issues and provide architectural insight back to development teams.

  • Participate in on-call rotation, drive incident resolution and improve system resilience. Current rotation is one week primary, one week secondary, five weeks off.

  • Triage database issues with standard remediation.

 

Experience:

  • Bachelor’ s degree in Computer Science, Engineering or equivalent experience is required

  • 5+ years of software engineering at a mid-size to large company

  • Experience working with monitoring and alerting tool like New Relic, PagerDuty

  • Experience with AWS cloud services

?

Additional Items of Interest:

  • Experience with SRE practices

Send an email reminder to:

Share This Job:

Related Jobs:

Login to save this search and get notified of similar positions.