Job type full-time
Full job description
Summary
We exist to help people achieve financial clarityAt thrivent, we believe money is a tool, not a goalDriven by a higher purpose at our core, we are committed to providing financial advice, investments, insurance, banking and generosity programs to help people make the most of all they’ve been given.
At our heart, we are a membership-owned fraternal organization, as well as a holistic financial services organization, dedicated to serving the unique needs of our clientsWe focus on their goals and priorities, guiding them toward financial choices that will help them live the life they want today—and tomorrow.
At thrivent, we are focused on a digital transformation that will deliver modern, innovative experiences for our clients, financial advisors, and employeesWe are investing in data and technology, using devops practices, and building an engineering culture of empowered technical expertsOur technologists are involved in work that includes cloud native development, digital architecture and integration, automation, cloud data platforms, artificial intelligence, and machine learning as well as maximizing platforms such as salesforce, aws and microsoft.
Systems reliability engineering manager is responsible for evolving reliability engineering organization with necessary tools, standards, and practices to support our rapidly growing digital applications and classic application capabilitiesAlso responsible for building an instrumentation function to enable engineering organization with right dashboards and reports to effectively capture their critical success factors (csf’s) through measuring key performance indicators (kpis)Primary goal of keeping the digital experience reliable and help engineering and product teams to deliver features seamlessly into production
Job description
Job duties and responsibilities
Lead a team of system reliability engineering (sre) engineers and manage the team`s work.
Help product teams set service-level objectives (slo’s) and track service-level indicators (sli’s) for key digital products and infrastructure.
Build necessary tooling, instrument right metrics and reports to help proactively detect issues and resolve them before impacting our client experiences.
Leads partnerships with counterparts across various it product teams throughout product development and implementations.
Implement metrics driven processes to ensure service quality targets are met
Oversees the resolution and/or after-action reviews of complex or high impact system issues to drive preventive measures from similar issues happening again.
Drive capacity planning, performance analysis, instrumentation, and other non-functional system requirements.
Through dashboards, orchestrate availability, latency, traffic, errors, saturation, and efficiency parameters in thrivent it product development by instilling engineering reliability into our development life cycle with a focus on fault tolerant approaches.
Required job qualifications
Bachelor’s degree in business, mathematics, computer science, or equivalent work experience
Track record of building and managing high-performance systems reliability engineering (sre) teams
Demonstrated experience of improving service reliability and efficiency
Experience with systems reliability engineering goals, processes, and culture
Demonstrated experience leading people, including people change management
Prior successful experience as a systems performance or site/systems reliability engineer.
2 to 3 years of experience working with one of the apm tools like datadog, elasticsearch, dynatrace, and appdynamics etc.
2 to 3 years of experience working with visualization tools for apm dashboards like bi reporting, grafana etc.
This role can sit 100% remote (must be able to work between the hours of 8 am – 4 pm cst)
Thrivent provides equal employment opportunity (eeo) without regard to race, religion, color, sex , gender identity, sexual orientation, pregnancy, national origin, age, disability, marital status, citizenship status, military or veteran status, genetic information, or any other status protected by applicable local, state , or federal lawThis policy applies to all employees and job applicants.
Thrivent is committed to providing reasonable accommodation to individuals with disabilitiesIf you need a reasonable accommodation , please let us know by sending an email to [email protected] or call 800-847-4836 and request human resources.