
Job Title: Principal Application Engineer (SRE)

Company Name: Discover Financial Services
Salary: USD 48.00 - USD 82.00
Job Industry: Computer & Network Security
Job Type: Full time
WorkPlace Type: remote
Location: Alaska, United States
Job Description:

Discover. A brighter future.

 

With us, you’ll do meaningful work from Day 1. Our collaborative culture is built on three core behaviors: We Play to Win, We Get Better Every Day & We Succeed Together. And we mean it — we want you to grow and make a difference at one of the world's leading digital banking and payments companies. We value what makes you unique so that you have an opportunity to shine.

 

Come build your future, while being the reason millions of people find a brighter financial future with Discover.

 

At Discover, be part of a culture where diversity, teamwork and collaboration reign. Join a company that is just as focused on its employees as it is on its customers and is consistently awarded for both. We’re all about people, and our employees are why Discover is a great place to work. Be the reason we help millions of consumers build a brighter financial future and achieve yours along the way with a rewarding career.

 

As a Principal Application Reliability Engineer, you’ll tap into your passion for finding and fixing inefficiencies to solve our reliability and performance issues. In our Agile environment, you’ll focus on availability, latency, performance, efficiency, change and problem management, monitoring, emergency response and capacity planning of our services. Your projects will deliver enhanced infrastructure, development, and deployment automation at Discover. As part of your day-to-day role, you’ll actively manage risk and customer-impacting issues and escalate them to management.

 

Roles And Responsibilities

  • Consult with teams and provide hands-on training in observability, incident management and reliability best practices.
    • Includes defining SLOs/SLAs/SLIs, on-call support behaviors, troubleshooting, building support playbooks, implementing monitoring and alerting, logging standards, conducting fragility and performance testing, etc.
    • Review product journeys and reliability practices at regular intervals to enforce best practices.
    • Periodically pair/mob program with the teams to help build reliability thinking.
  • Lead failure point discussions, chaos testing and family-level capacity management.
  • Take responsibility for family-level application reliability and resiliency.
  • Leverage metrics and scorecards to drive site reliability adoption in the product areas.
  • Ensure delivery teams in the product family track and meet annual operational goals (MTTR reduction, incident reduction, platform availability, SLO/SLA targets).
  • Ensure automated delivery for all family-level products.
  • Ensure the proper level of documentation exists.
  • Drive SRE community discussions and share wins and failures with the Discover SRE community of practice.

 

Minimum Qualifications

 

At a minimum, here’s what we need from you:

 

  • Bachelor's degree in Computer Science or a related field
  • 6+ years of experience in Information Technology, (Software) Engineering, or a related field
  • Internal applicants only: technical proficiency rating of proficient on the Dreyfus engineering scale

 

Preferred Qualifications

 

Bonus Points If You Have:

 

  • In-depth knowledge of the application development landscape: React, Java, REST APIs, design patterns and CI/CD.
  • Broad understanding of application security, infrastructure components and databases.
  • Ability to think about systems: edge cases, failure modes, behaviors, specific implementations.
  • Strong knowledge of Git, Docker, Kubernetes, and Jenkins.
  • Knowledge of configuration management systems such as Chef and Ansible.
  • Strong programming skills with Shell or Java.
  • Previous experience working with Linux/Unix servers.
  • Extensive experience with monitoring and observability tools and technologies, including but not limited to Instana, Grafana/Kibana, Datadog, AppDynamics, and cloud observability tools for AWS.
  • Experience creating standardized monitoring dashboards in cloud platforms, including AWS/OCP, for proactive monitoring of application and infrastructure health.
  • In-depth knowledge of non-functional requirements (NFRs), including pressure/chaos testing, performance testing and penetration testing.
  • Knowledge of reliability best practices in cloud-native environments.
  • Knowledge of operational readiness strategies and best practices.

 

External applicants will be required to complete a technical interview.

 


Compensation:

 

The base pay for this position generally ranges from $101,500.00 to $171,500.00. Additional incentives may be provided as part of a market-competitive total compensation package. Factors such as, but not limited to, geographical location, relevant experience, education, and skill level may impact the pay for this position.

 

Benefits:

 

We also offer a range of benefits and programs based on eligibility. These benefits include:

 

  • Paid Parental Leave
  • Paid Time Off
  • 401(k) Plan
  • Medical, Dental, Vision, & Health Savings Account
  • STD, Life, LTD and AD&D
  • Recognition Program
  • Education Assistance
  • Commuter Benefits
  • Family Support Programs
  • Employee Stock Purchase Plan

 

Learn more at MyDiscoverBenefits.com.

 

What are you waiting for? Apply today!

 

All Discover employees place our customers at the very center of our work. To deliver on our promises to our customers, each of us contributes every day to a culture that values compliance and risk management.

 

Discover is committed to a diverse and inclusive workplace. Discover is an equal opportunity employer and does not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status, or other legally protected status. (Know Your Rights)

 
