Lead Developer-Site Reliability Engineer(SRE) - Hyderabad - Advance Auto Parts

Date de Publication: 6/7/2021

Résumé de l'offre

  • Type de contrat:
    Employé à plein temps
  • Type de poste:
  • Date de Publication:

Description de l'offre

Job Description

At AAP, we’re passionate about building software that solves problems. We count on our site reliability engineers (SREs) to empower our users with a rich feature set, high availability, and stellar performance level to pursue their missions. As we expand our customer deployments, we are currently seeking an experienced SRE to keep our B2C website and app running at high performance and availability. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

Objectives of this Role

  • Leverage analytics from monitoring systems, take a holistic view of system health and troubleshoot any patterns showing performance or availability degradation

  • Be curious and have an eye towards detail for constantly identifying improvement opportunities in system performance

  • Partner with QA Performance Team to execute performance tests and provide solutions to solution architects and engineering team on improving site and app performance

  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve

  • Participate and perform Root Cause Analysis on Major Incidents, address the issue and drive for closure

  • Review all the services and build a resilience plan

Daily and Monthly Responsibilities

  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding

  • Partner with development teams to improve services through rigorous testing and release procedures

  • Participate in system design consulting, platform management, and capacity planning

  • Create sustainable systems and services through automation and uplifts

  • Balance feature development speed and reliability with well-defined service level objectives

Required Skills and Qualifications

  • Bachelor’s degree in computer science or other highly technical, scientific discipline

  • Ability to program (structured and OO) with one or more high level languages, such as React, Typescript and  Java

  • Deep knowledge in New Relic monitoring solutions (APM, Browser side metric capture and logging)

  • Deep knowledge in AWS architecture

  • Experience and knowledge in API architecture

  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks

Preferred Qualifications

  • Previous success in technical engineering

  • Coding experience beyond simple scripts