Back to Job Search

Software Reliability Engineer

Posted about 1 month ago

  • Expiry Date: 20 January 2022

​​​​​​​​​​​​​​​​​​​Job Description:

  • Our client is looking for a problem solver. Someone who takes initiative and will establish their new site reliability engineering position as a top performing role . This role will keep the ITmaking sure the platforms and services they rely on are available when they want to use them. This person is responsible for the availability, performance, efficiency, change management, monitoring, emergency response, capacity planning, back-up, and disaster recovery vital to keep their technical ecosystem reliable. This is a new position and an outstanding opportunity to be a key pillar of clients' futures. The IT team is seeking a highly motivated individual with a background in Software Reliability Engineering (SRE). This position will be based in Cupertino, CA.

Key Qualifications

  • 3 years in a DevOps or SRE role.

  • Firm grasp of at least one modern programming language: Go, Python, Bash, PHP, etc.

  • Experience with cloud computing services: AWS or Google Cloud Platform.

  • Hands-on experience with infrastructure tooling, for example: Terraform, Cloud Formation, Ansible.

  • Experience with container tooling, one of: Kubernetes, Mesosphere, Docker, or Amazon ECS.

  • Experience with distributed storage technologies like NFS, HDFS, Ceph, S3.

  • Basic understanding of Atlassian products and particularly for Jira and Confluence.


  • The SRE professional will be involved in new projects, starting from concept to crafting the infrastructure, toolset, and processes needed to deliver it, coordinating their implementation, monitoring the performance of a working system, and adjusting it when necessary. This role also involves creating training materials and training our staff to follow new guidelines and procedures. This person will need to have a detailed problem-solving approach, coupled with a strong sense of ownership and drive.

Responsibilities Will Include:

  • Taking a holistic view of system health to provide primary operational support for multiple distributed software applications and infrastructure layers. Handling incidents within the ACWN technical ecosystem.

  • Automating manual tasks such as the provisioning of users in production and test environments.

  • Collaborating with our service desk, vendors and engineering to get ahead of customer needs and innovate to continually improve.

  • Participating in the evaluation of infrastructure tools.

  • Working with AC Wellness engineering and vendors to ensure delivery of the non-functional requirements of availability, performance, security, compliance and maintainability.

  • Develop tools that improve production monitoring, telemetry, visualization, alerting, observability, workflows and reporting.

  • Define new designs, architectures, standards and methods for our healthcare ecosystems systems.

  • Engage in service capacity planning and demand forecasting.

  • Participate in the on call rotation.

Education & Experience

  • Bachelor’s degree in software engineering, computer science, computer engineering, or related technical field.

Additional Requirements

  • A proactive approach to spotting problems, performance, and process bottlenecks and areas for improvement.

  • Excellent troubleshooting skills: ability to quickly recognize patterns in failures.

  • Self-motivated to improve efficiency and uptime of the entire technical ecosystem.

  • Strong written and verbal communication skills.

  • Experience with Application Performance Management (APM) solutions is a plus.

  • Familiarity of JavaScript extensions like: Node JS, React is preferred.

  • Our client is an Equal Opportunity Employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or other legally protected characteristics. 

About ASK:ASK Consulting is an award-winning technology and professional services recruiting firm servicing Fortune 500 organizations nationally. With 5 nationwide offices, two global delivery centers, and employees in 42 states-ASK Consulting connects people with amazing opportunities

ASK Consulting is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all associates.