DevOps III

Synergis IT

Remote anywhere in the U.S. Must work PST hours

2 openings

Must haves:

• 3-5 years of experience

• Deep understanding of AWS services including EKS, ECS and MSK.

• Coding experience using a high-level programming language like Python

• Experience building and managing infrastructure in AWS using Terraform.

Nice to have:

• Experience with Kubernetes

• Understanding of Linux and system administration at large scale

Ad Platforms AWS SRE Contractor

Job Description Summary

The Site Reliability Engineer (SRE) position requires a mix of strategic engineering and design along with hands-on technical work. An ideal candidate will have experience building and managing infrastructure in AWS, and the coding skills to automate tasks and build tools that support our service operations. The SRE will configure, tune, and troubleshoot multi-tiered systems to achieve optimal application performance, stability, and availability. The SRE will work closely with software, infrastructure, and network engineers to deploy and maintain our services.

Key Qualifications

• Strong sense of ownership, customer service, and integrity demonstrated through clear communication.

• Deep understanding of Linux and system administration at large scale

• Deep understanding of AWS services including EKS, ECS and MSK

• Coding experience using a high-level programming language such as Python or Golang

• Experience building and managing infrastructure in AWS using Terraform.

• Experience running Docker-based workloads in production.

• Experience with Kubernetes is a plus, but not required

Description

The successful candidate will be highly self-motivated with a passion for excellence, quality and attention to detail. Responsibilities of the SRE include the following:

• Keep the lights on: on-call and alert handling.

• Manage new build-outs (additions and decommissions)

• Develop and maintain scripts used for environment monitoring and task automation (Python, Ansible, Puppet)

• Set up and manage monitoring tools such as Graphite, Prometheus, InfluxDB, and Grafana

• Set priorities and work efficiently in a fast-paced environment

• Measure and optimize system performance

• Demonstrate ability to deliver results on time with high quality

• Experience with Spinnaker is a plus.
