Principal Site Reliability Engineer

Description

Cloud Service Engineer – Job Description

Shutterfly is looking for an experienced cloud service engineer to work with
cross-functional teams of application and systems engineers who are
establishing shared infrastructure that all of Shutterfly will use, as
well as the governance around best practices and standards for the cloud.
Shutterfly treats its cloud infrastructure as a software product itself, using
Terraform and Packer to establish and update infrastructure, while automating
unit and functional testing of it, and using modern CI/CD pipelines to deploy
it. This approach is more software engineering than traditional systems/DevOps
engineering.

As a member of this service infrastructure team, the Cloud Service Engineer
shares in the ownership of, and responsibility for, enabling engineering
activities and efficiencies within the AWS cloud. This role blends
innovation in a sophisticated cloud ecosystem with sustained compliance, while
supporting a high-traffic, highly visible platform with large volumes of data,
third-party integrations, and challenging scalability and performance
considerations.

This candidate possesses a strong aptitude for learning new technologies,
embracing and driving complex, highly visible, and challenging projects to
completion. This role requires seasoned engineering and communication
capabilities. This individual must exhibit strong collaboration across
multiple cross-functional teams and a rich set of problem-solving skills. This
candidate should be self-motivated, have a passion for quality, and have the
ability to communicate clearly out and up to the executive level.

Key Responsibilities

* Responsible for designing, building, and supporting advanced cloud automation workflows and tools, written mostly in Terraform, used to facilitate the automated setup, configuration and deployment of workloads in the cloud
* Design and build automated workflows/tools to enforce policies, best practices and cost controls
* Modify and improve existing systems
* Consult and collaborate with engineering and development teams to optimize the provisioning of their infrastructure
* Contribute principles and policies to form best practices, standards and guidelines for how to be successful in Shutterfly’s cloud environment
* Adhere to process and security standards when building infrastructure in AWS
* Consult with product and engineering teams to assist their education, onboarding and ongoing performance
* Ability to understand and influence:
  * Whether a given infrastructure is correctly provisioned or over/under-provisioned
  * Whether something can be optimized further or is already optimized
  * Work with service owners to understand the criticality of the service and what its ideal needs are for cloud operation
* Share in responsibilities for on-call support of 24/7/365 operations in the cloud
* Share in responsibilities for monitoring infrastructure and educating application teams on monitoring best practices

Qualifications

* Minimum 4 years of experience in any combination of software engineering roles: DevOps, applications, services, automation, etc.
* Minimum 3 years of experience with DevOps practices and automation tooling such as Chef, Puppet or Ansible
* Experience designing, developing and operating software applications/services in the AWS ecosystem
* Experience writing code with one or more interpreted languages such as: Python, PHP, Perl, Ruby, Linux Shell
* 1+ years of experience with IaC technologies such as CloudFormation or Terraform. Terraform Associate Certification preferred
* Experience with standard software development best practices and tools such as code repositories (Git preferred)
* 2+ years of experience with agile software development methodologies such as Scrum, Extreme Programming, Kanban
* Must be very comfortable in a Linux Bash shell environment
* Familiarity with pervasive AWS architecture guidelines. AWS Associate Architect Certification preferred
* Thorough understanding of pricing/cost models across AWS services, especially compute, storage, and database offerings
* Must be able to multitask and work well with changing priorities in a fast-paced, 24×7 environment
* Experience with public cloud migrations/transformations
* Ability to learn new technologies quickly and the willingness to do so
* Must be highly collaborative and able to work in a team environment with technical and business associates
* Excellent communication, problem-solving and customer service skills, with the ability to translate technical detail into non-technical information
* A strong ability to learn and adapt to new technologies
* Strong ability to coordinate activities in an organized and highly visible manner
* Education: Bachelor’s degree in computer science, science, engineering or workforce equivalent. Technical certifications preferred

Job number: 8716