
Site Reliability Engineer
Overview
Zapier is hiring a Site Reliability Engineer to help strengthen Zapier’s reliability posture. The role involves designing and operating cloud systems, strengthening observability and incident response practices, and building resilient services that support high-traffic workloads.
Job Description
Zapier is a platform that helps millions of businesses globally scale with automation and AI. The company is committed to making automation work for everyone by delivering products that delight customers. The Site Reliability Engineer will work on Zapier’s Internal Platform, providing engineers with a reliable foundation for building, shipping, and operating software.
Responsibilities
- - Design and manage AWS infrastructure with Terraform and Helm
- - Strengthen observability and incident response practices
- - Build resilient services that support high-traffic workloads
- - Partner with engineering teams to solve infrastructure challenges
- - Execute critical migrations and integrations between systems
- - Apply site reliability engineering practices consistently
- - Share knowledge through documentation and communication
- - Identify and recommend new tools or approaches
- - Explore and apply AI tools to optimize workflows
- - Contribute to business-hours on-call support
Required Skills
- - At least 4 years of experience in cloud engineering, systems administration, or a related field
- - Experience with cloud platforms such as AWS, GCP, or Azure
- - Proficiency in at least one programming language such as Python, Go, or similar
- - Experience with automation tools
- - Effective communication skills
- - Alignment with Zapier’s values
- - AI fluency or willingness to learn
Benefits
- - Offers Equity
- - Offers Bonus
- - Remote work flexibility
- - Competitive and equitable compensation practices
About the company
Zapier puts the power of automation in your hands—no coding required. Take your workflows to the next level with our suite of automation tools.
All Job Openings at Zapier