
Python
Kubernetes
AWS
Site Reliability Engineer
Overview
PostHog is hiring a Site Reliability Engineer to automate and maintain the infrastructure for their ClickHouse cluster.
Job Description
PostHog is equipping every developer to build successful products by providing a suite of products to analyze, test, observe, and deploy new features. As an open-source project from Y Combinator's W20 cohort, PostHog has seen significant growth and success.
Responsibilities
- - Automate the provisioning of metal resources (on AWS) for our cluster
- - Automate dynamic provisioning of instances, utilizing Terraform, Ansible, and K8s
- - Enhance visibility into cluster status
- - Conduct performance investigations and experiments using the latest hardware the hyperscalers have to offer.
Required Skills
- - Proficiency in Python, Kubernetes, and AWS
- - Experience building and operating high-scale complex data storage solutions
- - Strong interest and experience in ClickHouse (or similar OLAP databases) internals and query performance optimization
- - Ability to thrive in a culture of autonomy and self-direction
- - Experience with Terraform and Ansible for infrastructure automation.
Benefits
- - Generous, transparent compensation & equity
- - Unlimited vacation (with a minimum!)
- - Two meeting-free days per week
- - Home office Coworking credit
- - Private health, dental, and vision insurance
- - Training budget
- - Access to our Hedge House
- - Carbon offsetting
- - Pension & 401k contributions
- - Company offsites
About the company
PostHog started as open source product analytics. We've grown into a product & data toolkit, used by 70,000+ teams. We launched on Hacker News with our MVP – just 4 weeks after we started writing code. The response was overwhelmingly positive. We had over 300 deployments in a couple of days. 2 weeks later, we'd gone past 1,500 stars on GitHub.
All Job Openings at PostHog