Site Reliability Engineer (SRE)
Swan is the leading education-focused, Bitcoin-only onramp for retail customers, high-net-worth individuals, and corporations looking to save in Bitcoin for the long term. We hire passionate Bitcoiners who want to work with a self-motivated and fully distributed startup team.
In this position, you will work closely with our development team, the CTO, and cloud/infra engineers to develop and operate a robust and scalable platform to support Swan’s business lines. You’ll cover a wide range of activities, from day-to-day operations, error monitoring, and proactive communication to engineering, bug fixes, and database analysis to improve query performance. While this position is focused on operational expertise, experience with and a desire to build software and systems will always be encouraged.
Skills and experience that will help you succeed:
- Experience with Datadog or a similar tool, including setting up monitors, alerting systems, anomaly management, and forecasting. A desire to drive a proactive approach to scalability.
- Intermediate to advanced understanding of Postgres, including experience running databases at scale, tuning parameters, and optimizing SQL queries, with knowledge of AWS RDS in particular.
- Excellent understanding of HA architectures built in AWS.
- At least mid-level knowledge of DNS, SSL, AWS networking, Docker, and ECS.
- Working knowledge of cloud security principles and familiarity with the AWS Well-Architected Framework.
- Cool under pressure, able to manage incidents involving multiple systems, communicate effectively internally and externally using tools like StatusPage and PagerDuty, marshal resources, and get things resolved, including writing blameless postmortems.
- Comfortable taking (very occasional) pager alerts during working hours and sometimes on weekends (we generally try to avoid nighttime pager alerts, as we have staff in Europe and can split pager duty across time zones). You will not be the only on-call staff member, but you will be in charge of primary incident response and leadership, as well as training other developers in response and mitigation.
Here's a bit about our culture:
- We’re a growing team: We are fully distributed across the world, so Slack and video conferencing are huge here.
- We’re very flat: Leadership is desired and encouraged; we hire people who care about the product they are working on.
- We’re Bitcoiners: We find solutions that encourage Bitcoin principles. Many of us pull double duty alongside our main job, producing content for Bitcoin newsletters, podcasts, social audio platforms, and YouTube shows, and spend some of the day on Twitter educating the masses. We love Bitcoin, and it comes through in our daily chats, meetings, and actions.
Join us, become a Swan!