Stack Exchange

your communities

Job Description

We're looking for an exceptionally talented engineer to help manage our growing infrastructure, ensuring our site stays up and performs well, and refining our processes for operating our production systems. Working closely with the rest of our engineering team, you'll have a great deal of authority in designing and implementing the hardware and software systems we use to host, manage and monitor our production environment.

Thumbtack's infrastructure has always been managed by our small team and a single SRE, and while there haven't been any major disasters, we recognize it's time to take our operations to the next level. Our Python deploys could be much smoother, our monitoring could be more systematic and accessible, our alerting could be much less noisy. We are actively moving our infrastructure from dedicated hardware to the AWS cloud to improve development speed and make our platform more scalable.

We're looking for someone to work with our nascent engineering operations team and push us forward. As an authority on operations, you'll help plan and execute how we manage and monitor our platform as it grows. You'll continually look for new ways to make our systems more reliable and easier to manage, incorporating third-party tools when available and writing software of your own when nothing else fits the bill. You'll anticipate performance bottlenecks and provision new hardware as necessary. And finally, we'd love to find someone who's excited to learn and grow, expanding skills and expertise as our systems continue to grow and develop.

Our current infrastructure:

  • Our platform operates primarily on a few dozen dedicated Linux machines on RHEL, Ubuntu, and Debian, all managed via Puppet; we additionally run a small number of machines and services on AWS
  • Our main data stores are Postgres (website backend) and Mongo (internal analytics); we also make use of DynamoDB, Riak, and Memcached
  • We use DataDog, New Relic, Munin, Graphite and a handful of custom tools for monitoring and alerting
  • We practice continuous deployment using a custom one-click deployment system written in Python (Fabric). Auxiliary systems are deployed directly via Puppet.

Skills & Requirements

  • Expert with Linux administration, security and configuration management
  • Deep knowledge of the steps involved in serving a web request, including a strong understand of TCP/IP, and experience dealing with the corresponding infrastructure components
  • Fanatic about monitoring
  • Enjoy diagnosing and fixing misbehaving and underperforming Linux servers
  • Fluent with the shell and comfortable writing tools in Python to automate our operations and development processes
  • Experience with AWS is a plus
  • Experience tuning database performance is a plus
  • Comfortable working with a great deal of autonomy
  • Excited to continually learn, grow and share knowledge

About Thumbtack

Thumbtack helps you accomplish the personal projects that are central to your life. Whether you need to paint your home, learn a new language, or plan your daughter's birthday party, Thumbtack is the easiest and most dependable way to hire the right professional for your projects. Just tell us what you need, and we'll introduce you to several qualified professionals. Then, compare and hire the pro that's right for you. It's that easy.

Joel Test score: 10 out of 12

The Joel Test is a twelve-question measure of the quality of a software team.

  • Do you use source control?
  • Can you make a build in one step?
  • Do you make daily builds?
  • Do you have a bug database?
  • Do you fix bugs before writing new code?
  • Do you have an up-to-date schedule?
  • Do you have a spec?
  • Do programmers have quiet working conditions?
  • Do you use the best tools money can buy?
  • Do you have testers?
  • Do new candidates write code during their interview?
  • Do you do hallway usability testing?

Learn more about Thumbtack

We have great benefits

Competitive compensation, including meaningful equityMonthly $150 allowance to spend hiring service pros on ThumbtackFull health, dental and vision insurance for you and your familyBeautiful SF location - close to public transportationFlexible working hours and as much paid vacation as you need$1500 learning credit each year for books, courses and conferencesFresh lunch and dinner every day cooked by our in-house chefGreat equipment - huge monitors and work environment of your choice

Visit the Thumbtack company page

view all job listings view all Thumbtack job listings

Site Reliability Engineer at Thumbtack - Python