Ref#: R0005977

Date published: 22-Feb-2017

Our mission.

As the world’s number 1 job site, our mission is to help people get jobs. We need talented, passionate people working together to make this happen. We are looking to grow our teams with people who share our energy and enthusiasm for creating the best experience for job seekers.

The team.

We are builders, we are integrators. Tech Services creates and optimizes solutions for a rapidly growing business on a global scale. We work with distributed infrastructure, petabytes of data, and billions of transactions with no limitations on your creativity. You don’t have to wait for some architect or manager to tell you what you can work on - you decide the priorities. With tech hubs in Seattle, San Francisco, Austin, Tokyo and Hyderabad, we are improving people's lives all around the world, one job at a time.

Your job.

Indeed is looking for a Lead to join our Site Reliability Engineering (Systems) team. Our team helps people get jobs by engineering resilient systems that are fast, fault-tolerant and scalable. These systems provide job search capabilities to over 200,000,000 job seekers who use Indeed websites every month.

The team is responsible for resiliency, performance and security of Indeed's global production infrastructure:

  • We provide guidance to the Software Engineering teams and drive best practices for Indeed's products, which span multiple data centers across five continents.

  • We are experts in core infrastructure technologies like load balancers, HTTPd, Puppet, Tomcat, Memcached, RabbitMQ, Elasticsearch, MongoDB, and more.

  • We realize that failure is inevitable, so we embrace it and plan for fast recovery, in order to deliver near 100% uptime.


  • Develop training and mentor teammates

  • Define standards for configuration, monitoring, reliability and performance

  • Serve as subject matter expert for multiple proprietary and open source technologies

  • Design and implement innovations that improve software engineering velocity, infrastructure resiliency/security, and data availability

  • Coordinate and perform major upgrades with minimal downtime

  • Provide expert perspective regarding the capabilities and limits of the multi-datacenter production infrastructure in software architecture designs

  • Influence Software Engineering leadership by motivating improvements to Indeed’s software systems & education

  • Solve live performance & stability issues in production and then prevent their recurrence

  • Participate in a follow-the-sun on-call rotation

About you.

As a Lead Site Reliability Engineer, you are growing a team of engineers in charge of the resiliency and performance of the production infrastructure. We expect that you have deep and broad technical knowledge and that you can still command the shell fluently. While you will gain the respect from the team with your knowledge and experience, you will keep the respect through continual motivation and coaching of the team. You will provide a logical vision and evoke your team’s greatest strengths to achieve that vision.

Minimum Qualifications

  • 5+ years of experience designing and managing services in a distributed, internet-scale Linux environment.

  • 4+ years of scripting experience in Shell, Perl, or Python.

  • 2+ years of experience managing a team of system administrators or infrastructure engineers.

  • 1+ years of experience with configuration management in Puppet, Chef, Ansible, or Salt.

  • BS degree in Computer Science or related technical field, or equivalent practical experience.

Preferred Qualifications

  • Ability to motivate technology and process innovations. You’re a systemic thinker and have a natural inclination for making things better.

  • Preference and skills to lead by example. You demonstrate care for your team’s well-being, growth and technical execution.

  • Vision for improving on the status quo. You’re a systemic thinker and have a natural inclination for making things better.

  • Extremely curious about how things work. You quickly comprehend how code, processes, and systems fit together.

  • Meticulous and cautious. You consider edge cases and risk mitigation strategies for every change.

  • Expert communicator. You can explain technical details one minute and career feedback the next.

  • Bring something new to the team. You bring a diverse viewpoint to the table which increases our successes.

  • Data-driven, results-oriented and adaptive during uncertainty. You will navigate the uncertainty of growth with a balanced focus towards results.

  • Demonstrated capability to advance multiple projects simultaneously.

Indeed provides a variety of benefits that help us focus on our mission of helping people get jobs.

View our bounty of perks: