Remote

SRE Team Lead

Serokell is a fast-growing, fully-remote software development company that works on innovative solutions for complex problems. We’re packed full with adventurous, curious, and inclusive folks that want to build cutting-edge software with modern technologies like functional programming.

We’re seeking a Head of the Site Reliability Engineering department that will work together with other members of the Serokell team to build and manage infrastructure for our and our clients’ software projects.

Responsibilities:

  • Supervise the team of engineers that develops and maintains our IT infrastructure: assign actionable tasks, review their work, and provide technical expertise.
  • Manage plans and goals for the department: turn high-level business requests into tasks, provide time estimates, and schedule the tasks.
  • Make the department work in a preventive manner rather than a reactive/responsive one.
  • Participate in the hiring process: review test task submissions, interview candidates, and write interview reports.
  • Help your team members grow professionally: provide regular feedback and performance reviews.
  • Develop in-house SRE products that boost our productivity and, ideally, can be sold to other companies.

Requirements:

  • Have experience with the Nix ecosystem (nixpkgs, NixOS, haskell.nix, etc.).
  • Have experience managing a team of engineers.
  • Feel at home using Linux.
  • Understand a thing or two about networks (IP addresses and routing, TCP, UDP).
  • Have some understanding of cloud computing and Terraform.
  • Be knowledgeable about monitoring best practices.
  • Have experience in setting up incident reporting tools and mitigation protocols.
  • Know how to use standard dev tools (like Git, Bash, GPG, etc.).
  • Be able to read (and ideally write) some Haskell or Rust code.
  • Know how to operate various CI/CD platforms.

As part of your job, you will:

  • Spend quite a lot of time reading and writing Nix expressions.
  • Spend some time maintaining and improving our NixOS-based, declarative infrastructure.
  • Spend some time doing supervision and resolving management issues of the SRE team.
  • Review work done by your colleagues.
  • Maintain and propel DevOps culture within the company.
  • Help developers learn the tools needed for infrastructure-as-code approach.
  • Improve our alerting systems and incident mitigation procedures.
  • Monitor and respond to security risks.
  • Contribute to our open-source tooling and libraries.
  • Some of the tools that you will encounter at Serokell are Nginx, Postgres, Docker, Prometheus, Grafana, Terraform, Vault, Borgbackup.

Why Serokell could be your dream job:

  • Chance to use Nix and Haskell while working on interesting projects in fintech, blockchain, and more.
  • Possibilities to work on helpful content (articles, videos) for the Nix community.
  • Conference and meetup budget.
  • Open-source culture.

By joining Serokell, you’ll become a part of a warm community of ingenious engineers, computer scientists, designers, marketing experts, and other inventive people. You will be able to take part in ambitious projects that work on building the next generation of computer software, try out crazy initiatives unhindered by bureaucracy or judgment, and have fun in our Slack channels that range from #music to #butthurt (did we mention the huge custom emoji set?).

All applicants go through a test task to test their skill sets.