
Job Opening: Site Reliability Engineer (AdTech)

1 opening | Posted on 06/07

About the job

We are looking for a strong Site Reliability Engineer to strengthen our team and take part in the development of a complex, in-demand AdTech project.

Customer

Our customer is Beeswax ( https://www.beeswax.com/about/ ), a rapidly growing US AdTech company. Founded by three former Google specialists, it has a highly technical team and an excellent technological culture. Beeswax provides extremely high-scale Bidder-as-a-Service solutions in advertising technology, works with global businesses, and has raised $28M (including its most recent Series B round of $15M). Sigma Software works with Beeswax to provide numerous key components of the platform and is looking for engineers to complement the Beeswax engineering team and drive further platform development.

Project

We're seeking a skilled Site Reliability Engineer to be responsible for the client's platform cloud infrastructure and observability solutions and to ensure all systems run smoothly. If you're passionate about complex tasks, optimizing systems, driving innovation, delivering the highest quality, and collaborating with top talent, this is the perfect opportunity.

The project is an easy-to-use, massive-scale, highly available demand-side platform. Backed by Amazon Web Services and Kubernetes, the team has embraced Infrastructure as Code to manage thousands of applications, servers, and containers running in multiple regions worldwide. Bring your expertise to our dynamic and forward-thinking environment!

Job Description

- Design and build infrastructure and tooling to provide high scalability, reliability, and sub-second performance levels using security industry best practices
- Write code and scripts to support Infrastructure as Code (IaC), configuration management, and automated incident resolution
- Maintain and extend the observability stack to capture and alert on any system issues
- Participate in on-call rotations and be an escalation contact for service incidents
- Write systems documentation, troubleshooting playbooks, and other instruction manuals
- Other duties and responsibilities as assigned

Qualifications

- Bachelor's or higher degree in computer science, computer engineering, or a relevant technical field, or equivalent practical experience
- At least 4 years of administration experience with Linux, AWS, and Kubernetes
- At least 4 years of experience in configuration management using CloudFormation, Terraform, and Ansible or similar
- At least 2 years of experience coding in Python
- Experience in designing, analyzing, and troubleshooting large-scale distributed systems
- Strong problem-solving skills
- Strong verbal and written communication skills
- At least an Upper-Intermediate level of English