Site Reliability Engineer

Remote - Mexico·remote·CorpEng (Sub Team)·engineering

<h2>Role Description</h2>
<p>As a Corporate Site Reliability Engineer (SRE) at Dropbox, you will help lead the infrastructure strategy and technical direction of one of the most innovative technology companies globally. Successful candidates will possess a growth mindset and strong accountability, and be passionate about designing, building, and securing scalable infrastructure services in a dynamic environment. You will drive improvement projects in automation and observability and handle incidents in a prompt but measured way.
In this role, you&#39;ll serve as a technical lead for programs related to monitoring, metrics, alerting, and reliability throughout the IT Services organization, and contribute to the evolution of our world-class infrastructure while ensuring the utmost security and scalability.</p>
<div>
<p>Our Engineering Career Framework is <a class="attrlink" href="https://dropbox.github.io/dbx-career-framework/" target="_blank" data-target-href="https://dropbox.github.io/dbx-career-framework/"><u>viewable by anyone outside the company</u></a> and describes what’s expected of our engineers at each of our career levels.
Check out our blog post on this topic and more <a class="attrlink" href="https://dropbox.tech/culture/sharing-our-engineering-career-framework-with-the-world" target="_blank" data-target-href="https://dropbox.tech/culture/sharing-our-engineering-career-framework-with-the-world">here</a>.</p>
</div>
<h2>Responsibilities</h2>
<ul>
<li>Ensure the reliability, scalability, and performance of Dropbox&#39;s infrastructure and services</li>
<li>Collaborate with cross-functional teams to develop and maintain best practices for monitoring, logging, and incident response</li>
<li>Build, implement, and maintain automation and infrastructure-as-code tooling, specifically Terraform, Ansible, and GitHub Actions, as well as custom code platforms</li>
<li>Utilize container orchestration platforms, such as Kubernetes, Amazon ECS, and Red Hat OpenShift, to manage containers at scale</li>
<li>Manage and optimize monitoring and logging pipelines using tools like Datadog and Cribl LogStream</li>
<li>Drive improvement projects related to service health and visibility for our stakeholders, ranging from developers to business service owners to C-level</li>
<li>Develop and maintain custom tooling and automation scripts in Bash, Python, and other scripting languages</li>
</ul>
<p>On-call work may be necessary occasionally to help address bugs, outages, or other operational issues, with the goal of maintaining a stable and high-quality experience for our customers.</p>
<h2>Requirements</h2>
<ul>
<li>5+ years of experience in site reliability engineering or a similar engineering role with hands-on coding experience</li>
<li>Strong knowledge of AWS services, including EC2, S3, RDS, Route 53, Lambda, and others</li>
<li>Strong knowledge of Linux administration, internals, filesystems, volume management, and specific distros such as Ubuntu and RHEL, as well as DNS and DHCP</li>
<li>Experience with monitoring and logging tools such as Datadog, and with logging pipeline tools such as Vector or Cribl LogStream</li>
<li>Experience driving one or more transformational programs related to metrics and observability</li>
<li>Experience with scripting in a higher-level language (Python preferred)</li>
<li>Experience developing automation to solve infrastructure-related tasks with tools such as Chef/Ansible/Terraform</li>
<li>Experience with log analysis and building metrics, alerts, and visuals from log data</li>
<li>Strong proficiency in infrastructure-as-code tools, such as Terraform</li>
<li>Strong proficiency in configuration management tools, specifically Ansible Automation Platform and Chef</li>
<li>Experience with containerization technologies, such as Docker, and container orchestration platforms like Kubernetes or Amazon ECS</li>
<li>Knowledge of LDAP, REST APIs, and current authentication methods</li>
<li>Familiarity with GitHub and Git-based workflows</li>
<li>Understanding of RDS databases and network security technologies, such as WAF</li>
<li>Strong problem-solving skills and the ability to work well in a fast-paced, collaborative environment</li>
<li>Excellent written and verbal communication skills</li>
</ul>
<h2>Preferred Qualifications</h2>
<ul>
<li>Experience managing large-scale multi-cloud or hybrid infrastructure.</li>
<li>Strong background in Infrastructure as Code (Terraform, Ansible) and GitOps workflows.</li>
<li>Familiarity with Kubernetes, Docker, and serverless platforms.</li>
<li>Proven track record improving observability, reliability, and incident response.</li>
<li>Understanding of compliance and security frameworks (SOC 2, ISO 27001, FedRAMP).</li>
<li>Experience implementing Zero Trust security and access models.</li>
</ul>
