Staff Site Reliability Engineer
Staff Site Reliability Engineer
- 1 Vacancy
- 3 Views
Offer Salary
Sign in to view salary
For Freelance
No
Job Description
Join the Tilt team At Tilt (formerly Empower), we see a side of people that traditional lenders miss. Our mobile-first products and machine learning-powered credit models look beyond outdated credit s...
At Tilt (formerly Empower), we see a side of people that traditional lenders miss. Our mobile-first products and machine learning-powered credit models look beyond outdated credit scores, using over 250 real-time financial signals to recognize real potential. Named among the next billion-dollar startups, we're not just changing how people access financial products - we're creating a new credit system that backs the working, whatever they're working toward.
The Opportunity: Staff Site Reliability Engineer -At Tilt, we're on a mission to empower people to achieve more with less friction. Our platform helps customers get unstuck by simplifying complexity - and the same principle drives how we build and operate our own systems. We don't just keep the lights on; we design infrastructure that scales effortlessly, adapts intelligently, and makes everyone's job easier.
We're looking for a Staff Site Reliability Engineer who thrives at the intersection of infrastructure, automation, and AI-first thinking. Someone who doesn't just "manage infrastructure" but asks: how can we make this self-healing, faster, cheaper, and more delightful? If that sounds like your kind of puzzle, you'll feel right at home here.
Tilt is a remote-first company. We drive connectivity through regular company offsites. Travel for company offsites is expected at a minimum 2 times a year.
How you'll make an impactDesign & Infrastructure: Design, build and maintain scalable Azure cloud infrastructure (compute, networking, storage, etc.) using Infrastructure-as-Code (Terraform templates). Ensure the environment is well-architected for reliability, scalability, and security.
Automation & CI/CD: Develop and manage robust CI/CD pipelines (e.g. Azure DevOps/GitHub Actions) to automate deployments of infrastructure and applications. Integrate Terraform deployments smoothly into the delivery pipeline to enable continuous improvement.
AI-First Investigation: Leverage AI-driven approaches in investigation and solution planning to speed up delivery, reduce manual effort, and drive more effective implementations.
Monitoring & Observability: Implement comprehensive monitoring, logging and alerting (Azure Monitor/Log Analytics, etc.) to track health, performance and SLOs/SLAs. Continuously optimise the observability stack for cost efficiency, fast incident detection and resolution.
Incident Management: Lead incident response and root-cause analysis for system outages or performance issues. Conduct blameless post-mortems to identify improvements and prevent recurrences. Maintain run-books and on-call processes to ensure rapid recovery.
Collaboration & Best Practices: Work closely with engineering, QA, technical leadership and security teams to design solutions, share expertise, and enforce best practices (secure configurations, secret management, networking policies, etc.). Drive infrastructure enhancements, cost optimisations, and reliability improvements independently.
We're looking for people who chase excellence and impact. Those who stand behind their work, celebrating the wins and learning from the missteps equally. We foster an environment where every voice is valued and mutual respect is non-negotiable - brilliant jerks need not apply. We're in this together, working to expand access to fair credit and prove that people are incredible. When you join us, it's not just another day at the virtual office, you're helping millions of hardworking people reach better financial futures.
You're pushing ahead in your career? We can get behind that. Join us in building the credit system that people deserve.
- Share this job: