Shape the future of trust in the age of AI
At Oscilar, we're building the most advanced AI Risk Decisioning™ Platform. Banks, fintechs, and digitally native organizations rely on us to manage their fraud, credit, and compliance risk with the power of AI. If you're passionate about solving complex problems and making the internet safer for everyone, this is your place.
About the Role
Oscilar is growing fast, and so is the complexity of our systems. We’re looking for an experienced SRE to take ownership of reliability across our multi-region, cloud-native platform. You’ll have the mandate and autonomy to design, implement, and evolve systems that stay performant and resilient through traffic spikes, dependency failures, and global deployments. You’ll shape how we scale, how we build observability, and how we run infrastructure that supports billions of events and large-scale data pipelines.
What You’ll Own
* Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes).
* Lead initiatives to improve availability, latency, and performance at scale.
* Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability.
* Define the metrics, alerts, and runbooks that form our observability backbone.
* Run chaos experiments and failure simulations to harden the platform.
* Mentor engineers and set best practices for SRE across the company.