The Role
You’ll join the infrastructure team behind one of Israel’s largest cloud-native security platforms — handling the systems that keep threat data flowing, services available, and deployments reliable at a scale most engineers won’t encounter. The core challenge: CI/CD and cloud infra that doesn’t just work, but holds up under the pressure of a global security product.
About the Product
The platform is a large-scale cloud-native cybersecurity system — one of the 10 largest of its kind in Israel by infrastructure footprint. It processes security events and threat intelligence for enterprise customers globally, where reliability and observability aren’t optional features — they’re the product. You’re not maintaining internal tooling: you’re building the backbone of something that’s actively in use by security teams around the world.
The Stack: The infrastructure runs on Linux with Docker and Kubernetes as the container layer, fully deployed on AWS. Observability is built around Prometheus, Grafana, and OpenTelemetry with centralized logging — the kind of stack that’s been chosen deliberately, not inherited by accident. Python handles automation throughout. It’s a mature, high-scale environment where the tooling choices reflect the operational reality of running a global security product.
What You’ll Be Doing
- Build and maintain CI/CD pipelines for both development and production environments across a high-traffic, multi-service AWS setup
- Own container orchestration — designing, deploying, and operating services on Kubernetes at real production scale
- Write and maintain automation scripts in Python to reduce toil and improve pipeline reliability
- Instrument observability across the stack — Prometheus, Grafana, OpenTelemetry, and centralized logging as core tools, not afterthoughts
- Evaluate and introduce new technologies through structured proof-of-concepts and cost analysis
- Maintain and harden the Linux-based infrastructure, including networking and container runtime concerns
- Collaborate across an international team, with English as the working language
What We Expect
Must-have
- 2+ years of hands-on DevOps experience building CI/CD pipelines for development and production
- 2+ years working with AWS across high-traffic, multi-service environments
- Production experience with Docker and Kubernetes (or ECS)
- Solid Linux fundamentals — networking, containers, scripting
- Python scripting experience applied to infrastructure automation
- Hands-on experience with observability tooling: Prometheus, Grafana, OpenTelemetry, centralized logging
- Bachelor’s degree in Computer Science or related field
Nice to have
- Experience in security-adjacent infrastructure or cybersecurity products
- Background in SaaS or product companies with real production load
- Knowledge of security best practices in cloud infrastructure design
- Experience running formal PoCs with cost and performance analysis
Why This Role Is Worth Your Time
- You’re working on infrastructure that serves one of the largest cloud-native security systems in Israel — the scale is genuine, not a pitch
- Fully remote role, international team — English is your working language day-to-day
- The domain is active cybersecurity, which means the problems are real, the stakes are high, and the technical challenges don’t get stale
- The tech stack is current, well-scoped, and gives you room to introduce and validate new tools through structured evaluation — your judgment on technology decisions carries weight here