We’re hiring a Senior Backend Software Engineer (Python) ready to take ownership of complex systems and help scale AI-powered media tools used by industry giants. If you’re driven by impact, enjoy tackling distributed infrastructure challenges, and want to contribute to a high-performance product at the forefront of speech technology — this is your chance to build something that’s redefining global content localization.
Responsibilities
- Design and develop scalable, high-performance backend services and APIs
- Lead infrastructure implementation using Python and modern frameworks
- Build CI/CD pipelines and infrastructure-as-code across cloud platforms
- Integrate ML models into production-ready backend systems
- Develop large-scale data pipelines for processing audio, video, and text
- Own features end-to-end: from architecture to deployment and monitoring
- Ensure observability, resilience, and service availability
Requirements
- 7+ years in backend software development
- Deep Python expertise with experience in modern frameworks
- Proven cloud DevOps experience (AWS, GCP, Azure) and automation skills
- Solid grasp of distributed systems, microservices, and event-driven architecture
- Experience with Docker, Kubernetes (EKS/ECS), and serverless tech
- Familiar with SQL and NoSQL databases
- Comfortable working in a fast-moving, startup-like environment
Will be a plus
- Knowledge of ML lifecycle and MLOps practices
What we offer
- Competitive salary and benefits package
- Medical insurance
- Top equipment kit
- Full Remote
- Collaborative and innovative work environment
- Career growth and development opportunities
- A chance to work with a talented and driven team of professional
About the project
Join a deep-tech company transforming the global media landscape with a proprietary AI-driven localization platform. The solution leverages cutting-edge advancements in speech synthesis, emotional voice cloning, and neural machine translation to deliver hyperrealistic multilingual dubbing at scale. Developers work on a robust pipeline combining real-time audio processing, GPU-accelerated inference, and intelligent lip-sync algorithms to preserve the original actors’ performance across languages. The product is used by major studios and streaming platforms to localize premium video content faster, cheaper, and with unprecedented authenticity.