Principal Software Engineer, Docker Agents (London)
At Docker, we make app development easier so developers can focus on what matters. Our remote-first team spans the globe, united by a passion for innovation and great developer experiences. With over 20 million monthly users and 20 billion image pulls, Docker is the #1 tool for building, sharing, and running apps—trusted by startups and Fortune 100s alike. We’re growing fast and just getting started. Come join us for a whale of a ride!
We are looking for a Principal Software Engineer (Docker Agents) to join Docker’s AI engineering team to build the future of containerized AI agents. Docker containers are the perfect vehicle to host and run AI agents—providing isolation, portability, and reproducibility. You’ll be working on cagent, our open-source project (https://github.com/docker/cagent), and expanding on it to enable developers to build, deploy, and scale intelligent agents using Docker’s container technology.
This is a greenfield opportunity to shape how developers leverage containers for AI agents at massive scale. You’ll define the technical vision, lead architecture decisions, and partner with engineers and leaders across Docker to bring containerized agent capabilities into Docker’s developer experience.
Please note: this role is only available to candidates currently located in London (or reasonable commuting distance to London), we are unable to make exceptions to this location requirement.
Responsibilities
- Technical Leadership & Architecture: Define and drive the long-term technical strategy for Docker’s containerized agent platform, including core primitives, APIs, and extensibility patterns
- Build Containerized Agent Systems: Design and implement systems that leverage Docker containers as the ideal runtime for AI agents, ensuring isolation, scalability, and portability
- Expand cagent: Maintain and evolve the open-source cagent project, adding new capabilities for containerized agent deployment, orchestration, and lifecycle management
- Agent Runtime Development: Build robust infrastructure for packaging, deploying, and managing agents in containers across local and cloud environments
- Evaluation & Testing: Define evaluation frameworks to measure agent quality, reliability, and production readiness; plus the deployment effectiveness of containerized runtimes
- Reliability & Operability: Establish standards for observability, performance, and operational excellence; lead critical production decision-making and incident learnings as needed
- Rapid Prototyping: Iterate quickly on new agent capabilities and deployment patterns, moving from concept to production efficiently
- Open Source Community: Engage with the cagent community, review contributions, and help grow the ecosystem
- Cross-functional Collaboration: Lead cross-functional technical discussions and influence architectural decisions across Docker’s AI initiatives (including sister teams and platform efforts)
- Mentorship & Enablement: Mentor senior engineers, raise the bar through design reviews, and accelerate team execution through clear technical direction and coaching
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience
- Take part in on-call rotation for your team; respond to incidents, debug production issues, and drive continuous improvement of system reliability