September 6
🏡 Remote – Anywhere in California
Ansible
AWS
Bash
Cloud
Grafana
JavaScript
Kubernetes
Prometheus
Python
SDLC
Shell Scripting
Terraform
TypeScript
Go
• Architect and implement cloud infrastructure solutions, ensuring they meet the needs of scalability, reliability, and security. • Develop and maintain automated CI/CD pipelines, enabling smooth and rapid software deployment. • Lead and manage incident response efforts, ensuring that issues are resolved quickly and that root cause analysis is performed to prevent future occurrences. • Implement and manage monitoring tools and performance tuning practices to ensure optimal infrastructure performance and cost-efficiency. • Work closely with software engineers, product, and other stakeholders to integrate infrastructure solutions seamlessly into the overall software development lifecycle. • Develop and maintain tools to automate routine tasks, improve infrastructure efficiency, and reduce manual intervention. • Ensure all infrastructure meets security and compliance requirements, implementing best practices and keeping up-to-date with industry standards. • Create and maintain comprehensive documentation for infrastructure designs, processes, and incident response actions.
• Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience. • 7+ years of experience in cloud infrastructure engineering, with a strong background in incident management. • Extensive experience with cloud platforms such as AWS and Google Cloud. • Deep understanding of Kubernetes cluster management, debugging and troubleshooting. • Excellent communication and collaboration skills. • Background in software engineering with a solid understanding of SDLC. • Strong Linux systems understanding including Bash / shell scripting and debugging • Proficiency in Go, Python, and/or JavaScript/TypeScript for building tools and automation. • Expertise with CI/CD systems such as GitHub Actions, GitLab CI, or similar. • Experience with infrastructure as code tools like Terraform, CloudFormation, or Ansible. • Familiarity with observability tools such as Prometheus, Grafana, Datadog, or equivalent. • Proficient in Git and version control practices. • Strong problem-solving and debugging skills. • Proven effectiveness in leading cross-functional projects. • Track record of building patterns and solutions leveraged across an organization.
Apply Now