Senior DevOps Engineer
A Senior DevOps Engineer is needed to handle production incidents, design and manage Kafka and Redis clusters, optimize cloud infrastructure, and develop DevOps platforms. Requires 5+ years of experience with Kafka and Redis, proficiency in programming languages, and experience with CI/CD tools and AWS.
Responsibilities:
- Handle production incidents and conduct post-mortem analysis to enhance system stability.
- Design, deploy, monitor, and troubleshoot Kafka and Redis clusters in production environments, ensuring optimal performance and reliability.
- Collaborate with development teams to ensure seamless deployment of applications and systems.
- Manage and optimize cloud infrastructure (AWS, Alicloud) for performance, cost efficiency, and reliability.
- Develop DevOps platforms, including online load testing and change management systems.
- Enhance automation in infrastructure operations management using LLM or AI.
Qualifications:
- At least 5 years of hands-on experience in Kafka and Redis operations in large-scale production environments, with the ability to collaborate with developers to optimize code.
- Proficient in at least one programming language: Python, Go, or Java, along with strong SQL skills.
- Hands-on experience with containerization and orchestration technologies, including Docker and Kubernetes.
- Strong experience with CI/CD tools such as GitHub Actions, Ansible, and Terraform.
- At least 3 years of experience with the AWS cloud platform; experience with GCP, Azure, or AliCloud is a plus.
- Excellent problem-solving and troubleshooting skills.
- Strong team collaboration skills and the ability to build partnerships with other teams and business units.
- Practical experience in AIOps is preferred.