We are seeking an experienced Senior DevOps Engineer to join our dynamic team. As a Senior DevOps Engineer, you will play a crucial role in designing, implementing, and maintaining the infrastructure and tools necessary for the continuous integration, delivery, and deployment of our software products. You will work closely with cross-functional teams to streamline our development processes, enhance automation, and ensure the scalability, reliability, and security of our systems.
Responsibilities:
1. End-to-End Automation:
• Design, implement, and manage end-to-end automation pipelines for software development, deployment, and the machine learning model lifecycle.
2. CI/CD Implementation:
• Develop and maintain CI/CD pipelines for both traditional application development and machine learning model deployment.
3. Infrastructure as Code (IaC):
• Use Infrastructure as Code (IaC) tools (e.g., Terraform, Alibaba Cloud ROS, CloudFormation) to provision and manage cloud resources efficiently.
4. Containerization and Orchestration:
• Implement containerization using Docker and orchestrate containers using Kubernetes for scalable and resilient applications and machine learning workloads.
5. Monitoring and Logging:
• Implement and manage monitoring solutions for both software applications and machine learning models, proactively addressing issues.
6. Security and Compliance:
• Enforce security measures for applications and machine learning systems, ensuring compliance with industry standards and data privacy regulations.
7. Automation and Scripting:
• Write automation scripts (e.g., Bash, Python) to streamline operational tasks, enhance system efficiency, and support machine learning workflows.
8. Incident Response:
• Participate in incident response activities for both application and machine learning-related incidents.
Skills:
• Proficiency with CI/CD tools such as Jenkins, GitLab CI, or Travis CI.
• Experience with IaC tools like Terraform or Alibaba Cloud ROS and familiarity with cloud platforms (e.g., Alibaba Cloud, Azure, Google Cloud).
• Hands-on experience with Docker and Kubernetes.
• Knowledge of monitoring tools such as CloudMonitor, CloudWatch, Prometheus, Grafana, ELK Stack, or similar.
• Proficiency in scripting languages (e.g., Bash, Python) for automation.
• Effective communication and collaboration skills to work with cross-functional teams.
• Strong problem-solving skills and the ability to troubleshoot complex issues across both application and machine learning domains.
Qualifications:
1. Bachelor's or master's degree in computer science, information technology, or a related field, or equivalent relevant experience.
2. Minimum of 5 years of experience in a cloud support role.
3. ACP certification required; CKA and ACE certifications are an advantage.
4. Excellent problem-solving and analytical skills.
Location:
This position is located in Lahore, Pakistan, or Kuala Lumpur (KL), Malaysia.
Join our team and be part of an innovative and forward-thinking organization dedicated to delivering cutting-edge solutions to our customers.