Job Responsibility
AWS Cloud Infrastructure Management:
- Design, deploy, and manage scalable and highly available cloud infrastructure on AWS.
- Optimize and maintain cloud resources for cost-eAiciency and performance.
- Implement security best practices to safeguard cloud environments and data.
Kubernetes Management:
- Oversee the deployment, scaling, and management of Kubernetes clusters hosted on EC2.
- Manage Kubernetes master and worker nodes, including node scaling, updates, and monitoring.
- Implement and maintain Calico for network policies and ingress control.
- Utilize CertManager for SSL certificate automation and management within Kubernetes. Continuous Integration and Continuous Deployment (CI/CD):
- Establish and maintain CI/CD pipelines for automated application deployments.
- Implement version control systems and branching strategies for streamlined development workflows.
- Automate build, testing, and deployment processes for rapid and reliable releases. Infrastructure as Code (IaC):
- Use Infrastructure as Code tools like Terraform or CloudFormation to automate provisioning and management of AWS and Kubernetes resources.
- Maintain version-controlled IaC templates for reproducible and scalable infrastructure deployments. Monitoring, Alerting, and Security Information and Event Management (SIEM):
- Set up and configure monitoring tools (e.g., CloudWatch, Prometheus, Grafana) to proactively identify and resolve operational issues.
- Create and manage alerts to notify teams of potential incidents or performance concerns.
- Manage SIEM tools for threat detection, log analysis, and security event management. Security and Compliance:
- Implement security measures to protect cloud and Kubernetes resources from potential threats.
- Ensure compliance with industry standards and best practices related to data security and privacy. Performance Optimization:
- Monitor system performance and proactively identify bottlenecks or areas for improvement.
- Implement performance optimization strategies to enhance application and infrastructure performance. Collaboration and Communication:
- Collaborate with development and operations teams to understand requirements and provide technical solutions.
- Communicate eAectively with team members and stakeholders about ongoing projects and initiatives. Documentation:
- Maintain detailed technical documentation related to AWS infrastructure, Kubernetes setup, CI/CD pipelines, and processes.
Job Requirements
- Proficiency in English and Mandarin
- Bachelors Degree in Computer Science, Information Technology, or a related field (or equivalent work experience).
- 5+ years of proven experience as a DevOps Engineer, with a focus on AWS cloud infrastructure and Kubernetes.
- Strong knowledge of AWS services, including EC2, EKS/ECS, S3, Lambda, Route53, RDS, CloudFront, etc.
- Hands-on experience managing Kubernetes clusters on AWS (EC2), including CertManager for SSL generation and Calico for ingress.
- Experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
- Proficiency in scripting languages such as Python, Bash, or PowerShell.
- Familiarity with CI/CD process tools such as Azure, Jenkins, GitLab CI, or AWS CodePipeline.
- Experience with monitoring tools like CloudWatch, Prometheus, Grafana, and SIEM systems.
- Strong understanding of AWS security best practices, including VPC, IAM, and security groups.
- Knowledge of containerization technologies like Docker and orchestration with Kubernetes.
- Excellent problem-solving skills and the ability to troubleshoot complex technical issues.
- Strong communication and interpersonal skills, with the ability to work collaboratively in a team-oriented environment.
- AWS Certification(s), such as AWS Certified DevOps Engineer - Professional, is a plus.
- We are seeking a highly experienced Senior DevOps Engineer to lead the design, implementation, and management of our cloud infrastructure and server environment.
- The ideal candidate will have in-depth knowledge of AWS, Kubernetes, and related technologies, along with strong skills in automation, security, and performance optimization.
Job Benefits
- Medical, dental & Optical Allowance
- Transport Allowance
- EPF SOCSO