We are seeking an experienced DevOps/Ops Engineer to join our dynamic team. The ideal candidate will be responsible for enhancing and maintaining our cloud infrastructure, monitoring systems, and CI/CD pipelines. This role involves working with a variety of technologies and platforms, including AWS, Azure, GCP, Jenkins, Linux, Ansible, Docker, Python, InfluxDB, and Grafana.
As our DevOps/Ops Engineer you will:
- Cloud Infrastructure Management:
- Migrate artifacts from S3, ECR, EC2 from an old AWS account to a new one.
- Configure and manage AWS, Azure, and GCP accounts in alignment with Infrastructure as Code (IaC) principles.
- Automate configuration and optimize performance of OpenNebula Linux servers, including build nodes, data storage, and docker registry.
- CI/CD Pipeline Optimization:
- Optimize the performance and stability of Jenkins nodes; investigate and resolve current failures.
- Review and enhance the structure and efficiency of scripted pipelines.
- Monitoring and Performance Improvement:
- Conduct a thorough review of our existing InfluxDB/Grafana monitoring system and implement improvements.
- Optimize Docker files to reduce size and enhance efficiency.
- Expertise in Linux performance optimization.
- Collaboration and Documentation:
- Collaborate with development and operations teams to streamline deployment processes.
- Document all processes and maintain records of configurations and changes.
In your first 180 days you will...
- Migrate resources: Transfer resources from Amazon S3, ECR, and EC2 from an existing AWS account to a new one, ensuring a smooth and secure transition of resources.
- AWS Account Configuration: Set up the new AWS account in alignment with Infrastructure as Code (IAC) principles to streamline and automate the deployment of infrastructure resources.
- Monitoring System Enhancement: Conduct a thorough evaluation of our current monitoring setup, including InfluxDB and Grafana, with the aim of identifying and implementing enhancements to improve efficiency and effectiveness.
- Jenkins Nodes Optimization: Analyze and optimize the performance of our Jenkins nodes. Investigate and resolve existing issues to enhance node reliability and stability.
- Infrastructure Optimization: Examine our infrastructure, specifically focusing on OpenNebula Linux servers, including build nodes, data storage, and Docker registry. Automate configuration processes and implement performance improvements.
- *Docker File Optimization: Assess current Docker files for efficiency and size reduction opportunities. Implement strategies to streamline and minimize Docker image sizes without compromising functionality.
- *Pipeline Refinement: Review and refine our scripted pipelines, focusing on improving their structure and performance to enhance automation and deployment processes.
- *Cloud Account Management: Evaluate and configure our Azure and GCP accounts in accordance with Infrastructure as Code (IaC) practices, ensuring optimal setup and resource management.
Technology Stack: The ideal candidate will work with a technology stack that includes AWS (EC2, S3, ECR), Jenkins, Linux, Ansible, Docker, Python, InfluxDB, OpenNebula and Grafana.
Requirements:
To be successful you should have;
- Bachelor’s degree in Computer Science, Information Technology, or related field.
- Proven experience in DevOps or similar roles, with a focus on cloud infrastructure and CI/CD pipelines.
- Strong knowledge of AWS, Jenkins, Linux, Ansible, Docker, Python, InfluxDB, and Grafana.
- Experience with Azure and GCP is highly desirable.
- Demonstrated experience in Linux performance optimization.
- Strong problem-solving skills and ability to work under pressure.
- Excellent communication and teamwork abilities.
KernelCare is a product of CloudLinux Inc., the maker of the #1 OS in security and stability for hosting providers. KernelCare, a live kernel patching service provides security patches and bug fixes for a range of popular Linux kernels that can be installed without rebooting the system. Â Check out our website for more information about our KernelCare Product https://tuxcare.com/live-patching-services/
Benefits:
What's in it for you?
A focus on professional development;
- Training reimbursements
- Mentor programs
- Knowledge-Exchange programs
- Interesting and challenging projects
- Flexible working hours
- Paid 24 days of vacation per year and unlimited sick leave
- Medical insurance reimbursement
- Co-working and gym/sports reimbursement
- The opportunity to receive a reward for the most innovative idea that the company can patent.