
Cloud Site Reliability Engineer (SRE)
- Türkiye
- Permanent
- Full-time
- Reliability and Availability:
  - Implement best practices for high availability and disaster recovery across cloud environments.
  - Monitor system performance, availability, and incident response to ensure minimal downtime.
  - Create and maintain robust monitoring and alerting systems.
- Automation and Infrastructure as Code (IaC):
  - Develop and maintain automation scripts and Infrastructure as Code (IaC) templates for provisioning and managing cloud resources.
  - Automate routine tasks to increase operational efficiency and reduce manual interventions.
- Scalability and Performance Optimization:
  - Collaborate with development teams to design and implement scalable and performant cloud architectures.
  - Conduct performance analysis and tuning to optimize system response times and resource utilization.
- Incident Response and Troubleshooting:
  - Participate in incident response activities, including root cause analysis, resolution, and post-incident reviews.
  - Troubleshoot complex issues across the cloud stack and coordinate with relevant teams for resolution.
- Security and Compliance:
  - Implement security best practices and compliance measures in cloud environments.
  - Collaborate with security teams to ensure data protection and compliance with industry standards.
- Capacity Planning:
  - Monitor resource utilization and forecast capacity requirements to support business growth.
  - Implement scaling strategies to accommodate changing workloads.
- Documentation and Knowledge Sharing:
  - Maintain comprehensive documentation of cloud configurations, processes, and procedures.
  - Share knowledge and best practices with team members and contribute to a culture of continuous learning.
Basic Qualifications:
- Bachelor's Degree in Computer Science, Information Technology, or a related field.
- 3+ years of experience in cloud operations, SRE, or a related role.
- Proficiency in cloud platforms such as AWS, Azure, or Google Cloud.
- Certification in cloud platforms (e.g., AWS Certified DevOps Engineer - Professional, Google Cloud Professional Cloud DevOps Engineer, Microsoft Certified: DevOps Engineer Expert).
- Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
- Knowledge of infrastructure monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
- Strong scripting and programming skills (e.g., Python, Bash, Go).
- Familiarity with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI/CD).
- Excellent problem-solving and communication skills.
- Ability to work collaboratively in a cross-functional and fast-paced environment.