Work in South Holland Job Board

The SRE will be responsible for the reliability, availability, and performance of Azure/AWS PaaS and IaaS workloads. They bridge the gap between development and operations, focusing on building automated systems that prevent failures, managing incident responses, and optimizing cloud costs.

Key Responsibilities

System Reliability & Monitoring: Design, implement, and maintain comprehensive monitoring and alerting systems such as Azure Monitor, AWS CloudWatch, Application Insights, and Log Analytics. • Automation & Toil Reduction: Automate repetitive manual operations (toil) such as environment provisioning, system patching, and scaling. Use IaC tools like Terraform and Ansible to manage infrastructure. • Incident Response & Management: Actively manage incident responses, root cause analysis (RCA), and post-mortem investigations to improve system reliability and minimize mean time to resolution (MTTR). • Cloud SRE Agent Integration: Deploy and configure Cloud SRE Agent to automate incident investigation, execute remediation steps (restart, scale, rollback), and manage routine tasks. • Capacity Planning & Scalability: Analyze usage patterns to optimize cloud resources, ensuring high availability and performance while managing costs via Azure Cost Management. • CI/CD & DevOps Collaboration: Integrate automation workflows into CI/CD pipelines (e.g., GitHub Actions or Azure Pipelines) to ensure reliable deployments.

Skill Requirements

Cloud Platforms: Expert knowledge of Microsoft Azure infrastructure services (Compute, Storage, Networking, AKS). • Scripting & Programming: Proficiency in Python, Bash, or PowerShell for building automation tools. • Infrastructure as Code (IaC): Extensive experience with Terraform and ARM templates/Bicep. • Observability Tools: Experience with Azure Monitor, Grafana, Prometheus, or Datadog. • Containers & Orchestration: Solid understanding of Kubernetes/AKS (Azure Kubernetes Service). • Operating Systems: Proficient in Windows/Linux environments. • Azure Certification is a + • Exposure to multi Cloud environment is must.

Other Requirements

Reviewing Service Level Objectives (SLOs) and error budgets. 2. Refining auto-scaling rules for Kubernetes clusters based on traffic trends. 3. Working with developers to review service architecture and ensure fault tolerance. 4. Configuring AI-driven alert suppression to reduce alert fatigue. 5. Creating Azure Dashboards to visualize key performance indicators (KPIs).

Apply now

Information at a Glance

Why HCLTech?

At HCLTech, you'll supercharge your potential. You'll find your career. And you'll find your spark. All at a place that knows that helping its customers stay on top starts by putting its people first.

HCLTech is a global technology company, home to more than 226,300 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. Consolidated revenues as of 12 months ending December 2025 totaled $14.5 billion.

Benefits

At HCLTech, we believe in empowering our employees with comprehensive benefits that support their professional growth and enhance their well-being. When you sign up for a career with us, you gain access to:

Industry-benchmarked compensation

Best-in-class healthcare benefits

Personal time off

Maternity and paternity benefits

Access to skills / higher education programs/resources

Discounts on products and services via Benefit Box

Participate in CSR programs and live life with a purpose

Opportunities to grow and advance your career

Note: The benefits listed above vary depending on the nature of your employment and the country where you work. Some benefits may be available in some countries but not in all.

See more open positions at HCL Technologies

Powered by Getro.com

Privacy policy Cookie policy

Sign up for our Newsletter

About us

Work in Rotterdam – The Hague is collaborative effort between public entities, focused on facilitating the contact between companies with open positions located in our region and qualified international professionals. Our mission is to attract top talent to the Greater Rotterdam – The Hague area in the Netherlands and work together on creating a better, greener, smarter, cleaner and healthier world.