Site Reliability Engineer
H R Solutions
Role Overview :As a seasoned Site Reliability Engineer, you will be instrumental in ensuring the unwavering availability, performance, and scalability of our critical production systems. This role demands a proactive approach to operational excellence, working closely with our development, product, and infrastructure teams to embed reliability principles throughout the software development lifecycle. You will champion a culture of automation and continuous improvement, directly impacting the stability of our services, enhancing user trust, and safeguarding business continuity for our diverse customer base.Key Responsibilities :- Lead the architectural design and implementation of highly resilient and scalable infrastructure solutions, focusing on Service Reliability and performance optimization for critical applications.- Drive the development and enforcement of robust Disaster Recovery and IT Disaster Recovery plans, conducting regular drills and ensuring swift, effective recovery strategies to minimize downtime.- Define, monitor, and optimize Service Level Indicators (SLIs) and Service Level Objectives (SLOs), establishing `Error Budgets` to balance innovation with operational stability and meet stringent `SLA` commitments.- Spearhead Incident Management processes, leading post-incident reviews to identify root causes, implement corrective actions, and prevent recurrence, fostering a learning culture.- Develop and implement advanced Troubleshooting methodologies and tools to quickly diagnose and resolve complex system issues across distributed environments.- Automate operational tasks, infrastructure provisioning, and deployment pipelines using modern SRE practices, reducing manual toil and improving system efficiency.- Provide expert guidance and mentorship in System Administration best practices, cloud infrastructure, and observability tools to cross-functional teams, elevating overall technical capabilities.- Collaborate with engineering teams to design and build systems that are inherently observable, scalable, and maintainable from inception.- Evaluate and integrate new technologies and tools to enhance our reliability posture, security, and operational efficiency.Required Skillset :- Demonstrated expertise in designing, building, and operating large-scale, highly available distributed systems, with a deep understanding of Site Reliability principles.- Proven ability to lead and execute complex Disaster Recovery and IT Disaster Recovery initiatives, including strategy formulation, testing, and execution.- Strong analytical skills to define, track, and interpret Service Level Indicators (SLIs), manage Error Budgets, and ensure SLA adherence.- Exceptional Troubleshooting and problem-solving capabilities for intricate production issues across diverse technology stacks and cloud environments.- Extensive experience in System Administration for Linux-based systems, including networking, security, and performance tuning.- Proficient in scripting and automation (e.g., Python, Go, Shell) and experience with infrastructure-as-code tools (e.g., Terraform, Ansible).- Strong leadership and Incident Management skills, with the ability to remain calm and decisive under pressure during critical outages.- Excellent communication and interpersonal skills, capable of articulating complex technical concepts to both technical and non-technical stakeholders.- Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field from a reputable institution.- Adaptability to work in a dynamic, fast-paced environment based out of our Navi Mumbai office, collaborating effectively with global teams. (ref:hirist.tech)
- ...in the development of cross-functional software products. About the Role We are looking for a Senior DevOps / Site Reliability Engineer (SRE) with 7+ years of experience to join our high-performing engineering team. This role is pivotal in building scalable systems...SuggestedRemote job
- ...Let’s be #BrilliantTogether ISS STOXX is looking for a Senior Site Reliability Engineer to join our team in Mumbai, India. Shift hours : Working hours (10 AM IST to 7 PM IST). This role expects rotational on-call support 24X7. Responsibilities: Assist the...SuggestedRemote jobLocal areaWorldwideShift work
- ...Job Title: Rotating Engineer – Offshore Reliability Experience: Minimum 12 Years Qualification: Bachelor’s Degree in Mechanical Engineering Industry: Oil & Gas / Refinery (Offshore) Job Description: The Rotating Engineer – Offshore Reliability will be responsible...SuggestedPermanent employmentFull time
- ...Job Title: Rotating Engineer – Onshore Reliability Experience: Minimum 12 Years Qualification: Bachelor’s Degree in Mechanical Engineering Industry: Oil & Gas / Refinery (Onshore) Job Description: The Rotating Engineer – Onshore Reliability will be responsible...SuggestedPermanent employmentFull time
- ...appropriately balance speed and long-term maintainability3. Drive a results oriented tech team with help of your expertise in technical sales engineering in cloud computing environment, big data, machine learning, and numerical programming frameworks (e.g., TensorFlow, Python, MATLAB)...SuggestedLong term contract
- Install, update, and administer Mainframe 3rd Party Products. Perform changes within Company regulations and security standards. Assist with testing, implementation, and installation of new/improved systems, as well as product upgrades. Attend meetings and interact regularly...
- ...Santacruz (Kalina) / Mahape (Navi Mumbai)Notice Period : Immediate-15 daysRole Overview : We are seeking an experienced Multi-Cloud Platform Engineer with 4 to 7 years of hands-on expertise in building and operating secure, scalable platforms across Microsoft Azure and Google Cloud...Hybrid workImmediate start
- ...Job Title: Mechanical Engineer – Offshore Reliability Experience: Minimum 12 Years Qualification: Bachelor’s Degree in Mechanical Engineering Industry: Oil & Gas / Refinery (Offshore) Job Description: The Mechanical Engineer – Offshore Reliability will be...Permanent employmentFull time
- ...Job Title: Mechanical Engineer – Onshore Reliability Experience: Minimum 12 Years Qualification: Bachelor’s Degree in Mechanical Engineering Industry: Oil & Gas / Refinery (Onshore) Job Description: The Mechanical Engineer – Onshore Reliability will be responsible...Permanent employmentFull time
- Job Responsibilities :- CI/CD Pipeline Management- Design, implement, and maintain CI/CD pipelines for faster and reliable releases.- Infrastructure as Code (IaC)- Manage infrastructure using tools like Terraform, CloudFormation, or ARM templates.- Cloud & Environment Management...
- ...yet powerful charts, reports, slides, etc.- For internal and external client projects, use our proprietary tools for performing data engineering, analytics and visualization activities. Responsible for project deliveries, escalation, continuous improvement, and customer...Local area
- Role & Responsibilities : - Develop and deliver automation software required for building & improving the functionality, - Reliability, availability, and manageability of applications and cloud platforms - Champion and drive the adoption of Infrastructure as Code (laC) practices...
- Job Description : Responsibilities : - Build and Polish iOS Magic : Dive deep into SwiftUI for sleek, modern UIs and UIKit for rock-solid legacy love. Make our app swipe-right smooth!- Cross-Platform Adventures : Tinker with Flutter basics to bridge iOS and Android - because...
- Job Description :- Make a difference: Transform industries with the power of dynamic pricing, enabling them to (a) optimize the utilization of resources (do more with less), (b) democratize products and services across customer segments, (c) create success stories for rest of...
- Job Description : We are seeking a skilled Flutter Developer responsible for developing high-quality, cross-platform mobile applications using the Flutter framework. The ideal candidate should have experience in building scalable, performance-driven apps for Android and iOS,...
- ...and constantly strive to improve solutions.- Follow software design, development, testing and documentation best practices.- Data engineering : Extract and parse data from online and local data sources; Clean up data, audit data for accuracy, consistency and completeness.-...Local area
- ...- Work closely with backend and product teams to define APIs, offline payload structures, and reconciliation logic. - Mentor junior engineers, conduct design reviews, and ensure adherence to clean architecture and coding best practices. - Collaborate with QA to design robust...
- Role Overview:We are looking for an Android Developer to build seamless mobile experiences and solve real-world challenges. You will work closely with Product, Design, and Backend teams to own end-to-end feature development, from architecture and APIs to UI and delivery.Key ...
- ...instrumental in collaborating with commercial, project management, engineering and design technical leaders to create opportunities for Jacobs... ...: Planning/ monitoring of all construction activities on site. Assist Site Manager in planning of construction activities within...Full timeFor contractorsFlexible hours
- Job Responsibilities :- Collaborate with data scientists, software engineers, and business stakeholders to understand data requirements and design efficient data models.- Develop, implement, and maintain robust and scalable data pipelines, ETL processes, and data integration...
- Responsibilities :- Craft responsive, high-performance front-end applications using React / Angular / Vue.js.- Translate beautiful UI/UX designs into interactive experiences for global clients.- Collaborate with backend developers to integrate with AI-powered RESTful APIs.- ...
- Role Overview : We are seeking a detail-oriented and motivated QA Engineer with 0 to 2 years of experience to join our team. The ideal candidate will ensure the delivery of high quality software products by conducting thorough testing and collaborating with the development team...
- ...insurance analytics.Experience :- 10+ years of experience in Data Engineering or Data Platform development.- Strong hands-on expertise in... ...-based data lakes and data warehouses.- Ensure data quality, reliability, governance, and monitoring across data pipelines.- Integrate...
- Job Description - (DevSecOps with AWS & Kubernetes Expertise)Profile Designation : (DevSecOps) Department : IT Job Location : Thane Job Type : PermanentWork Mode : OfficeKey Responsibilities- Design, implement, and manage secure AWS cloud infrastructure.- Build and maintain ...
- ...construction materials to ensure timely availability at project sites. This role requires strong organizational skills, vendor management... ..., invoices, and usage reports. Support project managers and engineers with material-related updates and forecasts. Here's what you'll...Full timeFor contractorsFlexible hours
- ...instrumental in collaborating with commercial, project management, engineering and design technical leaders to create opportunities for Jacobs... ...(state government). Inspection of materials received at site. Knowledge to identify the contractor resource requirement to execute...Full timeFor contractorsFlexible hours
- ...constantly strive to improve solutions.- Setup and following software design, development, testing and documentation best practices.- Data engineering: Extract and parse data from online and local data sources; Clean up data, audit data for accuracy, consistency and completeness.-...Local area
- We are looking for an experienced Software Engineer with strong expertise in SharePoint Online, SPFx, React, and Power Platform to design, develop, and maintain enterprise-level applications. The ideal candidate should be capable of translating business requirements into...Full time
- Summary :- Capable to write code with generic simplified algorithms for any new build or enhancement.- Can apply Object Oriented concepts and Design Patterns.- Application Development for various Business Demands using latest development technologies.- Responsible for Coding...
- ...objectives. This role ensures high performance, maintainability, and reliability of applications built using .NET technologies, microservices,... ...Experience: ~ Bachelor’s degree in Computer Science, IT, Engineering, or a related field ~5+ years of professional experience in...Full timeImmediate start
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!
