Cloud Operations Engineer – L1 Job Description Template
Our company is looking for a Cloud Operations Engineer – L1 to join our team.
Responsibilities:
- Support and sustain customer facing AWS Production;
- Respond to product escalations from Support as well as Engineering;
- Communicate and troubleshoot operational issues supporting a complex environment;
- Participate in Cloud Updates and Maintenance;
- Assist in security vulnerability and remediation;
- Initiate Incident Response for cloud outages;
- Scale infrastructure capacity on production;
- Front line support for Cloud Monitoring – Infrastructure and Application;
- Must be able to work extended hours as needed including being available for off hours production support;
- Respond, troubleshoot and resolve production alerts;
- Analyze trends to pro-actively prevent incidents;
- The CloudOps Engineer is responsible for 24/7 availability for Druva, a cloud SaaS;
- Must feel comfortable working in a fast-paced, dynamic and flexible environment;
- Participation in an on-call rotation and operate effectively in a global 24×7 environment.
Requirements:
- Knowledge of Cloud providers including Amazon AWS, Google Cloud Platform, or Microsoft Azure;
- Requires the ability to multitask and work well under pressure;
- Scripting knowledge (Shell, Python);
- Strong Linux/Unix administration;
- Requires excellent communications skills, both verbal and written;
- Ability to learn new technologies quickly with some support and guidance;
- Monitor site reliability and performance;
- Monitor and analyze system logs and RCA;
- Ability to think outside-of-the-box to generate creative solutions to problems;
- Knowledge of Configuration Management (SaltStack) for complex software management.