Cloud Operations Engineer – L1

Cloud Operations Engineer – L1 Job Description Template

Our company is looking for a Cloud Operations Engineer – L1 to join our team.

Responsibilities:

  • Support and sustain customer facing AWS Production;
  • Respond to product escalations from Support as well as Engineering;
  • Communicate and troubleshoot operational issues supporting a complex environment;
  • Participate in Cloud Updates and Maintenance;
  • Assist in security vulnerability and remediation;
  • Initiate Incident Response for cloud outages;
  • Scale infrastructure capacity on production;
  • Front line support for Cloud Monitoring – Infrastructure and Application;
  • Must be able to work extended hours as needed including being available for off hours production support;
  • Respond, troubleshoot and resolve production alerts;
  • Analyze trends to pro-actively prevent incidents;
  • The CloudOps Engineer is responsible for 24/7 availability for Druva, a cloud SaaS;
  • Must feel comfortable working in a fast-paced, dynamic and flexible environment;
  • Participation in an on-call rotation and operate effectively in a global 24×7 environment.

Requirements:

  • Knowledge of Cloud providers including Amazon AWS, Google Cloud Platform, or Microsoft Azure;
  • Requires the ability to multitask and work well under pressure;
  • Scripting knowledge (Shell, Python);
  • Strong Linux/Unix administration;
  • Requires excellent communications skills, both verbal and written;
  • Ability to learn new technologies quickly with some support and guidance;
  • Monitor site reliability and performance;
  • Monitor and analyze system logs and RCA;
  • Ability to think outside-of-the-box to generate creative solutions to problems;
  • Knowledge of Configuration Management (SaltStack) for complex software management.