Sign up to access all features of our service
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Solutions and Platforms Engineer

Overview

Agentic AI Observability Senior Engineer is responsible for deploying, integrating, and operating a scaled Agentic AI observability platform across both internal and external agent frameworks. This role focuses on production-ready instrumentation and telemetry pipelines that provide end-to-end visibility across multi-step agent workflows—including planner/executor loops, tool/function calls, RAG retrieval, and memory/state—ensuring reliability, safety, performance, and cost governance at scale

Responsibilities
  1. Agentic AI Observability at Scale (0%)
  • Platform Deployment & Operations (Agentic AI Observability at Scale)Deploy and run the Agentic AI observability platform across  dev/uat/prod with HA, resiliency, and controlled rollouts Implement release automation (CI/CD), canary deployments, rollback strategies, and configuration management for platform components Own operational readiness: on-call runbooks, incident response, and production support for agent observability services
  • End-to-End Agent Workflow Tracing (Planner → Tools → Retrieval → Response)Implement  distributed tracing for full agent execution graphs, including correlation across: prompts, intermediate reasoning steps (where permitted), tool calls, external APIs, retrieval queries, and final responses Enforce consistent  trace context propagation , correlation IDs, and semantic conventions across agent services Build instrumentation patterns to represent agent flows as spans (e.g.,  plan span tool span retrieval span memory span response span )
  • Agent Framework Integrations & Standardized Instrumentation Deploy and maintain integrations for internal agent frameworks and external ecosystems such as  Crew.ai , LangChain Semantic Kernel AutoGen , and custom orchestrators Create reusable SDKs/middleware/sidecar patterns for teams to instrument agents with minimal effort Define and implement tagging standards for: agent name/version, tool name, model provider, prompt template ID, retrieval source, tenant/app, and environment
  • Agentic AI Telemetry Pipelines & AI-Specific Signals Build scalable pipelines for agent telemetry (logs/metrics/traces) using  OpenTelemetry and platform observability tooling Capture AI-specific metrics including: token usage, cost per task, tool-call latency, retrieval latency, grounding score proxies, error rates, and agent loop iterations Implement sampling and redaction strategies for sensitive agent payloads (prompts, responses, retrieved content) aligned to governance requirements
  1. Collaboration with Teams (10%)
    • Collaborate with transformation teams and business stakeholders to understand requirements and tailor AI agents to specific domains.
    • Work closely with AI platform teams to build scalable and cross-domain AI agents while ensuring end-to-end observability.
  2. Integration & Deployment (10%)
    • Build and maintain CI/CD pipelines for agent services and operations center components, including automated testing and deployment
    • Automate onboarding for new agent use cases (templates, scaffolding, configuration checks)
    • Drive best practices for secure, scalable, and cost-effective agent deployments
  3. Continuous Learning (10%)
    • Stay updated with the latest advancements in AI and machine learning technologies and integrate these into existing or new AI agents.
    • Conduct thorough testing and validation to ensure the reliability and accuracy of AI agents and solutions.
Qualifications
  • Education: Bachelor’s or Masters in Computer Science, AI/ML, Data Science, or a related field.
  • Experience: 4-8+ years of software engineering experience; 2-3+ years building and observe AI/ML or GenAI applications preferred
  • Required Expertise:
    • Strong hands-on experience deploying observability solutions (Prometheus/Grafana/Elastic/Splunk/Datadog or equivalent)
    • Deep working knowledge of OpenTelemetry instrumentation and telemetry pipeline operations
    • Experience observing agentic AI systems: tool/function calls, orchestration, routing, memory/state, and RAG pipelines
    • Familiarity with Crew.ai, LangChain, Semantic Kernel, AutoGen, or similar agent frameworks
    • Experience with evaluation/quality monitoring and safe logging strategies for LLM systems
    • FinOps experience for tracking token and GPU spend, chargeback/showback, and cost anomaly detection
    • Experience implementing data governance controls for AI telemetry (PII redaction, retention, auditability)
    • Strong Kubernetes experience (AKS/EKS/GKE) including Helm, operators, ingress, and service networking
    • Strong automation skills (Python/Bash/Go) and CI/CD experience
    • Infrastructure-as-Code (Terraform/Bicep/CloudFormation)
    • Agent workflow tracing and telemetry correlation
    • Production operations and debugging distributed systems
    • Observability-as-a-platform enablement and automation
    • Strong documentation, collaboration, and stakeholder influence
    • Technical Proficiency: Implement monitoring for agent failure modes: tool-call failures, infinite loops, timeouts, hallucination risk signals, retrieval misses, and degraded response quality. Create alerts aligned to operational SLOs (availability, latency, tool reliability) and AI-specific indicators (cost spikes, loop bursts, retrieval anomalies). Support guardrail observability: policy blocks, content filtering events, and safety classifier outcomes (where applicable). Build onboarding automation (IaC, templates, CI checks) that makes observability “default-on” for all agentic services.
    • Problem-Solving: Ability to translate business challenges into technical solutions.
    • Collaboration Skills: Effective at working within cross-functional teams.
    • Agility: Flexibility to adapt to changing requirements and new technologies.
    • Communication Skills: Capable of explaining complex technical concepts to non-technical stakeholders.
Vacancy posted 11 days ago
Similar jobs that could be interesting for youBased on the AI Solutions and Platforms Engineer in Hyderabad vacancy
  •  ...Overview Agentic AI Observability Senior Engineer is responsible for deploying, integrating, and operating a scaled Agentic AI observability platform across both internal and external agent frameworks...  ...and accuracy of AI agents and solutions. Qualifications Education... 
    Suggested
    Full time
    Hyderabad
    14 days ago
  •  ...Overview The Junior AI Observability Architect is an execution-focused engineer who designs, builds, and operates observability...  ...the enterprise AI observability platform. Working under the strategic...  ...Quality Engineering for Agentic Solutions — Post Go-Live & Continuous QE (... 
    Suggested
    Full time

    PepsiCo

    Hyderabad
    a month ago
  • Job Title : AWS Solution Architect. Location : Delhi / Gurgaon (Work at Client...  ...working closely with business, engineering, and operations teams. This...  ...Azure/GCP).- Exposure to Generative AI / AI-driven cloud use cases.- Experience in SaaS platform architecture.- Background in... 
    Suggested
    Full time
    Hybrid work

    MINFY TECHNOLOGIES PRIVATE LIMITED,

    Hyderabad
    21 days ago
  •  ...Overview The  AI Observability Engineer (Agentic Frameworks & AI Agent Operations Center Developer)builds and operationalizes  agentic AI solutions using modern orchestration frameworks and contributes...  ...domains. Work closely with AI platform teams to build scalable and cross... 
    Suggested
    Hyderabad
    14 days ago
  • Description :We are looking for experienced AI Solutions Engineers with strong expertise in Microsoft Copilot Studio and Power Platform technologies to build and manage production-grade AI agents and workflow automation solutions.The ideal candidate should be capable of owning... 
    Suggested

    Wits Innovation Lab

    Hyderabad
    29 days ago
  •  ...:Job description :Senior Cloud Platform EngineerLocation : This is a hybrid...  ...results. As a Fortune 500 Solutions Integrator with deep expertise in cloud, data, AI, cybersecurity, and intelligent...  ...role :As a Senior Cloud Platform Engineer, you will work collaboratively to... 
    Hybrid work

    Insight Direct India Private Limited

    Hyderabad
    21 days ago
  •  ...experienced DevOps Specialist, GitHub & Cloud Platform Engineer to join our enterprise cloud team. This...  ...DevOps environments, GitHub enterprise solutions, implementing Public Cloud Landing Zone...  ...with GitHub Copilot for Business and AI-assisted development workflows... 

    Luxoft

    Hyderabad
    11 days ago
  • The Job in short : The Platform Engineer is an infrastructure specialist focused on providing shared...  ...workflows and automate SDLC steps using AI tools.- Infrastructure as Code (IaaC) :...  ...to optimize delivery.- Centralized Solutions : Identify recurring manual tasks across... 
    Long term contract
    Local area

    Backbase

    Hyderabad
    8 days ago
  •  ...Integrate advanced observability platforms (Dynatrace, CloudWatch) with...  ...• Design, deploy, and govern AI-powered agents (using Azure Copilot...  ...robust secrets management solutions (AWS Secrets Manager,...  ...connectivity, software, digital engineering and platforms. The Group €22.5... 
    Work at office
    Remote job
    Flexible hours

    Capgemini

    Hyderabad
    more than 2 months ago
  •  ...payment software, we provide the world’s leading brands with AI-powered solutions across the full AR lifecycle—from invoice presentment and...  .... When we fall short, we own it and come back stronger. Platform Engineer As a Platform Engineer within our Operations Engineering... 
    Full time

    Billtrust India Careers

    Hyderabad
    1 day ago
  •  ...reliability and security of cloud platforms and services. • Design, deploy, and govern AI-powered agents (e.g., using...  ...Implementing AI based automation solutions for Cloud Operations to Monitor...  ...connectivity, software, digital engineering and platforms. The Group €22.5 billion... 
    Work at office
    Remote job
    Flexible hours

    Capgemini

    Hyderabad
    more than 2 months ago
  •  ...Summary The CAE Solutions product team is mainly responsible for consultancy, administration and support of the whole range of applications and tools used by the engineering departments of BASF, like different 2D and 3D tools for planning and maintenance activities of production... 
    Permanent employment
    Full time

    BASF Digital Solutions Private Limited

    Hyderabad
    14 days ago
  •  ...Career Family : Cloud Role Type : Platform Engineer The opportunity Your role will be...  ...optimization. Use various tools to orchestrate solutions Understand how IT operations are...  ...in capital markets. Enabled by data, AI and advanced technology, EY teams help... 
    Full time

    Ernst & Young

    Hyderabad
    18 days ago
  •  ...As part of the high performance ServiceNow platform team of Reckitt, the role of the Platform...  ...strategic platform programs – Eg: Implementation AI capabilities, Now Assist, Employee portal,...  ...in Technical Architecture, design & Solutions on ServiceNow Design/Architect Now... 
    Full time
    Local area

    Reckitt

    Hyderabad
    19 days ago
  • Job Description:Role Overview:The Platform & Orchestration Engineer is a critical new role responsible for designing...  ...execution layer that underpins all AI agent workflows in the Agentic AI ERP...  ...workflow templates for each Rimini Solution (Finance, Procurement, Supplier, Expense... 

    Rimini Street

    Hyderabad
    4 days ago
  •  ...opportunity to join a high-caliber software engineering team that is growing quickly. You will...  ..., with a focus on our core data and AI platforms. Your work will focus on enhancing the platform...  ...Integration Ops, product teams, and solutions architects to understand integration... 
    Relocation

    Cohere Health

    Hyderabad
    8 days ago
  •  ...experience developing Generative AI applications on AWS using...  ...cost-efficient Generative AI solutions involving evaluation, prompt management...  ...-grade GenAI microservices or platform components equipped with...  ...testing. Build & Platform Engineering o Lead the use of... 
    Full time
    Shift work
    Weekday work

    Weekday AI

    Hyderabad
    7 days ago
  •  ...MNCRole : Enterprise Architect AI & Cloud PlatformsExperience :...  ...native and AI-powered enterprise platforms. The role requires strong...  ...scalable, secure, and resilient solutions across cloud, AI, data, and enterprise...  ..., and best practices across engineering teams.- Collaborate with... 

    Saaki, Argus, Averil Consulting

    Hyderabad
    7 days ago
  •  ...hiring for the role of Associate Software / Platform Engineer I! Responsibilities of the Candidate:...  ...integrate Python-based components for AI use cases following established designs...  ...scalable, reliable, and maintainable AI solutions. Assist in testing, debugging, documentation... 
    Hyderabad
    16 hours ago
  • Role : Senior AI + Java Platform Engineer About Kanerika : Kanerika Inc. is a premier global software products and services company specializing in innovative solutions for data-driven enterprises.We help organizations accelerate digital transformation and maximize business... 
    Flexible hours

    Kanerika Inc

    Hyderabad
    18 days ago
  • About the Role :We are seeking strong Senior Java + AI Platform Engineers to work on strategic enterprise initiatives focused on building scalable...  ...and implement telemetry, monitoring, and observability solutions- Build dashboards and reporting systems for evaluation results... 

    Caucus Consultant

    Hyderabad
    10 days ago
  •  ...Architect will drive end-to-end solution architecture, delivery...  ...Salesforce Data Cloud concepts.- AI & Automation Enablement : Contribute...  ..., Information Technology, Engineering, or related field (or equivalent...  ...(LWC), and Salesforce platform development.- Strong expertise... 

    Argano Software Private Limited

    Hyderabad
    22 days ago
  • Rs 10 - 40 lakhs p.a.

     ...are seeking a highly experienced and strategic AWS Platform Architect to lead the design, implementation, and optimization...  ..., and container orchestration. An active AWS Solutions Architect Professional or AWS DevOps Engineer Professional certification is mandatory for this... 
    Full time
    Hybrid work
    Weekday work

    Weekday AI

    Hyderabad
    a month ago
  •  ...applications using Microsoft Power Platform.- Build and customize AI-powered copilots using Microsoft Copilot...  ...Design and implement custom Copilot solutions integrated with Microsoft 365 and...  ...Computer Science, Information Technology, Engineering, or a related field.- 6-10 years of... 

    Recruitement Agency

    Hyderabad
    4 days ago
  • Summary Location: Hyderabad The Associate Director, AI Platform Architect will play a pivotal role in architecting, evaluating, and delivering enterprise-grade AI platform solutions across Novartis. Working at the intersection of cloud infrastructure, GenAI innovation... 
    Full time

    Information Technology

    Hyderabad
    a month ago
  •  ...description : Role & responsibilities : We are looking for Java GCP Solution Architect permanent position with MNC company for Bengaluru/...  ...- [Must] Need to have an understanding and designed integration platform to meet the NFR requirements.- [Must] Should have implemented design... 
    Permanent employment
    Hybrid work

    Tanisha Systems Pvt Ltd.

    Hyderabad
    14 days ago
  •  ...We are seeking a highly experienced Power Platform Technical Advisor to provide strategic and...  ...DevOps, citizen development initiatives, and AI/Copilot adoption within a rapidly evolving...  ...pipeline hygiene for low-code/no-code solutions Guide CI/CD strategy and deployment best... 
    Long term contract
    Full time
    Part time
    US shift

    No Limit Technology

    Hyderabad
    a month ago
  •  ...Integration Partner and Internal/External stakeholders to create requirements and design complex solutions in WFM. · Serve as configuration and subject matter expert on WFM platform including, but not limited to advanced scheduling, pay rules, accruals, function and display... 

    PepsiCo

    Hyderabad
    23 days ago
  •  ...Location: Hyderabad The Associate Director, AI Platform Architect – AWS will play a strategic...  ...Units, AI PMO, MLOps, Security, and Engineering teams to shape the future of scalable AI...  ...leadership and mentorship to engineering and solution teams within the India hub and global... 

    Information Technology

    Hyderabad
    a month ago
  •  ...experience  designing, configuring, deploying and maintenance in UKG WFM including but not limited to  pay rules accruals, function access, web/mobile navigators, data views, attestation, reporting and device management. 5+ years experience working with cloud solutions.... 
    Hyderabad
    10 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Solutions and Platforms Engineer. Be the first to apply!