Experience: 4 to 12+ years

Location: India. Remote candidates welcome; must be open to relocating to Hyderabad if required. Preference for local Hyderabad candidates.

Role Overview

We are looking for a Senior SRE with strong C# expertise to drive the end-to-end onboarding, validation, and production readiness of Azure VM and hardware SKUs.

This role sits at the intersection of cloud infrastructure, automation, and reliability engineering, ensuring that new SKUs meet Azure’s performance, compliance, and scalability standards before General Availability (GA).

You will act as a technical owner and integrator, collaborating across compute, networking, storage, and hardware teams to ensure seamless SKU lifecycle management.


Key Responsibilities

  • Lead end-to-end onboarding of Azure VM and hardware SKUs
  • Develop and maintain automation frameworks using C# for validation, testing, and reliability workflows
  • Integrate SKUs into Azure control plane and lifecycle systems
  • Validate:
    • Host configurations (firmware, drivers, platform settings)
    • Guest configurations (OS images, VM sizing, features)
  • Define and execute SKU qualification strategy and test coverage
  • Drive synthetic and scenario-based validation testing
  • Track readiness, risks, and blockers across qualification gates
  • Manage test plans, defects, and pipelines in Azure DevOps (ADO)
  • Partner with:
    • Compute, Networking, Storage, Fabric, Capacity teams
    • Hardware vendors for platform and BOM issue resolution
  • Provide go/no-go signals for SKU launches
  • Support Private Preview → Public Preview → GA lifecycle
  • Ensure production reliability, observability, and incident readiness
  • Build tooling for monitoring, alerting, and failure remediation
  • Drive first-time quality and continuous improvement in onboarding workflows
  • Document SKU onboarding, validation processes, and best practices

Required Qualifications

Education

  • Bachelor’s or Master’s in Computer Science, IT, or related field

Technical Skills

  • Strong hands-on experience in C# (Mandatory)
  • 6–8+ years in SRE / Cloud Engineering / Platform Engineering
  • Strong understanding of:
    • Azure VM architecture
    • Host vs Guest boundaries
    • Cloud control plane concepts
  • Experience in:
    • Automation, tooling, or test frameworks using C#
    • Azure infrastructure or hyperscale cloud environments
    • DevOps / SRE practices (CI/CD, monitoring, reliability engineering)
    • Bug tracking & workflow tools (Azure DevOps preferred)
  • Exposure to:
    • SKU onboarding / NPI / qualification workflows
    • Virtualization and VM platforms
  • Knowledge of:
    • Server hardware (CPU, memory, NICs, GPUs, storage)
    • Firmware (BIOS/BMC) and driver dependencies
    • Compute hardware engineering concepts

Apply for this position

Allowed Type(s): .pdf, .doc, .docx