Senior Customer Experience Engineer – Cloud Reliability, SLO Monitoring & Observability Lead at arenaflex

Remote · USA · Full-time
```html

About arenaflex – Pioneering the Future of Cloud Excellence

At arenaflex, we empower every person and every organization on the planet to achieve more through a resilient, secure, and innovative cloud platform. Our customers entrust us with their most critical workloads, brand reputation, and business continuity. When they succeed, we all win. This is why the arenaflex Cloud Customer Experience (CXP) team exists – to turn every arenaflex Cloud customer into a lifelong fan.

Our culture is built on a foundation of customer obsession, relentless measurement, and collaborative problem‑solving. We champion diversity and inclusion, believing that authentic voices and varied perspectives drive the best outcomes for our customers and for each other. If you thrive in a fast‑moving, start‑up‑like environment where you can shape the future of observability, automation, and proactive reliability, you’ll feel right at home with us.

Why This Role Matters – The Impact You’ll Have

As a Senior Customer Experience Engineer on the Observability team, you will be the guardian of reliability for the most demanding arenaflex Cloud customers. You’ll design, implement, and evolve Service Level Objective (SLO) monitoring solutions that keep critical applications running smoothly, meet contractual commitments, and exceed expectations. Your work will directly influence the health of thousands of services, the satisfaction of millions of end‑users, and the strategic growth of arenaflex.

Key Responsibilities

  • Customer‑Centric SLO Definition: Partner with customers to co‑create Service Level Objectives (SLOs) and Service Level Indicators (SLIs) that align with their business goals, risk tolerance, and compliance requirements.
  • Instrumentation & Measurement: Embed instrumentation into customer workloads, develop code libraries, and configure telemetry pipelines to accurately capture SLO‑related metrics.
  • Breach Detection & Automation: Build automated detection mechanisms, alerting rules, and remediation playbooks that proactively address SLO breaches before they impact end‑users.
  • Cross‑Team Collaboration: Work hand‑in‑hand with arenaflex service engineering, platform reliability, and product teams to map customer‑defined SLOs to internal platform SLOs, ensuring seamless correlation and root‑cause analysis.
  • Data‑Driven Insight Generation: Analyze SLO performance data, identify trends, and produce actionable insights that drive continuous improvement and risk mitigation.
  • Proactive Customer Engagement: Conduct regular SLO health reviews, present performance dashboards, and advise customers on optimization strategies.
  • Performance Optimization: Lead initiatives to enhance system scalability, latency, and throughput, consistently surpassing defined SLO thresholds.
  • Documentation & Knowledge Sharing: Author comprehensive documentation, runbooks, and best‑practice guides for both internal teams and external customers.
  • Mentorship & Thought Leadership: Coach junior engineers, champion reliability best practices, and contribute to arenaflex’s broader observability strategy.

Essential Qualifications

  • Education & Experience: Bachelor’s degree in Engineering, Computer Science, or a related discipline, plus a minimum of 4 years of experience designing, implementing, and launching commercial software or web services. Equivalent practical experience is also acceptable.
  • Reliability Expertise: At least 3 years of hands‑on experience in Site Reliability Engineering (SRE) or Customer Reliability Engineering (CRE) within a cloud environment (arenaflex Cloud, AWS, or GCP).
  • SLO/SLI Mastery: Proven track record of implementing and managing SLOs and SLIs for enterprise‑grade cloud customers.
  • Customer‑Facing Skills: Minimum 2 years of experience in a client‑facing role, demonstrating strong communication, empathy, and the ability to translate technical concepts into business value.
  • Technical Proficiency: Deep familiarity with observability stacks (e.g., Prometheus, Grafana, OpenTelemetry), scripting languages (Python, PowerShell, Bash), and infrastructure‑as‑code tools (Terraform, ARM templates).
  • Security Clearance: Ability to pass the arenaflex Cloud background check and meet any additional security screening requirements.

Preferred Qualifications & Additional Strengths

  • Master’s degree in Engineering or a related field, plus 6+ years of software industry experience.
  • 8+ years of experience building large‑scale, high‑availability cloud services.
  • Demonstrated success in designing automated remediation workflows and self‑healing systems.
  • Experience with AI‑driven anomaly detection, predictive analytics, or machine‑learning‑based reliability solutions.
  • Published articles, conference talks, or open‑source contributions related to observability, reliability, or SLO engineering.
  • Fluency in multiple programming languages and a passion for continuous learning.

Core Skills & Competencies

  • Analytical Mindset: Ability to dissect complex telemetry data, spot patterns, and formulate data‑driven recommendations.
  • Collaboration & Influence: Strong partnership skills to work across product, engineering, and support teams, driving consensus on reliability goals.
  • Communication Excellence: Clear, concise, and persuasive communication—both written and verbal—tailored to technical and non‑technical audiences.
  • Automation First: Passion for building reusable, scalable automation that reduces manual toil and accelerates incident response.
  • Customer Empathy: Deep understanding of customer business outcomes, enabling you to align technical solutions with strategic objectives.
  • Growth Mindset: Commitment to personal development, staying current with emerging cloud technologies, reliability frameworks, and industry best practices.

Career Growth & Learning Opportunities

arenaflex invests heavily in the professional development of its people. In this role you will:

  • Gain exposure to the most demanding enterprise workloads across a global customer base.
  • Participate in internal reliability guilds, hackathons, and mentorship programs.
  • Access a rich library of training resources, certifications, and conference sponsorships.
  • Progress to senior technical leadership positions such as Principal Reliability Engineer or Reliability Architecture Lead, or move into product management roles focused on observability.
  • Contribute to arenaflex’s open‑source initiatives, enhancing your industry reputation.

Work Environment & Culture at arenaflex

Our teams operate with the agility of a start‑up while benefiting from the resources of a global leader. You’ll find:

  • Inclusive Culture: A workplace where diverse perspectives are celebrated, and every voice is heard.
  • Flexible Work Options: Hybrid or fully remote arrangements, flexible hours, and generous paid time off to support work‑life harmony.
  • Innovation‑Driven Atmosphere: Freedom to experiment, prototype, and iterate on new reliability solutions.
  • Collaborative Spaces: Virtual and physical collaboration hubs designed for brainstorming, pair‑programming, and knowledge sharing.
  • Recognition Programs: Regular acknowledgment of outstanding contributions through awards, spot bonuses, and public shout‑outs.

Compensation, Perks & Benefits

arenaflex offers a competitive total rewards package that includes:

  • Base salary ranging from $112,000 to $218,400, adjusted for location (ranges are higher in major metros such as the San Francisco Bay Area and New York City).
  • Annual performance bonuses and equity grants aligned with company growth.
  • Comprehensive health, dental, and vision plans, including mental‑health resources.
  • Retirement savings plans with generous employer matching.
  • Paid parental leave, family‑care assistance, and flexible vacation policies.
  • Professional development stipend, certification reimbursements, and access to a global learning platform.
  • Wellness programs, on‑site fitness centers (where applicable), and employee resource groups.

Commitment to Equality & Accessibility

arenaflex is an equal‑opportunity employer. We evaluate all candidates without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by law.

If you require accommodations during the application or interview process, please let us know. We are dedicated to providing an accessible experience for all candidates.

Ready to Shape the Future of Cloud Reliability?

If you are a customer‑obsessed engineer with a passion for observability, automation, and delivering world‑class reliability, we want to hear from you. Join arenaflex’s mission‑driven team, work alongside industry leaders, and help our customers turn their most ambitious cloud aspirations into reality.

Apply Now and start your journey with arenaflex today!

```
