All roles

[Remote] ECO Event Management IST/SET Site Reliability Engineer

Remote · USA Full-time New today

Note: The job is a remote job and is open to candidates in USA. RIT Solutions, Inc. is seeking a Senior Site Reliability Engineer to join their IST/System Engineering Team. The role involves generating monitoring recommendations, improving enterprise reliability, and collaborating with various teams to enhance the stability of applications used by veterans.

Responsibilities

  • Utilize your skills in enterprise-level triage and incident resolution while gaining experience in VA system infrastructure
  • Use modern system monitoring tools to improve VA enterprise reliability and improve the quality of services provided to veterans
  • Work with system and application owners to obtain existing design and functionality, leverage comprehension of workflow systems and applications processes within multiple system environments and work across technology and development teams to diagnose outages and recommend changes to increase reliability
  • Use your hardware and software experience to help strengthen the systems the VA relies on
  • Your primary focus will be investigation, working with event management, application owners, DevOps teams, and system and network administrators to examine issues across enterprise applications and technology stacks
  • Partner with system and application owners to understand their platform designs and how they operate across different environments
  • This insight will help you diagnose outages, trace workflow issues, and recommend changes that enhance stability
  • Collaborate with developers and identity and access teams when deeper technical investigations are needed
  • You'll gain hands‐on experience with enterprise‐level triage and incident analysis, which will deepen your understanding of the VA's infrastructure
  • Tools like SolarWinds, Dynatrace, and Splunk will be part of your daily workflow, giving you the visibility needed to identify reliability concerns and support improvements to the services delivered to veterans

Skills

  • Deep expertise (3+ years) in two or more of the following tools used for troubleshooting application logging in an enterprise environment (Dynatrace, Splunk, SolarWinds, ServiceNow Operator Workspace)
  • Extensive experience in one or more Technology Areas (Network, Windows, Desktop, Unix/Linux, AWS or Azure Cloud, WebSphere Middleware, Java/JS Development, Microsoft or Oracle Database)
  • 8+ years of experience working with key indicators for IT system operability, reliability, application performance, and code quality
  • 8+ years of experience deploying, maintaining, and troubleshooting complex applications at an enterprise scale while working with cross-functional teams
  • 1+ years of experience in service virtualization, AWS or Azure Cloud technologies, and SaaS and PaaS implementation
  • Experience with using Microsoft Office, including Word, Excel, and PowerPoint
  • 2+ years independently leading a team to solve difficult technical challenges
  • HS diploma or GED and 20+ years of relevant professional experience or MA or MS degree in computer science, electronics engineering, or other engineering or technical discipline with 10+ years of relevant professional experience
  • Experience with test-driven development, distributed systems, microservices and cloud-native application implementation
  • Experience with the following tools: Oracle Enterprise Manager, Riverbed – Aternity, and ServiceNow VTBs
  • Possession of excellent written and verbal communication skills
  • Possession of strong critical thinking and error assessment capabilities
  • Virtual team management
  • Public Trust Clearance

Company Overview

  • Jobdiva Job Portal: https://www1.jobdiva.com/candidates/myjobs/searchjobsdone.jsp?a=xbjdnwgjodtga1y1im2g881fkkeiwd0775lbvq8yqgps8vb2q36w2vj1ga6xxork&compid=-1 Recruitment (contingency search and campus selection). It was founded in 2019, and is headquartered in Arlington, Virginia, USA, with a workforce of 201-500 employees. Its website is https://ritsolinc.com.
  • Apply To This Job

    Related roles

    [Remote] Sr Network Security Engineer

    Remote · USA Full-time

    [Remote] Population Health Data Quality Analyst

    Remote · USA Full-time

    [Remote] Full Stack .NET Developer

    Remote · USA Full-time

    [Remote] Wireless Network Systems Engineer (OpenWRT / RADIUS / PPSK)

    Remote · USA Full-time

    [Remote] Senior Android (Kotlin) Engineer — Digital Signage / Embedded Media Platform (Contract)

    Remote · USA Full-time

    [Remote] Staff Engineer - AI

    Remote · USA Full-time

    [Remote] Senior Manager, Artificial Intelligence Data Engineer

    Remote · USA Full-time

    [Remote] Senior Financial Analyst

    Remote · USA Full-time

    [Remote] Finance Specialist - Remote (only considering candidates in EST and CST time zones)

    Remote · USA Full-time

    [Remote] E-Commerce Operations Analyst

    Remote · USA Full-time

    Experienced Data Entry Representatives Wanted for Remote Work Opportunities at arenaflex

    Remote · USA Full-time

    Supervisor, Clinical Documentation Integrity, CDI

    Remote · USA Full-time

    Full-Stack & AI Engineer

    Remote · USA Full-time

    Linguistic AI Auditor (Tagalog)

    Remote · USA Full-time

    CRA II or Sr CRA (sponsor dedicated) - Min 3 years of prev exp in monitoring - Santiago, home-based

    Remote · USA Full-time

    Fullstackutvecklare inom BI

    Remote · USA Full-time

    Customer Service Representative

    Remote · USA Full-time

    Senior Engineer for CRM Customer Acquisitions (REMOTE) at arenaflex

    Remote · USA Full-time

    Client Services Specialist

    Remote · USA Full-time

    Customer Support Representative (Remote) at FedEx

    Remote · USA Full-time