All roles

Principal Site Reliability Engineer - Remote

Remote · USA Full-time New today

Join a dynamic team at the pulse of global markets, where we deliver innovative software and service solutions for essential financial reporting and capital markets transactions. At DFIN, we are a values-driven organization that empowers you to build a fulfilling career while bringing your authentic self to work every day. Our "Win as One" mentality ensures that our team's success is directly linked to Client, Shareholder and Employee Satisfaction. Recognized as one of AMERICA'S MOST LOVED WORKPLACES for five consecutive years and a Built In Best Places to Work for six years, we are committed to our employees' total well-being. Enjoy competitive compensation, a flexible workplace, comprehensive benefits, and opportunities for professional growth. Bring your passion and talents to DFIN - because being YOU thrives here. Summary: We are looking for technical team members at all levels who want to push themselves to deliver best in market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise. The Principal Site Reliability Engineer - Cloud is responsible for designing, building, securing, monitoring and maintaining our SaaS product cloud infrastructure so it is fast, cost effective, stable and optimized for our customers. SRE's at DFIN take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements. You either have a SaaS cloud infrastructure background in Azure or AWS with a programmatic, automated mindset or are someone that comes with a software engineering background with SaaS cloud infrastructure experience in Azure or AWS. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our products up and running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can lead colleagues independently to deliver solutions to complex problems. Responsibilities:

  • Champion and implement a culture to maintain performant, reliable, secure, cost-effective platform cloud infrastructure in DFIN SaaS products based on operationalized processes you define
  • Champion security of our cloud infrastructure collaborating with Security and Governance teams and using static and dynamic tooling
  • Champion and implement application and cloud infrastructure monitoring and alerting to prevent client impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs
  • Optimize cloud infrastructure and application performance at scale while maintaining effective cost controls
  • Automate cloud infrastructure buildout and maintenance including system operational runbooks
  • Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into operationalized work processes
  • Perform with broad independence and deliver on project milestones and tasks you define on schedule while communicating progress regularly
  • Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
  • Learn continuously and apply lessons learned
  • Evangelize best practices, eliminate bottlenecks, and improve process
  • Participate in on-call duties 365/24/7 and lead the triage and RCA of production incidents Qualifications:
  • 8+ years experience designing, building, securing, monitoring and maintaining cloud infrastructure in Azure or AWS
  • 5+ years experience creating, configuring, maintaining and monitoring Kubernetes clusters (AKS or EKS) in cloud infrastructure to optimize application performance and reliability
  • 5+ years building and deploying Infrastructure as Code with Terraform or similar technology
  • 5+ years experience with common cloud networking, firewall and load balancing configuration
  • 5+ years experience writing software in any modern software language such as C# .NET, Java
  • 5+ years experience creating automated deployments with tools such as Harness, Azure DevOps, Ansible or Jenkins to manage Infrastructure as Code and software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment
  • 5+ years experience implementing production performance, availability, and scalability monitoring and alerting using a tool such as New Relic, Dynatrace, DataDog or AppDynamics
  • 5+ years experience supporting public client facing revenue generating systems
  • Experiencing monitoring and preventing issues with databases and database queries (SQL) using tools like Solarwinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor
  • Experience planning, coordinating, developing and executing all stages of post deployment verification test scripts
  • Experience securing Windows or Linux systems in 24x7 pr

Apply tot his job Apply To this Job

Related roles

Senior Shopify Developer (Remote + Flexible)

Remote · USA Full-time

Site Reliability Engineer 2 DevOps | REMOTE (ship required)

Remote · USA Full-time

Site Reliability Engineer II- Data Platforms (Remote)

Remote · USA Full-time

Social Media Analyst Platform

Remote · USA Full-time

Work from Home | Internet Analyst | Social Media Evaluator

Remote · USA Full-time

AI Software Architect / Developer

Remote · USA Full-time

Software Engineering Manager - Container and Virtualisation Infrastructure

Remote · USA Full-time

Embedded Software Engineer (Remote with Travel)

Remote · USA Full-time

Software Consultant; ProjectSight US Posted

Remote · USA Full-time

Senior Consultant - Enterprise Software

Remote · USA Full-time

Immunogenicity Principal Scientist

Remote · USA Full-time

Experienced Part-Time Data Entry Specialist – Remote Work Opportunity at blithequark

Remote · USA Full-time

Investment Banking & Capital Markets — AI Residency

Remote · USA Full-time

Sr. Manager, Corporate Strategy (Hybrid 3x a week in Secaucus, NJ)

Remote · USA Full-time

Account Manager / Outside Sales Representative - Virginia Beach, VA area

Remote · USA Full-time

Sr. Corporate Travel Advisor, Humanitarian

Remote · USA Full-time

DELIVERY DRIVER - Part Time – Amazon Store

Remote · USA Full-time

REMOTE Outbound Phone Sales Position - Health & Wellness Programs and Supplements

Remote · USA Full-time

VIRTUAL JOB FAIR for CNA, LPN, RN, RNS, HSKP & Dietary! - Far Rockaway

Remote · USA Full-time

[Remote-Position] Customer Service Representative Agent Work From

Remote · USA Full-time