Daxko builds the software that powers fitness, wellness, and community organizations—everything from member management and payments to digital engagement and analytics. From small studios to the largest associations, we help thousands of fitness businesses streamline operations, enhance member experiences, and drive sustainable revenue growth. We’re looking for a Manager of Site Reliability Engineering (SRE) who is passionate about building resilient systems and leading teams that keep critical services running smoothly. In this role, you’ll guide a team responsible for the reliability, performance, and operational health of our production environments. You’ll partner closely with engineering leaders to ensure our systems remain secure, scalable, and available for the organizations and communities who depend on them. What You’ll Do As the Manager of Site Reliability Engineering, you will lead a team responsible for the operational reliability of Daxko’s production platforms. Your work will focus on creating stable, high-performing systems while empowering your team to continuously improve how we operate and support our products. You Will Also
Lead and support a team responsible for the reliability and performance of production systems, which includes:
Setting clear performance expectations and goals for team members
Providing ongoing coaching and real-time feedback
Ensuring team members have the training and resources they need to succeed
Coordinating on-call rotations and operational coverage
Supporting the team during critical incidents and outages
Managing team staffing, including hiring and headcount planning
Prioritize and coordinate work across operational initiatives, deployments, upgrades, and infrastructure improvements
Ensure high levels of system uptime, data integrity, and operational stability
Partner with Engineering Leads to align platform operations with product development needs
Maintain business continuity across all production assets
Monitor system health, performance, and capacity to proactively identify and resolve issues
Serve as a technical escalation point for complex infrastructure or platform challenges
Provide regular reporting on system availability, response times, and capacity trends
Ensure operations meet security, compliance, and regulatory requirements
Support and coordinate the team’s on-call rotation and incident response processes
Continuously improve operational practices through automation, tooling, and monitoring
Technologies You’ll Work With Our platform relies on modern infrastructure and cloud technologies. Strong experience with several of the following areas is important:
Linux-based systems
Web server technologies (NGINX, PHP, Traefik, F5)
Virtualization platforms such as VMware
Cloud platforms including AWS and Azure
Containerization and orchestration (Docker, Kubernetes, Dynos)
Messaging and caching technologies (Redis, RabbitMQ)
A strong security mindset and experience implementing infrastructure security controls are essential.
What You Bring You’re a thoughtful technical leader who enjoys solving complex operational challenges and helping engineers grow. We’re Looking For Someone Who Brings
Strong analytical and problem-solving skills
Clear communication and collaboration skills
Experience leading teams in fast-moving technical environments
The ability to balance multiple priorities and make thoughtful decisions under pressure
Strong organizational and time management skills
A customer-focused mindset and commitment to system reliability
Bachelor’s degree in a technical discipline or equivalent professional experience
3–5 years of experience leading or managing globally distributed engineering teams
3–5 years of experience in a Site Reliability Engineering or similar infrastructure-focused role
Preferred Experience
Experience serving as a technical lead on infrastructure or platform teams
Experience with modern observability and monitoring tools, such as OpenTelemetry, Instana, LogicMonitor, PagerDuty, or OpsGenie
Experience with infrastructure and automation tooling such as GitLab CI, Jenkins, Chef, Terraform, Elasticsearch, Kubernetes, or Rancher
Scripting experience in Ruby, Python, or Bash
Familiarity with SOC, PCI, or GDPR compliance standards
Experience working with issue tracking and collaboration tools such as the Atlassian suite
Experience supporting or developing applications built with Java, PHP, or Node
Experience automating operational processes and repetitive tasks
Daxko is dedicated to pursuing and hiring a diverse workforce. We are committed to diversity in the broadest sense, including thought and perspective, age, ability, nationality, ethnicity, orientation, and gender. The skills, perspectives, ideas, and experiences of all of our team members contribute to the vitality and success of our purpose and values. We truly care for our team members, and this is reflected through our offices, and benefits, and great perks. These perks are only for our full-time team members. Some Of Our Favorites Include 🏝 Flexible paid time off ⚕️ Affordable health, dental, and vision insurance options 💪 Monthly fitness reimbursement 🤑 401(k) matching 🍼 New-Parent Paid Leave 👖 Casual work environments 🏡 Flexible work - remote & hybrid All your information will be kept confidential according to EEO guidelines. Where you fall within the compensation range is based on how you demonstrate the skills and competencies needed for the role. We typically reserve the upper half of our compensation bands for team members who have grown within Daxko. In addition to base salary, some roles may be eligible for bonuses, commissions, or other performance-based incentives. We also offer a comprehensive benefits package, recognition programs, and plenty of opportunities to grow your career with us. The pay range for this role is $139,400 – $217,400 per year.