[Remote] Senior Site Reliability Engineer at ScienceLogic

Reston, Virginia, United States

ScienceLogic Logo
Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Software, SaaSIndustries

Requirements

  • 8-12 years of site reliability engineering, cloud operations or equivalent experience
  • Proven experience in managing complex Kubernetes environments in multiple Production systems
  • Working with Cloud Automation tools like CloudFormation, Terraform, aws-cli/CDK, Cloudformation
  • Scripting languages like Python, Bash, Perl
  • Exposure to Linux administration skills
  • Proven track record of operating production SaaS environments within security standards like FedRAMP, SOC2, ISO, PCI
  • Skilled at problem solving, algorithms, and data structures conforming to the modern SaaS security requirements
  • Building tools and scripting frameworks from scratch
  • Familiarity with basic networking, security and cloud engineering concepts
  • Bachelors or Master's degree in Computer Science, Information Systems or similar field

Responsibilities

  • Lead design reviews and buildout of secure systems for delivering new Artificial Intelligence Product in SaaS, aiming for 99.99% uptime
  • Design, automate, test, and monitor the use of cloud native technologies as a foundation for a service platform
  • Spend 75% of time on forward-looking priorities designing and building SaaS systems while remaining on supporting the Operations and Maintenance of the current SaaS infrastructure
  • Investigate and resolve customer and operational issues with the mentality of fixing and not just mitigating issues
  • Identify and automate measurement of operations SLAs and SLOs
  • Triage incident response, document SOPs, Runbooks, and train NOC team members
  • Write automation that can be easily supported and extended by others
  • Collaborate across the organization to design, build and operationalize SaaS services conforming to various security standards like FedRAMP, SOC2, ISO etc
  • Participate in the on-call rotation as assigned
  • Take full responsibility for the availability and performance of the platform
  • Work on special projects as assigned

Skills

Site Reliability Engineering
Cloud Technologies
Automation
SaaS
Incident Response
SOPs
Runbooks
FedRAMP
SOC2
ISO
Design Reviews
System Design
Monitoring
Troubleshooting

ScienceLogic

IT operations management platform for monitoring

About ScienceLogic

ScienceLogic specializes in IT operations management, providing a platform called SL1 that helps businesses monitor and manage their IT infrastructure and applications. The SL1 platform is designed for organizations that depend on technology, such as large enterprises, managed service providers (MSPs), and government agencies. It offers tools for automating and streamlining IT operations, ensuring that systems run smoothly and efficiently. Clients pay a subscription fee to access SL1, which comes in different service tiers to accommodate various needs and budgets. Additionally, ScienceLogic offers professional services to assist clients in implementing and optimizing the platform. The company's goal is to support businesses in maintaining high performance and reliability in their IT systems.

Reston, VirginiaHeadquarters
2003Year Founded
$228.8MTotal Funding
LATE_VCCompany Stage
Data & Analytics, Enterprise SoftwareIndustries
501-1,000Employees

Benefits

A remote-first culture
Comprehensive medical, dental & vision plans
401(k) plan with employer match
Flexible Paid Time Off (FTO)
Volunteer Time Off (VTO)
5-year service milestone sabbatical
Paid parental leave
Generous employee referral bonus program
Pet insurance
Well-stocked kitchen with rotating snacks and beverages
Regular virtual company-wide events

Risks

Emerging AIOps platforms may offer similar capabilities at lower costs.
Rapid AI advancements could outpace ScienceLogic's integration capabilities.
Potential AI regulatory changes in Europe may increase compliance costs.

Differentiation

ScienceLogic offers AI-driven monitoring for hybrid cloud management, enhancing IT efficiency.
The SL1 platform provides real-time views of IT components across cloud and on-premises.
ScienceLogic's subscription model ensures steady revenue with customizable service tiers.

Upsides

Growing demand for hybrid cloud management boosts ScienceLogic's SL1 platform adoption.
Interest in AIOps aligns with ScienceLogic's AI-driven monitoring solutions.
Partnerships like LTIMindtree expand market reach and enhance service offerings.

Land your dream remote job 3x faster with AI