Pythian

Linux Site Reliability Consultant

Costa Rica

Compensation: Not Specified
Experience Level: Senior (5 to 8 years), Expert & Leadership (9+ years)
Job Type: Full Time
Visa: Unknown
Industries: Information Technology, Consulting

Site Reliability Consultant

Location: Costa Rica | Remote | Work from Home
Employment Type: Full Time

Position Overview

Pythian is building a next-generation Site Reliability Engineering team and is seeking motivated and talented individuals to join. As a Site Reliability Consultant, you will act as a technology leader and advisor for clients, as well as a mentor for other team members. Projects will focus on infrastructure architecture, automation, and intelligent monitoring systems, from design through implementation. If you are passionate about data and eager to advance your career, this role is for you.

Responsibilities

  • Operate, maintain, and administer solutions that improve the operational efficiency, availability, and visibility of customer infrastructure.
  • Plan maintenance activities, create design documentation, and develop standard procedures.
  • Provide Root Cause Analysis reports for outages/incidents (ITIL - Problem Management).
  • Observe and provide feedback on the current state of client infrastructure, identifying opportunities for improvement in resiliency, incident reduction, and automation of repetitive tasks.
  • Contribute to, improve, and maintain team documentation regarding client systems, infrastructure, procedures, policies, and schedules.
  • Gather and document information about client environments through audit activities, analyzing it to identify improvement opportunities and best practices.
  • Collaborate with teammates to foster continuous improvement in the team's working culture.
  • Act as a technology leader for clients and drive discussions on technology roadmaps.
  • Participate in an on-call rotation in an escalation capacity.

Requirements

  • Experience with Google Cloud and AWS, including infrastructure-as-code deployment (CloudFormation, Terraform, OpsWorks, etc.).
  • Proficiency in scripting and automation of administrative tasks using Python and Scala.
  • Solid understanding of microservices architecture and container technologies (Kubernetes is a must; Docker, LXC, etc.).
  • Clear understanding of software development lifecycles and best practices from an infrastructure perspective (PRs, merge, rebase, etc.).
  • Understanding of end-to-end operations of a ‘Business System’ versus its components.
  • Comprehensive systems hardware and network troubleshooting experience.
  • Experience with common Linux distribution platform installation, configuration, performance tuning, and cloud migration.
  • Knowledge of TCP/IP networking, NIC bonding, and network services configuration (DNS, NTP, DHCP, SMTP, etc.).
  • Experience with the operation and administration of virtual infrastructure, including at least one hypervisor (VMware, Hyper-V, KVM, etc.).
  • Ability to describe IaaS, PaaS, SaaS, their pros and cons, and use cases for virtualization and cloud.
  • Experience with administration of web servers and supporting technologies, including network load balancers.
  • Experience designing, developing, and deploying Puppet.
  • Experience investigating system and application errors and troubleshooting access/availability issues, including deep multi-system root cause analysis.
  • Experience managing networking devices, such as switches and firewalls from various vendors.
  • Solid understanding of DevOps tools, processes, and culture.
  • Ability to quickly learn new technologies.
  • Ability to provide accurate schedules and task estimates for work delivery.

What You Get in Return

  • Love your career: Competitive total rewards package. Opportunities to blog during work hours and take time off to volunteer for your favorite charity.
  • Love your work/life balance: Flexible remote work from home, with no daily commute to an office. All you need is a stable internet connection.

Skills

Site Reliability Engineering
Infrastructure Architecture
Automation
Intelligent Monitoring Systems
Root Cause Analysis
ITIL
Problem Management
Resiliency
Linux Administration
Documentation
Audit Activities
Best Practices

Pythian

Cloud migration and data management services

About Pythian

Pythian assists businesses in managing and optimizing their data and IT infrastructure through services like cloud migration, managed services, and advanced analytics. They help companies transfer their data to cloud platforms such as Google Cloud, AWS, and Microsoft Azure, while providing ongoing support for smooth operations. Pythian differentiates itself by offering specialized services in machine learning and data science, enabling businesses to turn their data into valuable insights. Their goal is to empower organizations to leverage cloud computing and advanced analytics to improve operations and drive growth.

Headquarters: Ottawa, Canada
Year Founded: 1997
Total Funding: $20.4M
Company Stage: Early VC
Industries: Consulting, Enterprise Software, AI & Machine Learning
Employees: 501-1,000

Benefits

Remote Work Options
Flexible Work Hours
Paid Vacation
Paid Sick Leave
Wellness Program
Professional Development Budget
401(k) Company Match

Risks

Emerging cloud providers offering lower-cost services increase competition.
Rapid AI advancements may outpace Pythian's current capabilities.
Economic downturns in key industries could reduce IT service spending.

Differentiation

Pythian offers specialized services in machine learning and AI for data insights.
Their EDP QuickStart provides rapid deployment of enterprise data platforms.
Pythian's global presence with experts in 22 countries enhances their service delivery.

Upsides

Growing demand for cloud migration boosts Pythian's service offerings.
Expansion in the database management market offers growth opportunities for Pythian.
Increased focus on cybersecurity drives demand for Pythian's Adminiscope.
