Senior Site Reliability Engineer at Prove

United States

Prove Logo
Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
TechnologyIndustries

Requirements

  • Experienced in designing, implementing, maintaining, and deploying highly available, complex, scalable, and reliable systems leveraging automation, effective monitoring, and infrastructure-as-code
  • Ability to work closely with application engineering teams to ensure services meet high standards of reliability, performance, and security
  • Expertise in observability solutions, including Opentelemetry for companywide instrumentation standards
  • Proficiency in building advanced monitoring dashboards, metrics, logging, tracing systems, alerting thresholds, and automated responses based on SLOs
  • Leadership in Kubernetes cluster management, optimization, scaling, and container orchestration
  • Skills in designing and implementing infrastructure-as-code deployments (e.g., Terraform) for container-based applications
  • Experience optimizing container resource allocation, utilization, and building automated deployment pipelines
  • Knowledge of scalable cloud infrastructure on AWS, including security compliance, least-privilege access controls, and cost optimization
  • Capability to automate routine operational tasks to reduce toil
  • Strong incident response skills, including integration with incident management systems, leading responses, post-incident reviews, root cause analysis, and documentation
  • Ability to identify and resolve performance bottlenecks across the technology stack
  • Self-starting professional who thrives in a fast-paced environment, processes information quickly, makes intelligent decisions, demonstrates natural curiosity and tenacity, and excels in teamwork

Responsibilities

  • Design and implement comprehensive observability solutions across infrastructure and applications
  • Lead the initiative to establish a companywide instrumentation standard based on Opentelemetry
  • Build advanced monitoring dashboards for real-time visibility into system health and performance
  • Establish metrics, logging, and tracing systems for quick issue identification and resolution
  • Create alerting thresholds and automated responses based on service level objectives (SLOs)
  • Drive a culture of observability throughout the engineering organization
  • Lead Kubernetes cluster management, optimization, and scaling initiatives
  • Design and implement infrastructure-as-code deployments for container-based applications
  • Optimize container resource allocation and utilization
  • Build automated deployment pipelines for consistent, reliable releases
  • Establish best practices for containerization and orchestration across teams
  • Design, build, and maintain scalable cloud infrastructure on AWS
  • Implement infrastructure-as-code using tools such as Terraform
  • Automate routine operational tasks to reduce toil and improve efficiency
  • Ensure infrastructure security compliance and implement least-privilege access controls
  • Optimize cloud resource utilization and costs
  • Integrate observability-driven alerts with Incident Management systems
  • Lead incident response efforts during service disruptions
  • Conduct thorough post-incident reviews and implement preventative measures
  • Use observability data to perform root cause analysis and system improvements
  • Document incidents, responses, and lessons learned to build organizational knowledge
  • Identify and resolve performance bottlenecks across the technology stack

Skills

Site Reliability Engineering
Observability
Monitoring
Infrastructure as Code
Automation
High Availability
Scalability
Reliability

Prove

Identity verification and authentication solutions

About Prove

Prove specializes in identity verification and authentication services, primarily serving clients in the financial sector. Its solutions are designed to secure transactions across various platforms, including mobile, desktop, call centers, and chat services. Prove's products work by utilizing a privacy-first approach that incorporates decentralized data architecture and identity tokenization, ensuring that user consent is prioritized and data aggregation is minimized. This focus on security and privacy sets Prove apart from its competitors, as it has built a reputation as a trusted partner for major financial institutions. The company's goal is to provide scalable and effective authentication solutions that enhance security for over 1,000 enterprise customers and 500 banks globally, while also maintaining a commitment to user privacy.

New York City, New YorkHeadquarters
2008Year Founded
$245.2MTotal Funding
LATE_VCCompany Stage
Fintech, Cybersecurity, Financial ServicesIndustries
1-10Employees

Benefits

Dental, Vision, Health, & Life Insurance
Well-Being Reimbursement
401K / Retirement Plan
PTO / Vacation Policy
Paid Holidays
Maternity / Paternity Leave

Risks

Generative AI intensifies threats like scraping and fraud, challenging Prove's API solutions.
Deepfake technology threatens trust in Prove's phone-based authentication methods.
EU's eIDAS 2.0 regulation may increase Prove's operational costs to meet compliance.

Differentiation

Prove specializes in phone-centric identity verification, enhancing security and consumer privacy.
Prove's decentralized data architecture limits data aggregation, emphasizing a privacy-first approach.
Prove serves 9 of the top 10 US financial institutions, showcasing its industry leadership.

Upsides

Prove's self-service platform simplifies identity verification, improving customer experience and reducing fraud.
The rise of digital wallets increases demand for Prove's secure digital identity solutions.
Prove's expertise in phone-based authentication addresses the growing threat of business identity theft.

Land your dream remote job 3x faster with AI