Job Summary
The Cloud Engineer collaborates with the Cloud Architecture and engineering teams in the implementation and operation of Alarm.com’s multi-site data center infrastructure. This position works closely with the Network Operations, Software Engineering and Quality Engineering teams in meeting and exceeding Alarm.com’s internal and external SLAs around maximum availability, security, recoverability and compliance. This includes monitoring, validating changes, gathering and reporting metrics, testing, incident management, and change management.
Primary Job Responsibilities
- Designing and implementing private cloud, public cloud and hybrid solutions
- Monitoring, maintaining and updating hardened configurations and baselines
- Implementing availability and performance monitoring frameworks
- Implementing and testing system level High Availability and Disaster Recovery plans
- Tracking, controlling, and reporting status of system conditions, software, documentation, and infrastructure changes to management
- Troubleshoot and remediate issues across the infrastructure and create plans to prevent the same problem in the future
- Develop and maintain automation scripts or frameworks
- Providing high quality support to customers, prospects, management and peers
- Other Duties as assigned
Requirements
- BS in Computer Science or related field
- 3+ years relevant work experience in private / public cloud, SAN and networking
- Working knowledge of hyper-converged infrastructure
- Experience supporting multi-site and hot-hot architectures
- Working knowledge of Windows Server operating systems is a must, including security patching using SCCM, Failover Clustering, DHCP, DNS, and Active Directory
- Experience with VMware data center stack; vSphere, Cloud Foundations, Omnissa Horizon/ VDI, and the Aria Suite
- Understanding of storage infrastructure solutions such as NetApp Data ONTAP and Pure Storage Purity OS
- Proficient in block (FC, iSCSI) and file (NFS, SMB/CIFS) storage protocols
- Familiarity with Cisco MDS SAN switching, zoning, and fabric configuration
- Working foundation of datacenter networking fundamentals; layers of traffic, VLANs, segmentation of networks
- Experience with Cisco UCS Manager and Intersight for compute management; FlexPod and FlashStack experience preferred
- Experience implementing and maintaining monitoring frameworks on both commercial off-the-shelf and open source tools
- Experience implementing scripts for automation and systems management: PowerShell, Python, or Ansible
- Experience with enterprise backup and recovery solutions, such as Rubrik
- Familiarity with Jira or similar ITSM tools and working knowledge of ITIL-based processes
- Familiar with multi-tiered escalation and on-call procedures
- Experience working in SOX, FISMA, HIPAA, and PCI-compliant multi-tenant environments
- Working knowledge of Linux operating systems (Red Hat, Ubuntu, or CentOS) in enterprise settings
- Ability to work collaboratively within a team environment using Agile methodologies
- Self-directed approach with high degree of initiative to propose new solutions, troubleshoot, and resolve issues
- Rack, cable, and install servers, storage, and network hardware
- Perform hardware diagnostics, replacements, and upgrades
- Manage structured cabling (fiber and copper), labeling, and cable tracing
- Document rack elevation DCIM, and asset inventory
- Support OOB management setup
- Coordinate equipment deliveries, decommissions, and secure disposal
Additional Information
- Please note that sponsorship of new applicants for employment authorization, or any other immigration-related support, is not available for this position at this time.
Why Work for Alarm.com?
- Collaborate with outstanding people: We hire only the best. Our standards are high and our employees enjoy working alongside other high achievers.
- Make an immediate impact: New em