Position Summary
The Infrastructure Engineer is responsible for designing, implementing, and managing the cloud infrastructure and platform services that power the Company's applications, data systems, and internal tools. This role helps ensure systems are secure, reliable, and scalable to support ongoing business operations and growth.
Working closely with engineering, data, and product teams, the Infrastructure Engineer assists with managing cloud environments, automating infrastructure provisioning, supporting application deployments, and monitoring system performance. This role contributes to maintaining stable technology platforms and improving infrastructure processes that support the Company's products and services.
Essential Duties and Responsibilities
- Cloud Architecture: Architect and manage scalable, secure, and highly available cloud infrastructure primarily within Amazon Web Services (AWS), including EC2, RDS, S3, VPC, and IAM.
- Infrastructure-as-Code (IaC): Lead the implementation and maintenance of infrastructure using Terraform or similar tools, ensuring all environments are version-controlled and reproducible.
- Automation & Scripting: Develop and maintain automation scripts (Python, Bash, or Go) to eliminate manual tasks and improve operational efficiency.
- Containerization: Manage and optimize containerized application environments using Docker and orchestration platforms (e.g., Kubernetes or AWS ECS).
- CI/CD Pipeline Management: Design and support robust Continuous Integration and Continuous Deployment (CI/CD) pipelines to automate application releases and reduce deployment risk.
- Observability & Reliability: Implement and manage modern observability frameworks (logging, metrics, and tracing) to proactively monitor system health and performance.
- Performance Engineering: Monitor system capacity and performance, performing right-sizing exercises and scaling configurations to meet business growth.
- Security & Compliance: Enforce infrastructure security best practices, including access control (Least Privilege), network hardening, and automated vulnerability scanning.
- FinOps & Cost Control: Partner with leadership to monitor cloud spend and implement cost-optimization strategies.
- Incident Response: Troubleshoot complex infrastructure-related issues and participate in a blameless post-mortem culture to prevent recurrence.
- Documentation: Maintain high-quality technical documentation for infrastructure designs, standard operating procedures, and disaster recovery plans.
- On-Call Support: Participate in an on-call rotation to ensure the 24/7 reliability of production systems.
- Other duties as assigned.
Qualifications
Education
Bachelor's degree in computer science, Information Systems, Engineering, or a related field preferred. Equivalent practical experience may be considered.
Experience
- 3–5 years of experience in infrastructure engineering, DevOps, systems engineering, or related technology roles.
- Proven experience managing production-grade AWS environments.
- Strong hands-on experience with Terraform (module development and state management).
- Experience managing Linux-based systems at scale (Ubuntu, Amazon Linux, or RHEL).
- Direct experience building and maintaining CI/CD pipelines (e.g., GitHub Actions, GitLab CI, or Jenkins).
Knowledge, Skills, and Abilities
- Cloud Proficiency: Deep understanding of AWS networking (VPC, Peering, Transit Gateways) and security groups.
- Container Mastery: Proficiency in Docker; experience with Kubernetes (EKS) is highly desirable.
- Scripting: Proficiency in at least one scripting language (Python preferred).
- Version Control: Expert knowledge of Git-based workflows.
- Problem Solving: Strong analytical skills with the ability to troubleshoot complex, distributed systems.
- Communication: Excellent verbal and written communication skills with the ability to explain technical concepts to non-technical stakeholders.
Preferred Qualifications
- Experience with Serverless architectures (AWS Lambda, Fargate).
- AWS Certified Solutions Architect or SysOps Administrator certification.
- Experience with Data Infrastructure (managing Snowflake, Redshift, or Kafka clusters).
- Familiarity with container orchestration technologies such as Kubernetes or ECS.
Physical Requirements
This position primarily operates in a professional office or remote environment and routinely uses standard office equipment such as computers and phones. The ability to remain stationary for extended periods and operate a computer is required.
What We Offer:
- Flexible PTO
- Generous parental and caregiver leave
- 401K match
- Excellent medical, dental, and vision plans
- Remote-first culture
- Annual $1000 tuition reimbursement stipend
- And more!
The expected annual salary for this position is between $120,000 and $140,000, with the opportunity to earn an annual bonus of up to 15% of the base salary.