Senior Infrastructure Engineer
Role: Full-Time, Senior IC with Path to Tech Leadership
Location: Northern Virginia (Ashburn/Reston/Sterling) or DC metro preferred for proximity to our NTT VA1 datacenter. Dallas, TX candidates are considered, with the expectation of frequent travel to Ashburn (typically 1 week per month, sometimes more for major deployments or incidents).
Mission: Own HypeProxies' infrastructure end-to-end, stabilize our hosting fleet across Ashburn and Dallas, and grow into the technical leader of the engineering team.
The Company
HypeProxies is one of the fastest-growing infra-first proxy and server companies in the market.
We've built:
- A 20+ cabinet footprint in Ashburn, VA and Dallas, TX
- A world-class brand and reputation
- 1,000+ servers sold and managed internally
- A Proxmox-based VPS platform serving hundreds of B2B and data collection teams
- A custom proxy engine and VPS platform
- A small, technical team that outperforms larger teams
What we need next is a Senior Infrastructure Engineer who can own the entire infrastructure layer.
A scrappy, hands-on operator who blends:
- Deep Linux and virtualization expertise
- Linode/Vercel-level infrastructure thinking
- Hosting/ISP operational instincts
- Hardware-level fluency (iDRAC, IPMI, BIOS, BGP, Redfish API)
- Bias toward action and self-direction
- AI-native working style
- Ability to write code and automate, not just operate
- Extreme ownership
- Documentation discipline
What You Will Own
- Own Proxmox at scale across VA1 and DA6 (500+ VPS servers)
- Own the monitoring and alerting buildout (fix gaps in hardware and software monitoring)
- Own hardware asset tracking and make it the source of truth
- Own backup architecture (we sell backup services and need the architecture to be bulletproof)
- Own access control hardening across infrastructure
- Lead the migration from Squid to our custom proxy engine alongside the CEO
- Build runbooks for every standard procedure
- Drive Ansible playbook adoption for repeatable deployments
- Own or delegate response to alerts end-to-end
- Coordinate with datacenter remote hands when physical work is needed
- Visit datacenters in person when on-site work is required
- Eventually lead a small team of junior engineers and interns as we grow
You will be the technical backbone of the company.
Responsibilities
1. Infrastructure Stability & Operations
- Own the day-to-day health of the entire infrastructure
- Build and maintain monitoring systems (node health, iDRAC checks, network alerts, etc.)
- Standardize firmware, BIOS, and iDRAC configurations across all servers
- Audit the fleet end to end and implement automation across it
- Implement bulletproof backup architecture for VPS customers
- Drive incident response and post-mortems
2. Documentation & Knowledge Capture
- Build a dependable runbook library
- Document standard procedures: Squid restart, Proxmox node deployment, iDRAC bootstrap, network failover
- Capture tribal knowledge from the existing team and integrate it into the documentation
- Standardize onboarding procedures for technical hires
3. Automation & Tooling
- Write Ansible playbooks for repeatable infrastructure deployments
- Build AI-enabled monitoring agents that page the right person at the right time
- Identify manual processes and automate them with AI
- Reduce toil systematically
4. Network & Hardware Coordination
- Work alongside the Network Engineers on network architecture (BGP, transit, IX peering)
- Coordinate hardware procurement, installation, and decommissioning
- Manage the relationship with remote hands at our datacenter POPs
- Visit datacenters in person when issues require on-site eyes
5. Leadership Trajectory
- In the first 6-12 months: prove ownership as a strong senior IC
- In the next 12-24 months: hire and mentor junior engineers and interns
- Long-term: grow into Head of Engineering / CTO as the company scales
KPIs
- Fleet uptime above 99.95%
- Monitoring coverage at 100% of production nodes within 90 days
- Documented runbooks for every standard procedure within 6 months
- Backup system fully implemented and tested within 90 days
- Reduction in time-to-resolve for incidents quarter over quarter
- Migration completed without customer-impacting outages
You Are a Fit If…
You:
- Have 4-8 years in hosting operations, ISP infrastructure, or B2B colo environments
- Have run Proxmox, proxy software, KVM, or VMware at meaningful scale
- Are fluent in Linux administration, iDRAC, IPMI, and server hardware
- Have worked with BGP, transit, and data center networking concepts
- Have written Ansible, Bash, or Python to automate infrastructure work
- Are using AI agents to automate projects and monitoring
- Can identify issues independently and fix them without oversight
- Can hold yourself to high standards without daily oversight
- Want to grow into a tech leadership role over the next 24 months
- Are excited to work directly with the CEO on hard technical problems
You Are Not a Fit If…
We want to ensure a good fit by being transparent:
- You need clear processes, defined runbooks, and a mature ecosystem to be productive. We don't have those yet; you'll be building them
- You're coming from a 1000+ person enterprise and expect that pace and structure
- You wait for tickets to be assigned to you instead of grabbing them
- You see chaos as a blocker rather than an opportunity to create order
- You're looking for a clock-in, clock-out 9-to-5 with predictable workloads
- You haven't built or broken something on your own time in the past year
What Makes This Role Special
- You'll own the infrastructure for a profitable, growing B2B infra company at a pivotal scaling moment
- You'll work directly with the CEO on architecture decisions
- You'll have full ownership of your domain
- You'll grow into a technical leadership role naturally as the team scales
- You'll see your decisions ship the same day you make them
- You'll be a key hire in a small team where your fingerprints will be visible on everything
Compensation & Benefits
- Base: $100,000 - $150,000. The salary range varies based on location, years of experience, background, and skill set.
- Performance bonus tied to fleet uptime, project delivery, and customer-impacting incident reduction
- Equity after 12 months based on performance
- Health insurance
- 4 weeks of PTO per year, plus 2 additional days for each completed year under a progressive PTO policy
If you want to join a fast-growing product company, be among the first technical hires, and have the chance to help shape and scale the infrastructure as the company grows, come talk to us.