Senior Infrastructure Engineer

HypeProxies • Full-time • Dallas, TX, US • 6d ago

Senior Infrastructure Engineer

Role: Full-Time, Senior IC with Path to Tech Leadership

Location: Dallas, TX candidates considered with the expectation of frequent travel to Ashburn (typically 1 week per month, sometimes more for major deployments or incidents). Northern Virginia (Ashburn/Reston/Sterling) or DC metro preferred for proximity to our NTT VA1 datacenter.

Mission: Own HypeProxies' infrastructure end-to-end, stabilize our hosting fleet across Ashburn and Dallas, and grow into the technical leader of the engineering team.

The Company

HypeProxies is one of the fastest-growing infra-first proxy and server companies in the market.

We've built:

20+ cabinet footprint in Ashburn,VA and Dallas,TX
A world-class brand and reputation
1000+ Servers sold and managed internally
Proxmox-based VPS platform serving hundreds of B2B and Data Collection teams
Custom proxy engine and VPS platform
A small, technical team that outperforms teams of greater size

What we need next is a Senior Infrastructure Engineer who can own the entire infrastructure layer.

A scrappy, hands-on operator who blends:

Deep Linux and virtualization expertise
Linode / Vercel level infrastructure thinking
Hosting/ISP operational instincts
Hardware-level fluency (iDRAC, IPMI, BIOS, BGP,Redfish Api)
Bias toward action and self-direction
Ai-Native
Ability to write code and automate, not just operate
Extreme Ownership
Documentation discipline

What You Will Own

Own Proxmox at scale across VA1 and DA6 (500+ VPS Servers)
Own the monitoring and alerting buildout (Fix gaps in hardware and software monitoring )
Hardware asset tracking and make it the source of truth
Own backup architecture (we sell backup services and need it bulletproof)
Own access control hardening across infrastructure
Lead the migration from Squid to our custom proxy engine alongside the CEO
Build runbooks for every standard procedure
Drive Ansible playbook adoption for repeatable deployments
Own or delegate response to alerts end-to-end
Coordinate with Datacenter remote hands when physical work is needed
Visit Datacenters in person when on-site work is required
Eventually lead a small team of junior engineers and interns as we grow

You will be the technical backbone of the company.

Responsibilities

1. Infrastructure Stability & Operations

Own day-to-day health of the entire Infrastructure
Build and maintain monitoring systems (node health, iDRAC checks, network alerts etc)
Standardize firmware, BIOS, and iDRAC configurations across all Servers
Audit end to end and implement automation across the entire fleet
Implement bulletproof backup architecture for VPS customers
Drive incident response and post-mortems

2. Documentation & Knowledge Capture

Build a dependable runbook library
Document standard procedures: Squid restart, Proxmox node deployment, iDRAC bootstrap, network failover
Capture tribal knowledge from existing team and integrate into documents
Standardize onboarding procedures for technical hires

3. Automation & Tooling

Write Ansible playbooks for repeatable infrastructure deployments
Build ai-enabled monitoring agents that page the right person at the right time
Identify manual processes and automate them with AI
Reduce toil systematically

4. Network & Hardware Coordination

Work alongside the Network Engineers on network architecture (BGP, transit, IX peering)
Coordinate hardware procurement, installation, and decommissioning
Manage the relationship with remote hands at our Datacenter POPs.
Visit DC’s personally when issues require on-site eyes

5. Leadership Trajectory

In the first 6-12 months: prove ownership as a strong senior IC
In the next 12-24 months: hire and mentor junior engineers and interns
Long-term: grow into Head of Engineering / CTO as the company scales

KPIs

Fleet uptime above 99.95%
Monitoring coverage at 100% of production nodes within 90 days
Documented runbooks for every standard procedure within 6 months
Backup system fully implemented and tested within 90 days
Reduction in time-to-resolve for incidents quarter over quarter
Migration completed without customer-impacting outages

You Are a Fit If…

You:

Have 4-8 years in hosting operations, ISP infrastructure, or B2B colo environments
Have run Proxmox, Proxy Software, KVM, or VMware at meaningful scale
Are fluent in Linux administration, iDRAC, IPMI, and Server hardware
Have worked with BGP, transit, and data center networking concepts
Have written Ansible, Bash, or Python to automate infrastructure work
Are using Ai Agents to automate projects and monitoring
Can identify issues independently and fix them without oversight
Can hold yourself to high standards without daily oversight
Want to grow into a tech leadership role over the next 24 months
Are excited to work directly with the CEO on hard technical problems

You Are Not a Fit If…

We want to ensure a good fit by being transparent:

You need clear processes, defined run-books, and a mature ecosystem to be productive - we don't have those yet, you'll be building them
You're coming from a 1000+ person enterprise and expect that pace and structure
You wait for tickets to be assigned to you instead of grabbing them
You see chaos as a blocker rather than an opportunity to create order
You're looking for a clock in clock out 9-to-5 with predictable workloads
You haven't built or broken something on your own time in the past year

What Makes This Role Special

You'll own the infrastructure for a profitable, growing B2B infra company at a pivotal scaling moment
You'll work directly with the CEO on architecture decisions
You'll have full ownership of your domain
You'll grow into a technical leadership role naturally as the team scales
You'll see your decisions ship the same day you make them
You'll be a key hire in a small team where your fingerprints will be visible on everything

Compensation & Benefits

Base: $100,000 - $150,000. The salary range vary based on location, years of experience, background and skill set.
Performance bonus tied to fleet uptime, project delivery, and customer-impacting incident reduction
Equity after 12 months based on performance
Health insurance
4 weeks of paid PTO per year, with an additional 2 days added for each completed year as part of a progressive PTO policy.

If you want to join a fast-growing product company, be among the first technical hires, and have the chance to help shape and scale the infrastructure as the company grows, come talk to us.