Site Reliability Engineer (SRE) / Production Engineer (PE) - Kubernetes & Cloud Infrastructure Job at Fireworks AI, Bay County, FL

UXdXNWx4QVpIY1ZpbS9qRlFqbzR5b3dBWnc9PQ==
  • Fireworks AI
  • Bay County, FL

Job Description

About Us:

Here at Fireworks, we’re building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We’ve been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own function calling and multi-modal models. Fireworks is funded by top investors, like Benchmark and Sequoia, and we’re an ambitious, fun team composed primarily of veterans from Pytorch and Google Vertex AI.

The Role:

We’re seeking a highly skilled SRE/PE with deep expertise in Kubernetes (k8s), cloud networking, and infrastructure automation. This role will focus on reducing incident response time, implementing auto-remediation, optimizing auto-scaling, and improving cluster efficiency and service health. You’ll design systems that balance performance, cost, and reliability while working onsite with our Redwood City team.

Key Responsibilities:

  1. Incident Response & Reliability Engineering:

  2. Kubernetes & GPU Cluster Optimization:

  3. Cloud Networking & Service Health:

  4. Monitoring & Observability:

  5. Automation & Infrastructure-as-Code (IaC):

Minimum Qualifications:

  • 3+ years in SRE/PE/DevOps roles with production-grade Kubernetes experience.

  • Proficiency in cloud networking (AWS/GCP/Azure VPCs, firewalls, DNS) and service monitoring (Prometheus, Alertmanager, Grafana).

  • Hands-on experience with incident management and improving system reliability/SLOs.

  • Strong scripting/coding skills (Python/Go/Bash) for automation and tooling.

  • Familiarity with object storage (S3, GCS) and data pipeline integration.

Preferred Qualifications

  • Experience with GPU clusters (NVIDIA GPUs, MIG, CUDA) and AI/ML workloads.

  • Knowledge of auto-scaling technologies (K8s HPA/VPA) and auto-remediation frameworks.

  • Expertise in service meshes (Istio)

Why Fireworks AI?

  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.

  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.

  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.

  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.

Job Tags

Similar Jobs

State of Illinois

INVESTMENT ANALYST-PUBLIC EQUITY Job at State of Illinois

 ...investments career with TRS is an opportunity to help Illinois public educators achieve their promised retirement security. The Investments...  ...interactions Minimum Requirements Bachelors degree in finance, accounting, economics, business, or related field Advanced... 

STP

CDL Truck Driver Job at STP

 ...We are looking for a CDL A Truck Driver in the Tolland CT area to operator a dump trucktransporting goods and materials to designated destinations. CDL Truck Driver Duties and Responsibilities Responsibilities include operating dumb truck transporting construction... 

KB Consulting Group

Google Cloud Engineer Job at KB Consulting Group

KB Consulting Group's Client is seeking 2 Google Cloud Engineers. The candidate must possess the following skills: - Good English - Ability to multi task and communicate well Must Know: AWS Engineering Forseti AWS Config and Rules. DevOps

hatch I.T.

Data Specialist Job at hatch I.T.

 ...hatch I.T. is partnering with VIA to find a Data Specialist. See details below: About the Role: As a Data Specialist at VIA, you will play a pivotal role in the growth of their solutions. Your key responsibilities will include translating customer domain knowledge... 

Alaska Airlines

Ramp Service Agent Job at Alaska Airlines

**Company** Alaska Airlines**The Team**Our airport teams work together to move guests and their belongings from curb to cabin, creating remarkable experiences along the way. Whether customer-facing or behind the scenes, we want to hear from you if you can be welcoming...