Software Engineer, Cloud Infrastructure [Early Career]

Fireworks AI


Date: 10 hours ago
City: Redwood City, CA
Contract type: Full time

About Us:

Here at Fireworks, we're building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We've been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own function calling and multi-modal models. Fireworks is funded by top investors, like Benchmark and Sequoia, and we're an ambitious, fun team composed primarily of veterans from PyTorch and Google Vertex AI.

The Role:

As a Software Engineer on our Cloud Infrastructure team, you will help design and build the core systems that power Fireworks AI's generative AI platform. You will work on projects that support distributed AI workloads across multiple cloud environments, building tools and services that keep our platform fast, reliable, and scalable.

This is an exciting opportunity for a new graduate who wants to grow as a software engineer and work hands-on with cloud infrastructure, distributed systems, and large-scale machine learning platforms. You will collaborate closely with experienced engineers, learn from technical mentors, and contribute to the next generation of AI infrastructure.

Key Responsibilities:

  • Contribute to the design and development of scalable backend infrastructure that supports distributed training, inference, and data pipelines
  • Build and maintain core backend services such as job schedulers, autoscalers, resource managers, and model serving systems
  • Support performance optimization, cost efficiency, and reliability improvements across compute, storage, and networking layers
  • Collaborate with ML, DevOps, and product teams to translate research and product needs into infrastructure solutions
  • Learn and apply modern cloud technologies including Kubernetes, Ray, Kubeflow, and MLflow
  • Participate in code reviews, technical discussions, and continuous integration and deployment processes

Minimum Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
  • Strong programming skills in Python, C++, or a similar language
  • Solid understanding of computer systems concepts such as networking, storage, and distributed computing
  • Familiarity with cloud platforms like AWS, GCP, or Azure, and container tools like Docker or Kubernetes
  • Knowledge of and interest in cloud infrastructure, distributed systems, and machine learning

Preferred Qualifications:

  • 2+ years of relevant industry experience through internships, co-ops, or full-time employment
  • Experience with ML frameworks such as PyTorch, TensorFlow, Vertex AI, or SageMaker
  • Exposure to infrastructure-as-code and CI/CD tools such as Terraform, ArgoCD, or GitHub Actions
  • Contributions to open-source infrastructure or ML projects
  • Strong problem-solving skills and the curiosity to learn new technologies

Deadline to apply: November 7th at 11:59 PM PT

Why Fireworks AI?

  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
  • Build What's Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.

Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.
