Fundamental Logo

Fundamental

DevOps Engineer

Posted 4 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in Greece
Senior level
Remote
Hiring Remotely in Greece
Senior level
Design, build, and operate cloud infrastructure and Kubernetes clusters for GPU/ML workloads; implement GitOps, IaC (Terraform), CI/CD, monitoring, automation (Python/Bash), and cost-optimization alongside ML engineers.
The summary above was generated by AI
About Fundamental

Fundamental is an AI company pioneering the future of enterprise decision-making. Founded by DeepMind alumni, Fundamental has developed NEXUS – the world's most powerful Large Tabular Model (LTM) – purpose-built for the structured records that actually drive enterprise decisions. Backed by world class investors and trusted by Fortune 100 companies, Fundamental unlocks trillions of dollars of value by giving businesses the Power to Predict.

At Fundamental, you'll work on unprecedented technical challenges in foundation model development and build technology that transforms how the world's largest companies make decisions. This is your opportunity to be part of a category-defining company from the ground-up. Join the team defining the future of enterprise AI.

Key responsibilities
  • Design and implement cloud infrastructure from the ground up

  • Build and maintain Kubernetes clusters optimized for GPU workloads and ML applications, as well as Production SaaS hosting

  • Implement GitOps practices using ArgoCD for continuous deployment

  • Develop infrastructure as code using Terraform

  • Create and maintain CI/CD pipelines for infrastructure and application deployment

  • Implement monitoring and observability solutions for distributed systems

  • Automate infrastructure management with Python and Bash

  • Collaborate with ML engineers to optimize infrastructure for model training and serving

  • Implement and maintain cost optimization strategies (FinOps) for cloud resources

  • Monitor and optimize cloud spending, especially for GPU-intensive workloads

Must have
  • 5+ years of experience in cloud infrastructure and DevOps

  • 3+ years of experience with Python

  • Strong experience with AWS and GCP cloud platforms

  • Deep expertise in Kubernetes, including multi-cluster management, GPU workload optimization, resource scheduling and autoscaling, and network policies and security

  • Experience with GitOps tools (ArgoCD preferred)

  • Extensive experience with cloud networking, including VPC design, load balancer configuration, network security and segmentation, and cross-cloud networking solutions

  • Strong CI/CD expertise, preferably with GitHub Actions

  • Proficiency in infrastructure as code (Terraform)

  • Experience with monitoring and observability tools

  • Experience with FinOps practices and cloud cost optimization

Nice to have
  • Experience with ML workflow tooling (MLflow, Kubeflow, or similar)

  • Experience with FastAPI and Backend applications

  • Familiarity with data platforms like Databricks or Snowflake

  • Exposure to SRE practices or cloud security certifications

  • Hands-on experience with Prometheus, Grafana, or Datadog

Benefits
  • Competitive compensation with salary and equity

  • Comprehensive health coverage for you and your dependents

  • Paid parental leave for all new parents, inclusive of adoptive and surrogate journeys

  • Relocation support for employees moving to join the team in one of our office locations

  • A mission-driven, low-ego culture that values diversity of thought, ownership, and bias toward action

Similar Jobs

3 Days Ago
Remote
Mid level
Mid level
Artificial Intelligence • Information Technology • Machine Learning • Software • Analytics
Own and evolve GCP-based infrastructure for an AI evaluation platform: manage Terraform, GKE, databases, CI/CD, observability, secrets, and cost/reliability. Collaborate with backend, ML, and frontend teams to make deployments repeatable, secure, and reliable.
Top Skills: AlembicAWSBashCeleryClickhouseCloud SqlDockerDocker ComposeExpressFastapiGCPGithub ActionsGkeGrafanaHelmIbmKubernetesLangfuseLitellm ProxyMySQLNode.jsOpentelemetryPostgresPrometheusPythonRedisTerraformUv
18 Days Ago
In-Office or Remote
Mid level
Mid level
Software • Analytics • Cybersecurity
The DevOps Engineer will support the development, deployment, automation, and security of cloud infrastructure and software delivery pipelines, with a focus on CI/CD and DevSecOps practices.
Top Skills: AWSAzureAzure DevopsCi/CdCloudFormationDockerGitlab CiGCPKubernetesPulumiTerraform
Senior level
Agency • Artificial Intelligence • Blockchain • Web3
Develop and maintain backend services in Go, manage distributed validator sets, and automate infrastructure using Kubernetes, Terraform, and AWS. Collaborate with cross-functional teams to deploy scalable Web3 solutions, monitor and troubleshoot performance and security issues, and contribute to architecture and design to improve reliability and scalability.
Top Skills: AWSGoInfrastructure As CodeKubernetesTerraformWeb3

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account