lock-svg project
Successfully occupied
View project information dropdown icon
Wallet icon Coin icon Rate 7 000 € - 9 000 € / month info
Timer icon Form of cooperation Full-time
Briefcase icon Sector Telco
Location icon Location 100% Remote

info The reward is calculated upon delivery of 20 MD per month (1MD=8h)

Project duration 8-20 months with possibility of extension
Period of cooperation 01.05.2026 - 31.12.2026
Start date ASAP or by agreement
Technology
  • UNIX/Linux
  • Ansible
  • Terraform
Languages
  • English flag English - active, B2/C1/C2

Project description

  • design, build, and operate infrastructure (compute, network, storage) for AI cloud
  • provisioning and management of bare-metal servers and GPU nodes (PXE boot, OS, firmware)
  • coordination with data center teams for hardware lifecycle activities (installations, upgrades, storage expansion)
  • design and operation of infrastructure based on the NVIDIA AI stack
  • automation and configuration using Ansible and Terraform (IaC)
  • management and patching of Debian-based Linux environments
  • management of firmware and infrastructure lifecycle
  • implementation and management of IAM solutions (Keycloak, Entra ID, AD)
  • operation and support of AI and HPC workloads (bare metal + Kubernetes)
  • implementation of monitoring and observability (Prometheus, Grafana)
  • management of high-performance storage solutions (WEKA / Hitachi)
  • management of infrastructure documentation and assets (e.g., NetBox)
  • adherence to and improvement of ITIL processes (incident, problem, change management)
  • creation of runbooks and operational procedures
  • ensuring high infrastructure availability (ZERO outage approach)
  • technical consulting and solution delivery within projects focused on the NVIDIA stack

Project requirements

  • min. 5 years of active project experience as an Infrastructure Engineer
  • experience with installation, administration, and operation of server hardware
  • advanced knowledge of Linux (Debian) in a production environment
     
  • advanced experience with:
    • Infrastructure as a Code (Ansible, Terraform)
    • GPU infrastructure / NVIDIA GPU platform
    • AI / GPU orchestration stack
    • GPU cloud platform stack and its layers
    • IP, routing, VLAN, DNS, firewall, L1/L2
    • IAM systems (Keycloak, Entra ID, LDAP/AD)
    • monitoring (Prometheus, Grafana)
    • ITIL processes (incident, problem, change)
       
  • experience with operation in a 24/7 mission-critical environment
     
  • great advantage: experience with:
    • large-scale GPU clusters or HPC environments
    • high-performance storage environments (WEKA by Hitachi)
    • GitOps and infra deployment approaches
    • Redfish technology for hardware management
    • sovereign cloud and data compliance requirements
    • Kubernetes or distributed systems
  • analytical thinking
  • communication skills
  • team player
Are you interested in this project?
Recommend an IT specialist Do you know anyone who could use this project? Recommend him and get a reward 800 €!
Hire an IT specialist Do you need a similar IT freelancer for your project? Hire a specialist
New to the world of IT freelancing ?

Freedom, flexibility, greater control over finances and career. Freelancing has evolved and offers much more today. See what's in store for you and how it will change your life.

Are you interested in this project?
Recommend an IT specialist Do you know anyone who could use this project? Recommend him and get a reward 800 €!
Hire an IT specialist Do you need a similar IT freelancer for your project? Hire a specialist
32 206

Titans that have
joined us

746

Clients that have
joined us

699 462

Succcessfully supplied
man-days