/ gpu vps · kvm + pci passthrough

GPU VPS. Train. Render. Stream.

KVM virtual machines with full PCI passthrough to NVIDIA RTX PRO 4500 and RTX PRO 6000 Blackwell. The whole GPU goes to your VM — no vGPU slicing, no shared CUDA cores. Boots in minutes. Fixed monthly price. No surprise hourly bill.

  • 100% GPU passed to your VM
  • PCIe 5 full-bandwidth passthrough
  • 10 Gbit/s uplink, unmetered
  • ~5 min provisioning · in stock
live · gpu-7 · RTX PRO 6000 Blackwell · running
  • CUDA utilization: 74%
  • VRAM: 71.4 / 96 GB
  • Temp: 62 °C
  • Power: 412 W
  • SM clock: 2.61 GHz
Active workload
  • PID 8421 · llama-3.1-70b · vLLM · 68 GB
  • PID 8422 · whisper-large-v3 · 3.4 GB
/ hardware

Two cards. Both workstation-class.

Pro-grade GPUs passed to your VM via PCIe Gen 5 passthrough. ECC VRAM, proper datacenter cooling. Not gaming silicon, not consumer drivers, not vGPU slicing.

entry

RTX PRO 4500 Blackwell

32 GB GDDR7 ECC

Generation: Blackwell · 10,496 CUDA cores

  • VRAM: 32 GB GDDR7 · ECC
  • CUDA cores: 10,496
  • Tensor cores: 328 (5th gen)
  • RT cores: 82 (4th gen)
  • FP32 compute: 49.8 TFLOPS
  • TDP: 200 W
  • Memory bus: 256-bit · 672 GB/s
  • NVENC / NVDEC: 3 / 3 (AV1)
Best for
  • LLM inference up to 30B params
  • Stable Diffusion / Flux
  • Real-time encoding (12+ streams)
  • 3D rendering · Blender / Octane
flagship

RTX PRO 6000 Blackwell

96 GB GDDR7 ECC

Generation: Blackwell · 24,064 CUDA cores

  • VRAM: 96 GB GDDR7 · ECC
  • CUDA cores: 24,064
  • Tensor cores: 752 (5th gen)
  • RT cores: 188 (4th gen)
  • FP32 compute: 125.5 TFLOPS
  • TDP: 600 W
  • Memory bus: 512-bit · 1.79 TB/s
  • NVENC / NVDEC: 4 / 4 (AV1)
Best for
  • LLM inference · 70B+ on a single GPU
  • Fine-tuning · LoRA / QLoRA
  • Heavy 8K rendering · DCC pipelines
  • Video AI · upscaling, generation
/ plans

Pick a slice. Boot in minutes.

Best for inference
/ gpu

GPU VPS · RTX PRO 4500

32 GB VRAM
$550/mo
  • GPU: 1× RTX PRO 4500 Blackwell · 32 GB · full passthrough
  • CPU: 16 vCPU · EPYC 9354 (3.25 GHz)
  • RAM: 128 GB DDR5
  • Disk: 1 TB NVMe Gen5
  • Network: 10 Gbit/s · unmetered
  • DDoS: 1.6 Tbit/s included
KVM · cloud-init · Snapshots free · Native IPv6 · API / web panel
In stock · Kyiv, UA · ~5 min provisioning
Flagship · 70B in one card
/ gpu

GPU VPS · RTX PRO 6000

96 GB VRAM
$1,200/mo
  • GPU: 1× RTX PRO 6000 Blackwell · 96 GB · full passthrough
  • CPU: 32 vCPU · EPYC 9354 (3.25 GHz)
  • RAM: 256 GB DDR5
  • Disk: 2 TB NVMe Gen5
  • Network: 10 Gbit/s · unmetered
  • DDoS: 1.6 Tbit/s included
KVM · cloud-init · Snapshots free · Native IPv6 · API / web panel
In stock · Kyiv, UA · ~5 min provisioning

Need multi-GPU? We can pass 2× / 4× cards into one VM, or build a dedicated bare-metal rig — talk to engineering.

/ workloads

Built for the things people actually do with GPUs.

If your job involves CUDA, an encoder, or a render farm — we've already racked the right hardware.

01

AI / LLM inference

Run Llama 3.1 70B on a single RTX PRO 6000 Blackwell VM, or 8B/13B-class models on the RTX PRO 4500 plan. vLLM, llama.cpp, TGI: all run on native CUDA inside the VM.

vLLM · TGI · Triton · CUDA 12.5
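
Whether a model fits on one card depends mostly on parameter count times bytes per parameter. The sketch below is a rough back-of-the-envelope check, not a sizing guarantee: the 8 GB headroom for KV cache and runtime overhead is an assumed placeholder, and the function names are illustrative only. Under these assumptions a 70B model fits in 96 GB at 8-bit precision but not at 16-bit.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# KV cache, activations and framework overhead come on top; the
# headroom value below is an assumption, not a measured figure.

BYTES_PER_PARAM = {"fp16/bf16": 2.0, "fp8/int8": 1.0, "int4": 0.5}


def weight_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate size of the model weights in decimal gigabytes."""
    return params_billion * bytes_per_param


def fits(params_billion: float, bytes_per_param: float,
         vram_gb: float, headroom_gb: float = 8.0) -> bool:
    """True if weights plus the assumed headroom fit in the given VRAM."""
    return weight_gb(params_billion, bytes_per_param) + headroom_gb <= vram_gb


if __name__ == "__main__":
    for precision, bpp in BYTES_PER_PARAM.items():
        size = weight_gb(70, bpp)
        verdict = "fits" if fits(70, bpp, vram_gb=96) else "does not fit"
        print(f"70B @ {precision}: ~{size:.0f} GB of weights, {verdict} in 96 GB")
```

Long contexts and large batch sizes grow the KV cache well beyond any fixed headroom, so treat the result as a first filter only.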
02

Diffusion / generative video

Flux, SD3, AnimateDiff, video upscaling. 96 GB VRAM lets you keep large checkpoints and ControlNets resident — no swapping per request.

ComfyUI · Auto1111 · SD3 · Flux
03

3D rendering · DCC pipelines

Blender · Cycles, Octane, Redshift, V-Ray RT. Pro driver stack means scenes that crash on consumer cards just work. Long renders priced flat — no per-minute meter.

Blender · Octane · Houdini · Redshift
04

Video AI · streaming

Real-time AV1 encoding (NVENC), upscaling, frame interpolation, automatic captioning (Whisper). Pair with our 100/400 G nodes for distribution.

NVENC AV1 · Whisper · RIFE · Topaz
05

Fine-tuning · LoRA

Train LoRA adapters for 7B–13B models in a few hours on the RTX PRO 6000. Full PCIe Gen 5 plus 1.79 TB/s of memory bandwidth means less training time lost to data movement.

LoRA · QLoRA · PEFT · Axolotl
06

CUDA / scientific compute

Full-passthrough VM. ECC VRAM. No noisy neighbour on the GPU. Run your own MPI jobs, OptiX ray tracing, CFD solvers — root access, not a sandbox.

CUDA · OptiX · cuDNN · TensorRT
/ software

Pick an image. Boot. Start jobs.

Pre-baked images keep you from spending the first day fighting drivers. Or bring your own: you have root on the VM.

Drivers & runtime
  • CUDA 12.5
  • cuDNN 9
  • TensorRT 10
  • NCCL 2.22
  • NVIDIA 555+ driver
Frameworks (preset images)
  • PyTorch 2.5
  • TensorFlow 2.17
  • JAX 0.4
  • vLLM
  • TGI
  • ComfyUI
Containers & orchestration
  • Docker + nvidia-runtime
  • Podman
  • k3s ready
  • Slurm-compatible
OS choices
  • Ubuntu 22.04 / 24.04 LTS
  • Debian 12
  • Rocky 9
  • Custom ISO
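
After first boot, it can be worth confirming what the VM actually sees. The sketch below shells out to `nvidia-smi` with its CSV query mode and parses the result. The helper names are illustrative, and the exact GPU name and driver string reported on these images are not something this page specifies.

```python
import subprocess

# Fields requested from nvidia-smi; memory.total is reported in MiB.
QUERY = "name,driver_version,memory.total"


def parse_gpu_csv(text: str) -> list[dict]:
    """Parse `nvidia-smi --format=csv,noheader,nounits` output into dicts."""
    gpus = []
    for line in text.strip().splitlines():
        # Split from the right so a comma inside the GPU name cannot break parsing.
        name, driver, mem_mib = [field.strip() for field in line.rsplit(",", 2)]
        gpus.append({"name": name, "driver": driver, "memory_mib": int(mem_mib)})
    return gpus


def query_gpus() -> list[dict]:
    """Run nvidia-smi and return one dict per visible GPU."""
    result = subprocess.run(
        ["nvidia-smi", f"--query-gpu={QUERY}", "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return parse_gpu_csv(result.stdout)


if __name__ == "__main__":
    try:
        for gpu in query_gpus():
            print(gpu)
    except (FileNotFoundError, subprocess.CalledProcessError) as exc:
        print(f"nvidia-smi not available or failed: {exc}")
```

With full passthrough, the list should contain the physical card with its full memory rather than a vGPU profile.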
/ faq

Questions we hear all the time.

More? Ping us on Telegram or reach the NOC at noc@hostfory.com.

  • The OS layer is a KVM virtual machine, but the entire GPU is passed through the PCIe bus directly to your VM. Every CUDA core, all VRAM, no vGPU slicing, no shared driver. CPU and RAM are dedicated slices of an EPYC host, with no oversubscription on GPU plans.

/ get started

Spin up a GPU VPS. Now.

$550 for the 32 GB RTX PRO 4500. $1,200 for the 96 GB RTX PRO 6000. Flat monthly price. Full GPU passthrough, unmetered 10G network, real engineers on Telegram.
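
To compare a flat monthly price with hourly-billed GPU instances, it helps to convert one into the other. The sketch below uses the two plan prices above and assumes an average month of 730 hours. The $2.00/hour rate in the usage lines is a made-up comparison figure, not a quote from any provider.

```python
# Average hours in a month: 365 days * 24 h / 12 months.
HOURS_PER_MONTH = 730

# Monthly prices in USD, taken from the plans on this page.
PLANS = {"RTX PRO 4500": 550, "RTX PRO 6000": 1200}


def hourly_equivalent(monthly_usd: float) -> float:
    """Effective hourly cost if the VM runs around the clock."""
    return monthly_usd / HOURS_PER_MONTH


def break_even_hours(monthly_usd: float, hourly_rate_usd: float) -> float:
    """Hours of use per month at which an hourly rate costs the same as the flat price."""
    return monthly_usd / hourly_rate_usd


if __name__ == "__main__":
    for plan, price in PLANS.items():
        print(f"{plan}: ${hourly_equivalent(price):.2f}/h at 24/7 use")
    # Hypothetical hourly rate, for illustration only.
    print(f"Break-even vs $2.00/h: {break_even_hours(1200, 2.0):.0f} h/month")
```

Below the break-even point an hourly meter is cheaper. Above it, the flat price wins.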