[$ xmrhost] _

$ pwd

/node/gpu/gpu-lite

[$ ] GPU Lite — RTX 4090 — no-KYC offshore GPU servers (Iceland, Monero)

// NAME

gpu-lite — Offshore RTX 4090 for AI inference & rendering.

// SYNOPSIS

xmrhost-cli provision --plan=gpu-lite --region=is

// SPEC

$ xmrhost-cli spec --plan=gpu-lite

cpu             8 vCPU (Threadripper)
ram             64 GB DDR5
storage         1 TB NVMe SSD
bandwidth       20 TB / month
port            10 Gbps
virtualization  Bare-Metal
os              Ubuntu 22.04 + CUDA 12, Debian + CUDA 12, Custom
ip              1 × IPv4 + IPv6
ddos-shield     40 Gbps
uptime-sla      99.9%

// NOTES

  • NVIDIA RTX 4090 (24 GB VRAM)
  • Pre-installed PyTorch & vLLM
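
A quick post-handoff sanity check for the preinstalled stack, sketched with standard NVIDIA and PyTorch tooling (nothing here is xmrhost-specific, and both steps degrade gracefully if run off-box):

```shell
# post-provision sanity check (a sketch; assumes the stock image layout)
# 1) does the NVIDIA driver see the card?
if command -v nvidia-smi >/dev/null 2>&1; then
    nvidia-smi --query-gpu=name,memory.total --format=csv,noheader
else
    echo "nvidia-smi not found (driver not loaded?)"
fi
# 2) can the preinstalled PyTorch reach CUDA?
python3 - <<'EOF'
try:
    import torch
    print("cuda available:", torch.cuda.is_available())
except ImportError:
    print("torch not installed")
EOF
```

On a healthy gpu-lite node the first step prints the 4090 and its 24 GB, and the second prints `cuda available: True`.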

// REGIONS

$ xmrhost-cli regions --plan=gpu-lite

region  country              ping               flag
is      Iceland (Reykjavik)  ~38 ms (from FRA)  --region=is

// ORDER

Order GPU Lite — RTX 4090

term      modifier  price
monthly   +30%      $636
annual    -20%      $4694
biennial  -25%      $8802
$ xmrhost-cli order --plan=gpu-lite

// no-kyc crypto billing (xmr recommended; btc / lightning / ltc / eth / usdt accepted) — why-monero covers the rationale, payments the flow.

// PROVISIONING

after you click order

$ xmrhost-cli provision --plan=gpu-lite --region=is
[ok] reserving capacity in region=is
[ok] node allocated: gpu-lite-is-72
[ok] applying hardened-by-default profile (sshd, fail2ban, unattended-upgrades)
[ok] base image bootstrapped (Debian 12)
[ok] handoff key sealed → view via the console at /console
provisioned in 47s. ssh access via onion-auth or wireguard, your choice.

// you receive the onion-auth key + initial sshd config in the same handoff. no email-shipped credentials. nothing is logged to the operator side.
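
For the onion-auth path, the client side is a plain OpenSSH config entry routed through a local Tor SOCKS proxy. A sketch — the Host alias, .onion hostname, user, and key path are all placeholders for the values in your /console handoff:

```
# ~/.ssh/config — onion-auth access (all values are placeholders)
Host gpu-lite-is-72
    HostName <your-handoff>.onion
    User <your-user>
    IdentityFile ~/.ssh/xmrhost_ed25519
    # tunnel the TCP connection through the local Tor SOCKS proxy
    # (OpenBSD netcat syntax; requires a Tor daemon on 9050)
    ProxyCommand nc -X 5 -x 127.0.0.1:9050 %h %p
```

The wireguard path needs no ssh_config changes — bring up the tunnel from the handoff and ssh to the node's internal address as usual.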

// HARDENING BASELINE — WHAT SHIPS BY DEFAULT

$ cat /etc/xmrhost/baseline.d/*

Every GPU Lite — RTX 4090 ships with the xmrhost hardening baseline applied on the first boot — no opt-in flag, no add-on, no separate purchase. The baseline is the same across the catalog (vps / dedicated / gpu / tor / i2p / lokinet); category-specific extras are listed below the common section. Detailed per-control runbooks live in /docs; the cross-cutting overview is at /hardening.

  • KERNEL. KSPP-baseline sysctls applied (kernel.kptr_restrict=2, kernel.yama.ptrace_scope=1, kernel.unprivileged_bpf_disabled=1, vm.unprivileged_userfaultfd=0, net.ipv4.tcp_syncookies=1, +12 more), unprivileged user-namespace creation gated, kexec disabled at runtime. Full list and rationale: /docs/kernel-hardening-checklist.
  • SSHD. PasswordAuthentication no, ChallengeResponseAuthentication no, KbdInteractiveAuthentication no, PermitRootLogin prohibit-password, MaxAuthTries 3, Ed25519-only host keys (RSA host keys removed), legacy KEX / cipher / MAC families disabled. fail2ban preconfigured with the sshd-default ruleset. Runbook: /docs/harden-sshd; key migration: /docs/ssh-key-migration.
  • AUDIT. auditd enabled with the laurel-compatible default ruleset (auth, identity, network-config, time-change, mount, perm-mod). unattended-upgrades on for main/security only — feature releases stay operator-controlled. systemd-journald persistent storage with SystemMaxUse=512M.
  • NETWORK. Egress-default-permit (the box reaches the internet), ingress-default-deny (only sshd + the customer's declared services). Outbound port 25 (SMTP) closed by default; customers operating a real MTA request the lift via /contact with the reverse-DNS pointing to a domain they control. Dual-stack IPv4 + IPv6 (/64 routed). RIPE-allocated PI space in Iceland and Romania.
  • MONITORING. node_exporter (the Prometheus node metrics exporter) listening on 127.0.0.1:9100 — the operator's monitoring scrapes via wireguard from the management VLAN, never from the public internet. Customers wanting their own metrics tap add a second exporter on a private interface.
  • INFERENCE STACK. CUDA 12.x toolkit, cuDNN, NCCL, NVIDIA driver matched to the installed accelerator, vLLM (latest stable), Ollama with model-registry mirror, llama.cpp (CUDA-compiled), PyTorch + transformers — preinstalled, container runtime ready.
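
The SSHD item above, expressed as the drop-in it implies — only directives named in the baseline are reproduced; the drop-in path itself is an assumption, not a documented xmrhost location:

```
# /etc/ssh/sshd_config.d/xmrhost-baseline.conf (path is illustrative)
PasswordAuthentication no
ChallengeResponseAuthentication no
KbdInteractiveAuthentication no
PermitRootLogin prohibit-password
MaxAuthTries 3
# Ed25519-only host keys — RSA host keys are removed from the image
HostKey /etc/ssh/ssh_host_ed25519_key
```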

// the baseline is editorial-stable — when the operator changes a default, the change is logged in /notes with the rationale and the migration notes for boxes already in service. /hardening is the canonical pillar; /docs is the procedural manual.

// FAQ

$ faq -p gpu-lite

Q. What GPU is in each tier?

A. Tier-specific. The lite tier is an RTX 4090 24 GB (Ada Lovelace, 16384 CUDA cores, 82.58 TFLOPS FP16); pro is RTX 4090 24 GB × 2; beast is an H100 80 GB SXM (Hopper, 14592 CUDA cores, 989.4 TFLOPS FP16) — the latter is procurement-sensitive. Spec is on the plan detail page; if the listed GPU is unavailable at order time the operator surfaces the substitute via /contact before charging.

Q. What does "offshore GPU hosting" actually buy me vs cloud?

A. Two things. (1) Jurisdiction — the box runs in Iceland or Romania, neither of which is subject to US export-control orders that gate cloud GPU access for some workloads (open-weight LLMs from non-US authors, certain red-team / jailbreak-eval workloads, etc.). (2) Billing privacy — Monero / no-KYC vs cloud-card-on-file. The trade-off vs hyperscaler GPU is honestly documented at /playbook/ai-inference.

Q. Is vLLM / Ollama / llama.cpp preinstalled?

A. Yes — the gpu-* tiers ship with vLLM (latest stable), Ollama (with the model registry mirror configured), llama.cpp (CUDA-compiled), CUDA 12.x toolkit, cuDNN, NCCL, PyTorch with CUDA support, and the HuggingFace transformers stack. Customers needing other inference engines (TensorRT-LLM, mlc-llm, exllama2) install via the package manager — the box is a normal Linux machine with NVIDIA driver + container runtime support.
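
Serving one of those preinstalled engines typically means a small systemd unit around `vllm serve`, vLLM's OpenAI-compatible server. A sketch — the model name, port, binary path, and unit location are assumptions, not xmrhost defaults:

```
# /etc/systemd/system/vllm.service (illustrative; adjust model and paths)
[Unit]
Description=vLLM OpenAI-compatible server
After=network-online.target

[Service]
# "vllm serve" exposes an OpenAI-compatible API on the given port;
# bind to the loopback and front it with a reverse proxy for public use
ExecStart=/usr/local/bin/vllm serve mistralai/Mistral-7B-Instruct-v0.3 --host 127.0.0.1 --port 8000
Restart=on-failure

[Install]
WantedBy=multi-user.target
```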

Q. Can I serve LLM endpoints publicly?

A. Yes — the AUP (/legal/aup) does not restrict serving open-weight LLM inference endpoints. Customers running a public chat / API surface should configure their own rate-limits and authentication; the operator does not provide a hosted gateway. Caddy-fronted vLLM is the common pattern (see /docs); the xmrhost hardening defaults already cover the OS layer.
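
The Caddy-fronted pattern, sketched as a minimal Caddyfile — the domain and credentials are placeholders. Note that stock Caddy here only adds TLS and auth; request rate-limiting needs a plugin (e.g. caddy-ratelimit), and on Caddy < 2.8 the directive is spelled `basicauth`:

```
# Caddyfile — minimal public front for a loopback vLLM (placeholders throughout)
llm.example.com {
    # require credentials before anything reaches the model server
    basic_auth {
        apiuser <bcrypt-hash>   # generate with: caddy hash-password
    }
    # vLLM's OpenAI-compatible server, listening on the loopback only
    reverse_proxy 127.0.0.1:8000
}
```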

Q. Where is the GPU server hosted?

A. Iceland (Reykjavik, RIPE) — the GPU catalog is Iceland-only because the Romanian racks do not have the GPU-density power / cooling provisioned. Iceland's hydroelectric + geothermal power is the operator's preference for GPU workloads on cost and emissions footprint; jurisdictional posture is at /location/is.

Q. Do I need to pay in Monero?

A. No. XMR is recommended; OxaPay accepts BTC, Lightning, LTC, ETH, and USDT. GPU-tier orders settle the same way VPS orders do — per-order Monero subaddress on XMR (MRL-0006), straight invoice on the transparent rails. No card, no fiat. The /why-monero rationale applies identically.

// ORDER

$ xmrhost-cli order --plan=gpu-lite

// no-kyc crypto billing (xmr recommended; btc / lightning / ltc / eth / usdt accepted) — why-monero covers the rationale, payments the flow.

// BEFORE YOU ORDER — RELEVANT GUIDES

$ ls /guide