Quick Answer
io.net offers access to 200,000+ GPUs across 130+ countries, including NVIDIA's latest H100 SXM and H100 PCIe, A100 80GB/40GB, L40S, RTX 4090, RTX 4080, A40, and RTX 3090 Ti. Unlike traditional cloud providers with limited inventory and waitlists, io.net's decentralized network aggregates GPUs from independent providers worldwide, ensuring instant availability for both consumer and enterprise workloads. Prices range from $0.18/hr for RTX 4090 (ideal for inference) to $2.20/hr for H100 SXM (optimal for training), with most GPU types available on-demand in under 2 minutes.
Complete GPU Inventory Breakdown
Here's every GPU type available on io.net with pricing, availability, and ideal use cases:
| GPU Model | Memory | Compute (TFLOPS) | Price/hr | Availability | Best For |
|---|---|---|---|---|---|
| NVIDIA H100 SXM | 80GB HBM3 | 60 FP16 | $2.20 | High | LLM training (70B+), distributed training |
| NVIDIA H100 PCIe | 80GB HBM3 | 51 FP16 | $1.49 | High | LLM inference, fine-tuning (7-13B) |
| NVIDIA A100 80GB | 80GB HBM2e | 39 FP16 | $1.49 | Very High | LLM training (7-30B), multi-task workloads |
| NVIDIA A100 40GB | 40GB HBM2e | 39 FP16 | $1.20 | Very High | Mid-size training, batch inference |
| NVIDIA L40S | 48GB GDDR6 | 91 FP16 | $0.75 | High | Inference, rendering, video processing |
| NVIDIA A40 | 48GB GDDR6 | 37 FP16 | $0.85 | High | Professional visualization, AI inference |
| NVIDIA RTX 4090 | 24GB GDDR6X | 82 FP16 | $0.18 | Very High | Cost-efficient inference, small models |
| NVIDIA RTX 4080 | 16GB GDDR6X | 55 FP16 | $0.35 | High | Gaming, development, light AI workloads |
| NVIDIA RTX 3090 Ti | 24GB GDDR6X | 40 FP16 | $0.28 | High | Budget training, inference, image gen |
| NVIDIA RTX 3090 | 24GB GDDR6X | 35 FP16 | $0.28 | Very High | Legacy workloads, cost optimization |
Availability status: Very High (95%+ uptime), High (90%+ uptime). Pricing as of April 2026.
GPU Architecture Comparison
Understanding GPU architectures helps you choose the right hardware:
Hopper Architecture (H100):
- Released: 2022
- Manufacturing: TSMC 4N process
- Key features: Transformer Engine (2x faster LLM training), FP8 precision, 80GB HBM3 at 3 TB/s bandwidth
- Best for: Llama 3 70B training, GPT-4 scale inference, distributed training clusters
- Premium over Ampere: 1.5x performance at 1.4x cost (better value)
Ada Lovelace Architecture (RTX 4090/4080):
- Released: 2022
- Manufacturing: TSMC 4N process
- Key features: DLSS 3, AV1 encoding, 4th gen Tensor Cores
- Best for: Llama 3 8B inference, Stable Diffusion generation, ComfyUI workflows
- Value proposition: 90% of A100 inference performance at 15% of the cost
Ampere Architecture (A100, A40, RTX 3090):
- Released: 2020-2021
- Manufacturing: TSMC 7nm
- Key features: Multi-Instance GPU (MIG), sparse tensor cores, PCIe Gen 4
- Best for: Proven training workloads, established production inference
- Mature ecosystem: Widest framework support, most optimization guides
GPU Availability by Region
io.net's decentralized network provides global GPU access:
| Region | H100 | A100 | RTX 4090 | L40S | Avg. Latency to US |
|---|---|---|---|---|---|
| North America | High | Very High | Very High | High | <20ms |
| Europe (West) | High | High | Very High | High | 80-120ms |
| Europe (East) | Medium | High | High | Medium | 100-140ms |
| Asia-Pacific | Medium | High | Very High | High | 150-200ms |
| South America | Low | Medium | High | Low | 120-180ms |
Regional Selection Tips:
- Lowest cost: Eastern Europe, Southeast Asia (15-25% cheaper electricity)
- Lowest latency: Select region closest to your end users
- Compliance: EU regions for GDPR, US regions for HIPAA/SOC 2
- No regional lock-in: Move workloads between regions instantly
Real-Time GPU Availability Dashboard
Unlike AWS/Azure with instance waitlists, io.net shows live availability:
# Check real-time GPU availability
io availability --gpu all
# Output:
GPU Model Available Price/hr Regions
H100 SXM 1,247 $2.20 US-West, EU-West, APAC
A100 80GB 3,891 $1.49 All regions
A100 40GB 5,203 $1.20 All regions
RTX 4090 28,432 $0.18 All regions
L40S 2,156 $0.75 US-West, EU-West
Availability Guarantees:
- RTX 4090/3090: 99.5% availability (28,000+ GPUs)
- A100 40GB/80GB: 99.2% availability (9,000+ GPUs)
- H100 SXM/PCIe: 98% availability (1,500+ GPUs)
- L40S: 95% availability (2,000+ GPUs)
During peak demand (US business hours), availability may temporarily drop to 90-95% for H100s, but you can usually provision within 5-10 minutes.
GPU Selection Guide by Use Case
Large Language Model Training:
| Model Size | GPU Recommendation | Quantity | Rationale |
|------------|-------------------|----------|-----------|
| <7B (Llama 3 8B) | A100 80GB | 1-2 | Fits in single GPU, full fine-tune possible |
| 7-13B | A100 80GB or H100 | 2-4 | LoRA on 1-2 GPUs, full fine-tune on 4 |
| 30-40B | H100 SXM | 4-8 | Requires NVLink, FSDP or DeepSpeed |
| 70B+ | H100 SXM | 8-16 | NVSwitch interconnect critical |
LLM Inference:
| Throughput | GPU Recommendation | Quantity | Cost/Month (24/7) |
|------------|-------------------|----------|-------------------|
| <100 req/day | RTX 4090 | 1 | $130 |
| 1K-10K req/day | RTX 4090 or L40S | 2-4 | $260-2,160 |
| 10K-100K req/day | L40S or H100 PCIe | 4-8 | $2,160-8,582 |
| 100K+ req/day | H100 SXM cluster | 8-32 | $12,672-50,688 |
Image Generation (Stable Diffusion, Midjourney-scale):
| Images/Day | GPU Recommendation | Quantity | Cost/Month |
|------------|-------------------|----------|------------|
| <1,000 | RTX 4090 | 1 | $65 (12hr/day) |
| 1K-10K | RTX 4090 | 2-4 | $130-260 (12hr/day) |
| 10K-100K | RTX 4090 or L40S | 8-16 | $1,036-2,160 (24/7) |
| 100K+ | RTX 4090 cluster | 32+ | $4,147+ (24/7) |
Video Processing and Rendering:
| Workload | GPU Recommendation | Key Features |
|----------|-------------------|--------------|
| Real-time rendering | RTX 4090 | DLSS 3, ray tracing |
| Video encoding | L40S or RTX 4090 | AV1 encoding, NVENC |
| 3D visualization | A40 or L40S | Professional drivers, ECC memory |
GPU Performance Benchmarks
Real-world performance comparison for common AI tasks:
Llama 3 8B Inference (tokens/second):
| GPU | FP16 | INT8 | FP8 (H100) |
|-----|------|------|------------|
| H100 SXM | 142 | 287 | 385 |
| H100 PCIe | 118 | 245 | 320 |
| A100 80GB | 95 | 178 | N/A |
| L40S | 87 | 165 | N/A |
| RTX 4090 | 82 | 156 | N/A |
Stable Diffusion XL (512x512 image generation time):
| GPU | SDXL (default) | SDXL (optimized) |
|-----|----------------|------------------|
| H100 | 0.8s | 0.4s |
| RTX 4090 | 1.2s | 0.6s |
| A100 | 1.4s | 0.7s |
| RTX 3090 | 2.1s | 1.0s |
Fine-tuning Llama 3 8B LoRA (samples/second):
| GPU | Batch 1 | Batch 4 | Batch 16 |
|-----|---------|---------|----------|
| H100 SXM | 3.2 | 11.8 | 42.5 |
| A100 80GB | 2.1 | 7.8 | 28.3 |
| RTX 4090 | 1.8 | 6.5 | 24.1 |
Multi-GPU Configurations
For distributed training and high-throughput inference:
Pre-Configured Clusters:
# 2-GPU NVLink cluster for 13B model training
io launch --gpu A100 --count 2 --network nvlink --disk 200GB
# 8-GPU NVSwitch cluster for 70B model training
io launch --gpu H100 --count 8 --network nvswitch --disk 1TB
# 4-GPU inference cluster with load balancing
io launch --gpu RTX4090 --count 4 --mode inference --autoscale
Cluster Networking Options:
| Configuration | Bandwidth | Latency | Use Case | Premium Cost |
|---|---|---|---|---|
| PCIe Gen 4 | 64 GB/s | ~5μs | Independent tasks | Included |
| NVLink (2-way) | 600 GB/s | <1μs | Small model parallel | +$0.10/hr per GPU |
| NVSwitch (8-way) | 900 GB/s | <1μs | Large model training | +$0.25/hr per GPU |
| InfiniBand (multi-node) | 200-400 Gb/s | ~2μs | 16+ GPU clusters | +$0.50/hr per GPU |
GPU Availability Alerts and Reservations
Never wait for GPU availability:
Real-Time Alerts:
# Get notified when H100s become available in US-West
io notify --gpu H100 --region us-west --email [email protected]
# Auto-provision when GPU becomes available
io autoprovision --gpu H100 --count 8 --max-price 2.50
Enterprise Reserved Capacity:
For mission-critical workloads requiring guaranteed availability:
- Reserved pools: Dedicated GPU allocation for your account
- Volume discounts: 10-20% off on top of base pricing
- SLA guarantees: 99.9% availability with credits for downtime
- Minimum commitment: 10 GPUs for 3 months
Contact [email protected] for reserved capacity pricing.
Related Questions
How quickly can I get a GPU on io.net?
Most GPUs provision in under 2 minutes. RTX 4090 and A100 (highest availability) typically spin up in 30-60 seconds. H100s may take 2-5 minutes during peak demand. There are no reservation queues or waiting lists - if a GPU shows as available, you can provision it immediately. For guaranteed instant access, enterprise plans include reserved capacity.
Can I switch GPU types during my project?
Yes. io.net allows you to stop one instance and start another with a different GPU type instantly. Your data persists in attached storage volumes. For example, you might use RTX 4090 for development ($0.18/hr), then switch to 8x H100 for production training ($17.60/hr), then back to L40S for inference ($0.75/hr). No migration fees or data transfer costs.
What if my preferred GPU type is unavailable?
io.net shows live availability across all regions. If H100s are unavailable in US-West, check EU-West or US-East. The platform also suggests alternative GPUs - for example, 2x A100 can often replace 1x H100 for training at similar total cost. Set up availability alerts to get notified when your preferred GPU becomes available.
Are GPUs shared or dedicated?
All io.net GPUs are dedicated bare-metal instances. You get 100% of the GPU's compute, memory, and bandwidth - no virtualization, no sharing, no "noisy neighbor" issues. This is different from some cloud providers that use MIG (Multi-Instance GPU) to split GPUs. You have root access and full control over the GPU.
How does GPU quality control work on a decentralized network?
io.net runs automated health checks on every GPU every 6 hours: memory tests, compute benchmarks, and thermal monitoring. GPUs that underperform (>10% below expected benchmarks) are automatically removed from the marketplace. Provider reputation scores factor in uptime, performance, and user ratings. You can see provider ratings before provisioning and request GPU replacement if performance issues occur.
Browse io.net's GPU Inventory
Access GPUs instantly:
- H100 SXM at $2.20/hr - 68% cheaper than AWS ($6.98/hr)
- RTX 4090 at $0.18/hr - Best price-performance for inference
- A100 80GB at $1.49/hr - Proven training performance
- No waitlists - 99%+ availability across all GPU types
View real-time availability → or launch a GPU now →
Last updated: April 2026 | GPU availability and pricing subject to real-time market conditions
