System Requirements

Hardware and OS requirements for each Purple8 Graph deployment tier, with practical data-size context to help you right-size your deployment.

Tiers at a glance

Tier	Deployment	Node cap	Edge cap	Queries/min
Desktop Free	Local app	1M	10M	1,000
Desktop Pro	Local app	10M	100M	5,000
Pro Cloud	Managed (shared)	25M	250M	5,000
Cloud Plus	Managed (dedicated)	100M	1B	20,000
Self-Hosted Server	Your infra	Unlimited	Unlimited	Unlimited
Server Enterprise	On-prem / VPC	Unlimited	Unlimited	Unlimited

Desktop Free & Desktop Pro

Desktop Free — local macOS / Windows app, 1M node cap, no key required Desktop Pro — local macOS / Windows app, 10M node cap, $29/mo

What fits?

Use case	Nodes	Edges	Typical graph
Personal knowledge graph	50K–200K	200K–1M	Notes, people, books, events
Collection & content hierarchy	100K–500K	500K–2M	Libraries, taxonomies, catalogs
Feature flag graph	200K–1M	1M–5M	Features, roles, tenants, cohorts
Code dependency graph	up to 1M	up to 10M	Files, functions, imports, call chains
Process similarity / dedup	up to 10M (Pro)	up to 50M (Pro)	Processes, tasks, steps, outcomes

Recommended hardware — Desktop tiers

Component	Desktop Free (min)	Desktop Pro (recommended)
CPU	2 cores	4–8 cores
RAM	4 GB	16 GB
Disk	10 GB SSD	100 GB SSD
OS	macOS 13+, Windows 10+	macOS 14+, Windows 11
Python	3.10+	3.11+

RAM breakdown (1M nodes, 768-dim, int8):

Component	Memory
Graph store (RocksDB)	~400 MB
HNSW index (int8 quantization)	~900 MB
OS + Python runtime	~300 MB
Total	~1.6 GB

For MacBooks

Runs on 8 GB RAM with P8G_VECTOR_QUANTIZATION=int8. Use P8G_VECTOR_BACKEND=diskann on machines with < 4 GB available.

Pro Cloud & Cloud Plus

Pro Cloud — managed shared instance, 25M nodes, $99/mo Cloud Plus — managed dedicated instance, 100M nodes, $499/mo

Purple8 manages the infrastructure. No hardware to provision.

What fits?

Use case	Nodes	Edges	Context
Mid-size enterprise knowledge graph	5M–25M	50M–250M	Products, customers, orgs, transactions
Access management graph (AD replacement)	5M–15M	20M–100M	Users, roles, resources, policies, tenants
Multi-document RAG corpus	10M–25M	50M–250M	Full corpus of a large enterprise wiki
Healthcare patient graph	10M–25M	50M–200M	Patients, diagnoses, medications, encounters
Duplicate artifact detection	5M–20M	20M–100M	Knowledge artefacts, similarity edges

Managed infra specs (for reference)

Spec	Pro Cloud	Cloud Plus
Compute	Shared (2–4 vCPU)	Dedicated 16 vCPU
RAM	Shared pool	64–128 GB dedicated
Storage	200 GB	1 TB NVMe
Queries/min	5,000	20,000
Concurrent connections	10	100
Backups	Daily, 30-day retention	Daily + PITR 7-day
Region	us-east-1 (default)	Choice of US / EU / APAC

Self-Hosted Pro & Server

Self-Hosted — your infra (any cloud or bare metal), unlimited nodes, $299/mo

What fits?

Deployment	Nodes	Edges	Use case
Single-tenant production	25M–100M	250M–1B	Product + customer graph
Multi-tenant SaaS	50M–200M	500M–2B	Per-tenant sub-graphs
Process transformation graph	50M–200M	250M–1B	Enterprise process mining
Feature enablement platform	20M–100M	100M–500M	Features × tenants × roles × cohorts

Recommended hardware — Self-Hosted

Single node (up to ~50M nodes)

Component	Minimum	Recommended
CPU	8 cores	16 cores
RAM	32 GB	128 GB
Disk	500 GB NVMe SSD	1 TB NVMe SSD
OS	Ubuntu 22.04+	Ubuntu 22.04+
Network	1 GbE	10 GbE

With Raft HA + read replica (up to ~100M nodes)

Component	Primary	Replica ×2
CPU	16 cores	8 cores
RAM	128 GB	64 GB
Disk	1 TB NVMe	1 TB NVMe
Network	10 GbE	10 GbE

Cloud instance recommendations (Self-Hosted)

Provider	Instance	vCPU	RAM	Notes
AWS	`r6i.2xlarge`	8	64 GB	Good baseline
AWS	`r6i.4xlarge`	16	128 GB	Full 100M nodes with DiskANN
GCP	`n2-highmem-8`	8	64 GB
GCP	`n2-highmem-16`	16	128 GB
Azure	`Standard_E8s_v5`	8	64 GB
Azure	`Standard_E16s_v5`	16	128 GB

Server Enterprise

Enterprise — on-prem, internal cloud, or dedicated VPC (GCP / AWS / Azure), custom pricing

What fits at enterprise scale?

Deployment	Nodes	Edges	Shards	Use case
Large enterprise	50M–200M	500M–2B	3–5	Global org + product + customer graph
Regulated industry (finance, health)	100M–500M	1B–5B	3–8	Compliance graph + RBAC + audit
Hyperscale / platform	500M–1B+	5B+	8–16	Platform-level knowledge graph
Multi-region federated graph	200M–1B	1B–10B	8–24	Cross-region data sovereignty

Recommended hardware — Enterprise

3-shard cluster with Raft HA (per shard group: 3 nodes)

Each shard group = 1 Raft leader + 2 followers (quorum = 2/3):

Component	Per shard node	Per shard group (×3)
CPU	32 cores	96 cores
RAM	256 GB	768 GB
Disk (NVMe)	2 TB	6 TB
Network	25 GbE	25 GbE

For 3 shard groups (9 nodes total): ~2.3 TB RAM, ~18 TB NVMe

Minimum viable enterprise (3 shards, no replicas)

Component	Per node	Cluster total (×3)
CPU	16 cores	48 cores
RAM	128 GB	384 GB
Disk	500 GB NVMe	1.5 TB NVMe

Cloud instance recommendations (Enterprise)

Provider	Instance	vCPU	RAM	Per-node cost (est.)
AWS	`r6i.8xlarge`	32	256 GB	~$1.61/hr
AWS	`r6i.16xlarge`	64	512 GB	~$3.22/hr
GCP	`n2-highmem-32`	32	256 GB	~$1.54/hr
Azure	`Standard_E32s_v5`	32	256 GB	~$1.67/hr

Storage sizing guide

Use this table to estimate disk requirements before provisioning:

Nodes	Avg properties	Edges (×4 nodes)	768-dim vectors	RocksDB store	HNSW index	DiskANN index	Total (est.)
100K	5 props	400K	Yes	~500 MB	~300 MB	~120 MB	~1 GB
1M	5 props	4M	Yes	~5 GB	~3 GB	~1.2 GB	~10 GB
10M	5 props	40M	Yes	~50 GB	~30 GB	~12 GB	~95 GB
100M	5 props	400M	Yes	~500 GB	~300 GB	~120 GB	~950 GB
1B	5 props	4B	Yes	~5 TB	~3 TB	~1.2 TB	~9.5 TB

With AES-256-GCM encryption (KMS enabled): add ~10% overhead to the RocksDB store column.

Vector backend decision guide

Backend	RAM required	Disk required	Best for
`hnswlib` (default)	High (full index in RAM)	Low	≤ 5M nodes, fast queries, lots of RAM
`hnswlib` + `int8`	Medium (4× compression)	Low	5M–25M nodes, good balance
`hnswlib` + `binary`	Low (32× compression)	Low	25M+ nodes, approximate recall acceptable
`diskann`	Low (2 GB RAM)	High (NVMe required)	Any scale, memory-constrained hosts

Set with:

bash

P8G_VECTOR_BACKEND=diskann           # switch to DiskANN
P8G_VECTOR_QUANTIZATION=int8         # or int8 / binary for hnswlib

Feature matrix by tier

Feature	Desktop Free	Desktop Pro	Pro Cloud	Cloud Plus	Self-Hosted Server	Enterprise
Cypher / Search / Traversal	✅	✅	✅	✅	✅	✅
Full-text (BM25)	✅	✅	✅	✅	✅	✅
Export / Import	✅	✅	✅	✅	✅	✅
ACID transactions	✅	✅	✅	✅	✅	✅
KMS encryption	❌	✅ (local)	✅	✅	✅	✅
WAL durability	❌	✅	✅	✅	✅	✅
CDC / EventBus	❌	✅	✅	✅	✅	✅
Journey engine	❌	✅	✅	✅	✅	✅
Human-in-the-loop	❌	✅	✅	✅	✅	✅
Connectors	❌	✅	✅	✅	✅	✅
Graph algorithms	❌	✅	✅	✅	✅	✅
OLAP analytics engine	❌	✅	✅	✅	✅	✅
Backup / restore	❌	Local	Managed	Managed + PITR	Self-managed	Self-managed
Read replicas	❌	Local	❌	✅	✅	✅
Raft HA	❌	❌	❌	❌	✅	✅
Sharding + federation	❌	❌	❌	❌	❌	✅
RBAC	❌	❌	❌	❌	❌	✅
SSO (OIDC/SAML)	❌	❌	❌	❌	❌	✅
Audit trail	❌	❌	❌	❌	❌	✅
VPC / on-prem deploy	❌	❌	❌	Add-on	✅	✅
Node limit	1M	10M	25M	100M	Unlimited	Unlimited
Queries/min	1,000	5,000	5,000	20,000	Unlimited	Unlimited
Support	Community	Email	Email	Priority	Priority	24/7 SLA

System Requirements ​

Tiers at a glance ​

Desktop Free & Desktop Pro ​

What fits? ​

Recommended hardware — Desktop tiers ​

Pro Cloud & Cloud Plus ​

What fits? ​

Managed infra specs (for reference) ​

Self-Hosted Pro & Server ​

What fits? ​

Recommended hardware — Self-Hosted ​

Single node (up to ~50M nodes) ​

With Raft HA + read replica (up to ~100M nodes) ​

Server Enterprise ​

What fits at enterprise scale? ​

Recommended hardware — Enterprise ​

3-shard cluster with Raft HA (per shard group: 3 nodes) ​

Minimum viable enterprise (3 shards, no replicas) ​

Storage sizing guide ​

Vector backend decision guide ​

Feature matrix by tier ​

System Requirements

Tiers at a glance

Desktop Free & Desktop Pro

What fits?

Recommended hardware — Desktop tiers

Pro Cloud & Cloud Plus

What fits?

Managed infra specs (for reference)

Self-Hosted Pro & Server

What fits?

Recommended hardware — Self-Hosted

Single node (up to ~50M nodes)

With Raft HA + read replica (up to ~100M nodes)

Server Enterprise

What fits at enterprise scale?

Recommended hardware — Enterprise

3-shard cluster with Raft HA (per shard group: 3 nodes)

Minimum viable enterprise (3 shards, no replicas)

Storage sizing guide

Vector backend decision guide

Feature matrix by tier