Deployment Patterns

This guide covers different Dgraph deployment patterns, from simple development setups to production-grade highly available clusters.

Pattern Selection Guide

Pattern          Use Case                            Nodes               HA    Sharding
Basic            Dev/Test, non-critical production   1 Zero, 1 Alpha     No    No
HA               Production, <1TB                    3 Zeros, 3 Alphas   Yes   No
Distributed      Dev/Test with large datasets        1 Zero, 2+ Alphas   No    Yes
Distributed HA   Large production, >10TB             3 Zeros, 6+ Alphas  Yes   Yes

Learning Environment

Best for: First-time users, tutorials, local experimentation

Docker

docker run -d -p 8080:8080 -p 9080:9080 \
  -v ~/dgraph:/dgraph \
  --name dgraph \
  dgraph/standalone:latest

Includes: Zero + Alpha in one container
Access: HTTP API at http://localhost:8080, gRPC at localhost:9080
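
To confirm the container is serving, hit Alpha's health endpoint (the response shape varies slightly across versions):

curl http://localhost:8080/health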

Command Line (Linux)

# Terminal 1: Start Zero
dgraph zero --my=localhost:5080

# Terminal 2: Start Alpha
dgraph alpha --my=localhost:7080 --zero=localhost:5080
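
With both processes running, a minimal write-and-read round trip over Alpha's HTTP API confirms the setup end to end (the Content-Types below are for recent versions; older releases used application/graphql+- for queries):

# Store one triple; commitNow applies it immediately
curl -H "Content-Type: application/rdf" \
  "localhost:8080/mutate?commitNow=true" \
  -d '{ set { _:alice <name> "Alice" . } }'

# Read it back with a DQL query
curl -H "Content-Type: application/dql" \
  localhost:8080/query \
  -d '{ q(func: has(name)) { name } }'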

Key Flags:

  • --my: Address and port that other nodes will connect to (default: localhost:5080 for Zero, localhost:7080 for Alpha)
  • --zero: Alpha connects to Zero using this address
  • --wal: Directory for write-ahead log entries (default: zw for Zero, w for Alpha)
  • --postings: Directory for Alpha data storage (default: p)
  • --bindall: Set to true for machine-to-machine communication (default: true)

Pros: Quick to start, minimal resources
Cons: No HA, no persistence guarantees, not for production


Basic Cluster

Best for: Development teams, staging environments, CI/CD

Architecture

┌──────────────┐
│  Dgraph Zero │ :5080, :6080
└──────┬───────┘
       │
┌──────▼───────┐
│ Dgraph Alpha │ :7080, :8080, :9080
└──────────────┘

Docker Compose

version: "3.8"
services:
  zero:
    image: dgraph/dgraph:latest
    volumes:
      - dgraph-zero:/dgraph
    ports:
      - 5080:5080
      - 6080:6080
    command: dgraph zero --my=zero:5080
    
  alpha:
    image: dgraph/dgraph:latest
    volumes:
      - dgraph-alpha:/dgraph
    ports:
      - 8080:8080
      - 9080:9080
    command: dgraph alpha --my=alpha:7080 --zero=zero:5080

volumes:
  dgraph-zero:
  dgraph-alpha:

Start with: docker-compose up -d
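
Once the containers are up, Zero's /state endpoint (port 6080) shows cluster membership and Alpha's /health endpoint confirms it is serving:

docker-compose ps            # both services should be running
curl localhost:6080/state    # Zero's view of the cluster
curl localhost:8080/health   # Alpha health check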

Command Line

# Start Zero
dgraph zero --my=<IP>:5080 -w data/zero

# Start Alpha
dgraph alpha --my=<IP>:7080 --zero=<ZERO_IP>:5080 -p data/alpha/p -w data/alpha/w

Flag Reference:

  • -w / --wal: Directory for write-ahead log entries
  • -p / --postings: Directory for Alpha data storage
  • --bindall: Set to true for network communication (default: true)
  • --v=2: Recommended log verbosity level for better diagnostics

Pros: Realistic setup, persistent storage
Cons: No HA, single point of failure


HA Cluster

Best for: Production workloads up to ~1TB

Architecture

Zero Cluster (3 nodes) - Raft Group 0
    ├─ Zero 1 :5080 (Leader)
    ├─ Zero 2 :5080 (Follower)
    └─ Zero 3 :5080 (Follower)
         │
Alpha Group 1 (3 replicas) - Raft Group 1
    ├─ Alpha 1 :7080, :8080, :9080 (Leader)
    ├─ Alpha 2 :7080, :8080, :9080 (Follower)
    └─ Alpha 3 :7080, :8080, :9080 (Follower)

Setup Steps

1. Start Zero Cluster:

# Zero 1 (on host1) - First Zero initializes the cluster
dgraph zero --my=host1:5080 --raft "idx=1" --replicas=3

# Zero 2 (on host2) - Uses --peer to join existing cluster
dgraph zero --my=host2:5080 --raft "idx=2" --peer=host1:5080

# Zero 3 (on host3) - Uses --peer to join existing cluster
dgraph zero --my=host3:5080 --raft "idx=3" --peer=host1:5080

Important Notes:

  • Raft IDs: Each Zero node must have a unique Raft ID set via --raft "idx=N". Dgraph does not auto-assign Raft IDs to Zero nodes.
  • Cluster Initialization: The first Zero node starts the cluster. All subsequent Zero nodes must use --peer=<first-zero-address> to join the existing cluster. If --peer is omitted, a new independent cluster will be created.
  • Replication: The --replicas=3 flag on Zero controls how many Alpha replicas will be in each Alpha group.
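
Before starting the Alphas, it's worth confirming that all three Zeros joined one cluster. Zero serves its membership state over HTTP on port 6080 (a quick check; jq is optional):

# Any Zero can answer; all three should be listed with their Raft IDs
curl -s host1:6080/state | jq '.zeros'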

2. Start Alpha Cluster:

# Alpha 1 (on host1)
dgraph alpha --my=host1:7080 --zero=host1:5080,host2:5080,host3:5080

# Alpha 2 (on host2)
dgraph alpha --my=host2:7080 --zero=host1:5080,host2:5080,host3:5080

# Alpha 3 (on host3)
dgraph alpha --my=host3:7080 --zero=host1:5080,host2:5080,host3:5080

Important Notes:

  • Zero Connection: Alphas can connect to any Zero in the cluster; list all Zeros for redundancy.
  • Group Assignment: Zero automatically assigns Alphas to groups based on the --replicas setting. With --replicas=3, the first 3 Alphas join Group 1.
  • Alpha Raft IDs: Unlike Zero nodes, Alpha nodes receive their Raft IDs automatically from Zero.
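
Once the Alphas are up, the same /state endpoint shows whether they registered and formed a single group (field names per recent versions):

# All three Alphas should appear under group "1"
curl -s host1:6080/state | jq '.groups."1".members'

# Each Alpha also exposes its own health endpoint
curl host1:8080/health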

Kubernetes (Helm)

helm repo add dgraph https://charts.dgraph.io
helm install my-dgraph dgraph/dgraph \
  --set zero.replicaCount=3 \
  --set alpha.replicaCount=3
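
After the release is installed, verify the pods and run a health check. The resource names below assume the chart's default naming with release name my-dgraph; they may differ across chart versions:

kubectl get pods
# Expect my-dgraph-dgraph-zero-0..2 and my-dgraph-dgraph-alpha-0..2

# Forward Alpha's HTTP port and check health locally
kubectl port-forward svc/my-dgraph-dgraph-alpha 8080:8080 &
curl localhost:8080/health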

Characteristics:

  • Tolerates 1 node failure (in each group)
  • All data replicated 3x
  • No sharding (all predicates on all Alphas)
  • Suitable for datasets up to ~1TB

Pros: High availability, automatic failover
Cons: Storage scales vertically only


Distributed (Multi-Group) - Basic

Sharding, No HA

Best for: Development with large datasets (>10TB)

Architecture

┌──────────────┐
│  Dgraph Zero │ :5080
└──────┬───────┘
       │
       ├─ Group 1: Alpha 1 :7080
       ├─ Group 2: Alpha 2 :7081 (port offset)
       └─ Group 3: Alpha 3 :7082 (port offset)

Setup (Single Host with Port Offsets)

# Start Zero with replicas=1 (no replication)
dgraph zero --my=localhost:5080 --replicas=1

# Start Alpha nodes with port offsets
dgraph alpha --my=localhost:7080 --zero=localhost:5080 -p data/p1 -w data/w1
dgraph alpha --my=localhost:7081 --zero=localhost:5080 -p data/p2 -w data/w2 -o 1
dgraph alpha --my=localhost:7082 --zero=localhost:5080 -p data/p3 -w data/w3 -o 2
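
With --replicas=1, Zero places each Alpha in its own group, which you can confirm via /state. Zero also exposes /moveTablet for relocating a predicate to a specific group by hand (the predicate "name" below is just an example):

# Expect three groups, one member each
curl -s localhost:6080/state | jq '.groups | keys'

# Move the "name" predicate to group 2 (Zero also rebalances tablets on its own)
curl "localhost:6080/moveTablet?tablet=name&group=2"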

Characteristics:

  • 3 Alpha groups (no replication within groups)
  • Data sharded by predicate across groups
  • Horizontal storage scaling
  • No fault tolerance

Pros: Horizontal scalability, handles large datasets
Cons: No HA; losing any Alpha takes its shard offline (and loses data if its disk is lost)


Distributed - HA (Production Large-Scale)

Best for: Production workloads >10TB, high traffic, mission-critical

Architecture

Zero Cluster (3 nodes)
    └─ Replicates cluster metadata
         │
         ├─ Group 1: Alpha 1,2,3 (3 replicas)
         │   └─ Predicates: name, age, email
         │
         ├─ Group 2: Alpha 4,5,6 (3 replicas)
         │   └─ Predicates: friend, follows
         │
         └─ Group 3: Alpha 7,8,9 (3 replicas)
             └─ Predicates: location, company

Setup (9 Alpha Nodes across 3 Hosts)

Zeros (3 nodes):

# Host 1: Zero 1
dgraph zero --my=host1:5080 --raft "idx=1" --replicas=3

# Host 2: Zero 2
dgraph zero --my=host2:5080 --raft "idx=2" --peer=host1:5080

# Host 3: Zero 3
dgraph zero --my=host3:5080 --raft "idx=3" --peer=host1:5080

Alphas (3 groups × 3 replicas = 9 nodes):

# Host 1: Alphas 1, 4, 7
dgraph alpha --my=host1:7080 --zero=host1:5080,host2:5080,host3:5080 -p p1 -w w1
dgraph alpha --my=host1:7081 --zero=host1:5080,host2:5080,host3:5080 -p p4 -w w4 -o 1
dgraph alpha --my=host1:7082 --zero=host1:5080,host2:5080,host3:5080 -p p7 -w w7 -o 2

# Host 2: Alphas 2, 5, 8
dgraph alpha --my=host2:7080 --zero=host1:5080,host2:5080,host3:5080 -p p2 -w w2
dgraph alpha --my=host2:7081 --zero=host1:5080,host2:5080,host3:5080 -p p5 -w w5 -o 1
dgraph alpha --my=host2:7082 --zero=host1:5080,host2:5080,host3:5080 -p p8 -w w8 -o 2

# Host 3: Alphas 3, 6, 9
dgraph alpha --my=host3:7080 --zero=host1:5080,host2:5080,host3:5080 -p p3 -w w3
dgraph alpha --my=host3:7081 --zero=host1:5080,host2:5080,host3:5080 -p p6 -w w6 -o 1
dgraph alpha --my=host3:7082 --zero=host1:5080,host2:5080,host3:5080 -p p9 -w w9 -o 2

Group Assignment:

  • Zero automatically assigns Alphas 1,2,3 → Group 1
  • Zero assigns Alphas 4,5,6 → Group 2
  • Zero assigns Alphas 7,8,9 → Group 3
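
If a replica is permanently lost, Zero's /removeNode endpoint evicts it from its group so a replacement Alpha can join (the IDs below are illustrative; read the real ones from /state first):

# Find the dead Alpha's Raft ID and group in the membership state
curl -s host1:6080/state | jq '.groups'

# Remove the Alpha with Raft ID 5 from group 2, then start a fresh Alpha
curl "host1:6080/removeNode?group=2&id=5"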

Characteristics:

  • 3 groups with 3x replication each
  • Tolerates 1 node failure per group
  • Data sharded across groups
  • All predicates replicated 3x within their group

Pros: Maximum scalability and availability
Cons: Higher operational complexity, more resources


Configuration Flags Reference

Common Flags

Flag              Component    Description                                         Default
--my              Zero/Alpha   Address:port that other nodes connect to            localhost:5080 (Zero), localhost:7080 (Alpha)
--zero            Alpha        Address(es) of Zero node(s) to connect to           Required
--peer            Zero         Address of an existing Zero to join                 None (creates a new cluster if omitted)
--raft "idx=N"    Zero         Unique Raft ID for the Zero node (required for HA)  1
--replicas        Zero         Number of Alpha replicas per group                  1
-w / --wal        Zero/Alpha   Directory for write-ahead log entries               zw (Zero), w (Alpha)
-p / --postings   Alpha        Directory for data storage                          p
--bindall         Zero/Alpha   Bind to 0.0.0.0 for network access                  true
--v               Zero/Alpha   Log verbosity level (2 recommended)                 0

Configuration Methods: Flags can be set via command-line arguments, environment variables, or configuration files. See Config for details.
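
As a sketch of the file-based approach (YAML keys mirror the flag names; exact handling of superflags varies by version), assuming the paths below:

# config.yml -- the same flags as on the command line
cat > config.yml <<EOF
my: localhost:7080
zero: localhost:5080
postings: /data/p
wal: /data/w
EOF

dgraph alpha --config config.yml

# Environment variables follow the DGRAPH_<COMPONENT>_<FLAG> pattern
DGRAPH_ALPHA_ZERO=localhost:5080 dgraph alpha --my=localhost:7080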

Best Practices

Node Placement

  1. Different Physical Hosts: Run each replica on a separate machine
  2. Availability Zones: Distribute across 3 AZs when possible
  3. Network Latency: Keep inter-node latency <5ms for best performance

Resource Planning

Deployment         CPUs/Node   RAM/Node   Disk/Node
Development        2 cores     4GB        50GB
Small Production   8 cores     16GB       250GB SSD
Large Production   16 cores    32GB       1TB NVMe

Scaling Strategy

Vertical First:

  1. Start with HA single-group (3 Alphas)
  2. Increase CPU/RAM per node as load grows

Horizontal When:

  1. Dataset >1TB
  2. Query latency increases despite vertical scaling
  3. Need to isolate hot predicates

Add Alphas three at a time (one full group) to maintain the replication factor.


Deployment Checklist

Before production deployment:

  • Set --replicas=3 on Zero nodes
  • Configure persistent storage volumes
  • Enable TLS for client connections
  • Set up IP whitelisting for admin endpoints
  • Configure monitoring (Prometheus/Grafana)
  • Set up binary backups (Enterprise)
  • Test failover scenarios
  • Document cluster topology
  • Plan capacity for 2x growth
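
For the IP-whitelisting item, recent Dgraph versions group it under the --security superflag (older releases used a standalone --whitelist flag). A sketch, assuming internal RFC 1918 ranges:

# Allow admin operations only from internal networks (v21+ syntax)
dgraph alpha --my=host1:7080 --zero=host1:5080,host2:5080,host3:5080 \
  --security "whitelist=10.0.0.0/8,172.16.0.0/12"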

Next Steps