Infrastructure Setup Guide
graph TD
A[Validator Node] -->|Private Network| B[Sentry Nodes]
B -->|Public Network| C[Internet]
A -->|Metrics| D[Monitoring]
D -->|Alerts| E[Operations Team]
Loading
Validator Node:
CPU: 8+ cores
RAM: 64GB+
Storage: 4TB+ NVMe SSD
Network: 1Gbps+
Sentry Nodes (each):
CPU: 4+ cores
RAM: 32GB
Storage: 2TB SSD
Network: 1Gbps
# Private Network (validator-sentry)
Interface: eth1
Network: 10.0.0.0/24
Firewall: Allow only necessary ports
# Public Network (sentry-internet)
Interface: eth0
Firewall: Rate limiting enabled
DDoS Protection: Enabled
# System preparation
apt update && apt upgrade -y
apt install ufw fail2ban prometheus node-exporter
# Create service user
adduser cosmos --disabled-password
usermod -aG sudo cosmos
# UFW setup
ufw default deny incoming
ufw default allow outgoing
ufw allow 22/tcp
ufw allow 26656/tcp # P2P for sentry only
ufw enable
# fail2ban
cp /etc/fail2ban/jail.conf /etc/fail2ban/jail.local
systemctl enable fail2ban
systemctl start fail2ban
# config.toml
[p2p ]
pex = false # validator
addr_book_strict = false
persistent_peers = " sentry_node_ids"
private_peer_ids = " validator_node_id"
# app.toml
minimum-gas-prices = " 0.025uatom"
pruning = " custom"
pruning-keep-recent = " 100"
┌─────────────┐
│ Validator │
└──────┬──────┘
│
┌──────┴──────┐
│ Sentry │
└─────────────┘
┌─────────────┐
│ Validator │
└──────┬──────┘
│
┌──────┴──────┐
│ Sentry │
└─────────────┘
Monitor primary validator
Detect failures automatically
Switch to backup validator
Resume operations
# Create volumes
lvcreate -L 4T vg0 -n cosmos_data
# Mount options
mount -o noatime,data=ordered /dev/vg0/cosmos_data /data
# Daily backups
0 0 * * * /scripts/backup_blockchain.sh
# Weekly state sync
0 0 * * 0 /scripts/state_sync.sh
# Kernel parameters
net.core.rmem_max=2097152
net.core.wmem_max=2097152
net.ipv4.tcp_rmem=4096 87380 2097152
net.ipv4.tcp_wmem=4096 65536 2097152
# Systemd service
[Service]
User=cosmos
LimitNOFILE=65535
LimitNPROC=65535
Regular state snapshots
Configuration backups
Key management
Recovery documentation
# Quick recovery
1. Deploy new instance
2. Restore configuration
3. Import keys
4. Sync blockchain
5. Resume validation
Monthly recovery tests
Quarterly failover tests
Annual full DR test
Documentation Requirements
1. Configuration Management
Node configurations
Network layouts
Security settings
Update procedures
2. Operational Procedures
Start/stop procedures
Upgrade processes
Emergency responses
Contact information
Metrics collection
Alert configurations
Dashboard setups
Response procedures