Run a Validator
Validator Hosting
Below are the suggestions for properly hosting a validator node in a Polygon Edge network. Please pay careful attention to all the items listed below to make sure that your validator setup is properly configured to be secure, stable and performant.
Knowledge base
Before trying to run the validator node, please read through this document thoroughly. Additional docs that might be helpful are:
Minimum system requirements
Type | Value | Influenced by |
---|---|---|
CPU | 2 cores |
|
RAM | 2 GB |
|
Disk |
|
|
Service configuration
polygon-edge
binary needs to run as a system service automatically upon established network connectivity and have start / stop / restart functionalities. We recommend using a service manager like systemd.
Example systemd
system configuration file:
Binary
In production workloads polygon-edge
binary should only be deployed from pre-built GitHub release binaries - not by manually compiling.
INFO
By manually compiling develop
GitHub branch, you may introduce breaking changes to your environment.
For that reason it is recommended to deploy Polygon Edge binary exclusively from releases, as it will contain information about breaking changes and how to overcome them.
Please refer to Installation for a complete overview of installation method.
Data storage
The data/
folder containing the entire blockchain state should be mounted on a dedicated disk / volume allowing for automatic disk backups, volume extension and optionally mounting the disk/volume to another instance in case of failure.
Log files
Log files need to be rotated on a daily basis (with a tool like logrotate
).
DANGER
If configured without log rotation, log files could use up all the available disk space which could disrupt the validator uptime.
Example logrotate
configuration:
Refer to the Logging section below for recommendations on log storage.
Additional dependencies
polygon-edge
is statically compiled, requiring no additional host OS dependencies.
Maintenance
Below are the best practices for maintaining a running validator node of a Polygon Edge network.
Backup
There are two types of backup procedures recommended for Polygon Edge nodes.
The suggestion is to use both, whenever possible, with the Polygon Edge backup being an always available option.
Volume backup: Daily incremental backup of the
data/
volume of the Polygon Edge node, or of the complete VM if possible.Polygon Edge backup: Daily CRON job which does regular backups of Polygon Edge and moves the
.dat
files to an offsite location or to a secure cloud object storage is recommended.
The Polygon Edge backup should ideally not overlap with the Volume backup described above.
Refer to Backup/restore node instance for instructions on how to perform backups of Polygon Edge.
Logging
The logs outputted by the Polygon Edge nodes should:
be sent to an external data store with indexing and searching capabilities
have a log retention period of 30 days
If this is your first time setting up a Polygon Edge validator, we recommend to start the node with the --log-level=DEBUG
option to be able to quickly debug any issues you might face.
INFO
The --log-level=DEBUG
will make the node's log output be as verbose as possible.
Debug logs will drastically increase the size of the log file which must be taken into account when setting up log rotation solution.
OS security patches
Administrators need to ensure that the validator instance OS is always updated with the latest patches at least once every month.
Metrics
System metrics
Administrators need to setup some kind of system metrics monitor, (e.g. Telegraf + InfluxDB + Grafana or a 3rd party SaaS).
Metrics that need to be monitored and that need to have alarm notifications setup:
Metric name | Alarm threshold |
---|---|
CPU usage (%) | > 90% for more than 5 minutes |
RAM utilization (%) | > 90% for more than 5 minutes |
Root disk utilization | > 90% |
Data disk utilization | > 90% |
Validator metrics
Administrators need to setup collection of metrics from Polygon Edge's Prometheus API to be able to monitor the blockchain performance.
Refer to Prometheus metrics to understand which metrics are being exposed and how to set up Prometheus metric collection.
Extra attention needs to be paid to the following metrics:
Block production time - if block production time is higher than normal, there is a potential problem with the network
Number of consensus rounds - if there is more than 1 round, there is a potential problem with the validator set in the network
Number of peers - if the number of peers drop, there is a connectivity issue in the network
Security
Below are the best practices for securing a running validator node of a Polygon Edge network.
API services
JSON-RPC - Only API service that needs to be exposed to the public ( via load balancer or directly ). This API should run on all interfaces or on a specific IP address ( example:
--json-rpc 0.0.0.0:8545
or--json-prc 192.168.1.1:8545
).INFOAs this is publicly facing API it is recommended to have a load balancer / reverse proxy in front of it to provide security and rate limiting.
LibP2P - This is the networking API used by nodes for peer communication. It needs to run on all interfaces or on a specific ip address (
--libp2p 0.0.0.0:1478
or--libp2p 192.168.1.1:1478
). This API should not be publicly exposed, but it should be reachable from all other nodes.INFOIf ran on localhost (
--libp2p 127.0.0.1:1478
) other nodes will not be able to connect.GRPC - This API is used only for running operator commands and noting else. As such it should run exclusively on localhost (
--grpc-address 127.0.0.1:9632
).
Polygon Edge secrets
Polygon Edge secrets ( ibft
and libp2p
keys ) should not be stored on a local file system.
Instead, a supported Secret Manager should be used.
Storing secrets to local file system should only be used in non-production environments.
Update
Following is the desired update procedure for validator nodes, described as step-by-step instructions.
Update procedure
Download the latest Polygon Edge binary from the official GitHub releases
Stop the Polygon Edge service ( example:
sudo systemctl stop polygon-edge.service
)Replace the existing
polygon-edge
binary with the downloaded one ( example:sudo mv polygon-edge /usr/local/bin/
)Check if correct
polygon-edge
version is in place by runningpolygon-edge version
- it should correspond to the release versionCheck the release documentation if there are any backwards compatibility steps needed to be done before starting
polygon-edge
serviceStart
polygon-edge
service ( example:sudo systemctl start polygon-edge.service
)Finally, check the
polygon-edge
log output and make sure everything is running without any[ERROR]
logs
DANGER
When there is a breaking release, this update procedure must be performed on all nodes as the currently running binary is not compatible with the new release.
This means that the chain must be halted for a short period of time ( until polygon-edge
binaries are replaced and service restarted ) so plan accordingly.
You can use tools like Ansible or some custom script to perform the update efficiently and minimize chain downtime.
Startup procedure
Following is the desired flow of the startup procedure for Polygon Edge validator
Read through the docs listed in Knowlege Base section
Apply the latest OS patches on the validator node
Download the latest
polygon-edge
binary from the official GitHub releases and place it in local instancePATH
Initialize one of the supported secrets managers using
polygon-edge secrets generate
CLI commandGenerate and store secrets using
polygon-edge secrets init
CLI commandTake note of
NodeID
andPublic key (address)
valuesGenerate
genesis.json
file as described in cloud setup usingpolygon-edge genesis
CLI commandGenerate default config file using
polygon-edge server export
CLI commandEdit
default-config.yaml
file to accommodate local validator node environment ( file paths, etc. )Create a Polygon Edge service (
systemd
or similar ) wherepolygon-edge
binary will run the server from adefault-config.yaml
fileStart Polygon Edge server by starting the service ( example:
systemctl start polygon-edge
)Check the
polygon-edge
log output and make sure the blocks are being generated and that there are no[ERROR]
logsCheck the chain functionality by calling a JSON-RPC method like
eth_chainId
Last updated