multiversx / mx-chain-go Goto Github PK

View Code? Open in Web Editor NEW

917.0 68.0 199.0 324.69 MB

⚡ The official implementation of the MultiversX blockchain protocol, written in golang.

Home Page: https://multiversx.com

License: GNU General Public License v3.0

Go 99.55% Dockerfile 0.02% Shell 0.42% Makefile 0.02%

blockchain golang multiversx cryptocurrency decentralized go p2p

mx-chain-go's People

Contributors

Stargazers

Watchers

Forkers

touhonoob cryptoxpointe longhuynh2811 rvr7 ilhamm02 mikehibbert lucianmincu toniweak beefe andreyvelde ryiuk flaviuss22 aa0231 wen1123 paulci6 kristie-dude alexpatru edwardtarantulo collinsjie bellamyblack mihaim05 mirugame tramyargabam grazzza plachpalach kurochkagril an0224 victoroshi lohit03 zillabytes oldboyiii exturuder forcedeltn randyarl joesixpack mystic999 lenin1218 bebokute pulsar5991 shoot13 gbuchukuri azon15 encryptedbear yusufemre00 kemaihon imeowyou alinporumb fxfactorial fahadahmed5 adrianslr natebolam alain-michel-pilot tronwallet benwhitejam bliiitz soroush01110011 jun0tpyrc donsn z0uz dkelljr rfesi tj327 radubarbos kimdj mielucristian zhiqiangxu claudiulataretu rqzrqh cojictin sfrias sava-alex flemisharchers stjordanis miiu96 isgasho heimofficial syriak bogdanndamian silviutroscot mihaieremia mariailoaie racoltacalin warlord-dev ctindogaru costin567890 zonaut dorucioclea kolbon4ik alexandru-calinoiu cryptoshit jonathanvillareal485 alexandru-g dinhduongha andrei-mv symcryptosis bogdan-rosianu italoadler dragos-rebegea panoptesdev ookamikb

mx-chain-go's Issues

Nodes closing v.1.0.11

I have 3 nodes running in a VPS (8 cores, 30gb ram and 800gb ssd)
In the last 3 days I saw that at some point some nodes go down. Attach you have the fatalerrors log file of the last one.
37b12118f12f_fatalErrors-2019-08-08-06-54-57.log

Ask me anything and I will upload what you need

Negative values on age

I checked my balance on https://testnet.elrond.com/?fbclid=IwAR0oziqDY3Edk0D18Hfxp5AKTQ0PJBR8ov0iLhgs50M-mRD_bwowNL-RX1U#/address/32ef1f68e26a3036d7550ca81c30031ba2d207b731ab077ac912ff38a708fae8 and I've seen that same ages have negative values.

port opening problem windows 10

why problem with connecting udp 123

sync validation error testnet v1.0.11

I encountered a sync validation error when resetting the node at the time. After resetting the node I was forced to run synchronously at nonce 56322, the front panel was synchronized, I was repeated 3 times before, currently I am in shard 1. is there a way to fix the error? this?

node is connected to 0 peers on v1.0.15

After (node) restart, it started to run normally.

$GOPATH not defined in README.md

First users installing elrond-go from the instructions in README.md will not have set their $GOPATH yet. Yet it is referenced in:
https://github.com/ElrondNetwork/elrond-go/blob/master/README.md

Solution could be something like:
# set $GOPATH if not set and export to ~/.profile along with Go binary path
if [[ $GOPATH=="" ]]; then
GOPATH="$HOME/go"
fi

occasional nonce errors on v1.0.8 (bd54836)

I've checked the log and found these two non repeating errors, on shard 2:

elrond- go/consensus/spos/commonSubround/subroundBlock.go","level":"info","line_number":330,"msg":"canceled round 12613 in subround (BLOCK), wrong nonce in block\n","time":"2019-07-11T07:00:55+03:00"}

elrond-go/process/block/baseProcess.go","level":"info","line_number":115,"msg":"nonce not match: local block nonce is 11945 and node received block with nonce 11947\n","time":"2019-07-11T07:00:55+03:00"}

Full log:

1fbc7a0835b9-2019-10-07-16-54-45.zip

v1.0.23 lots of RAM used in OS Win10

The node consumes too much data Ram in the direction of increasing steadily

Up to now has restarted Nodes 2 - 3 times!

45ef903587fe.zip
399dea8949e4.zip

Perpetual sync for block that doesn't exist yet

Started a fresh observe, left it syncing, but it never arrives because it's syncing for a block that doesn't exist yet.

As can be seen in this first screenshot, it's trying to sync to 22757 block while 214?? is the highest final block sync.

After ctrl+c and restart ./node it immediately comes back as synchronised. As can be seen in this second screenshot, from briefly after, it's now synced on 21496 block, significantly lower than which it was previously trying.

It seems it was originally trying to sync to "cross check block height: meta", which is ahead.

Half an hour later the obervser is still reporting it's in sync, on 21746 block, still lower than what it was trying to sync to. (See third screenshot.)

(Server time is UTC, as noted in the tmux right lower corner.)

time is out / tx pool over 100k v1.0.11 (branch, commit 0ed2c93)

ubuntu server 18.04, shard 2, code from branch above, started node with reset storage:

The log contains two sets of errors, the first with "too many open files" stopped after running "ulimit -n 65535" and restarting.

I believe the second error started after this, "time is out" with 104k transactions in the pool

time is out log shard 2.zip

edit:
I've updated the node to commit 8005ec and I get 'time is out' as well, but tx pool is 0 now

log after restart on newer commit.zip

Fork detected loop error on v1.0.11

Fork was detected at a certain nonce and hash and it started the rollback mechanism to a previous header with nonce and hash and after that it ended the fork choice. After that it started to request the same header with the same hash as the one where fork was detected and went into a loop, with rollback, request, fork detected, rollback, request, fork detected and so on.

Logs: http://s000.tinyupload.com/?file_id=22154264996387362718

Feature request - uptime in node output

I'd like to suggest to print the time a node process has been running in the output, this should make troubleshooting (others) easier. That way we can tell if someone successfully restarted their node or not.

shard header round is lower than last committed for this shard

logs.zip

Stream Reset Issue

Community bug hunt contribution - Testnet V1.0.5

Bugs

Fork choice detected (After more than 12 sec of latency in which one proposer tried to write on disk and then to broadcast the valid block with nonce n, the network already proposed another block with the same nonce n, and also the next proposer broadcast the block with nonce n + 1. Before the rest of the network to sync this block with nonce n + 1, they also received the block with nonce n from the slow proposer. Because they didn't sync yet the block with nonce n + 1 and the final block is still n - 1 (as k finality is 1) they will start a fork choice on the block with nonce n (as they have now two block with nonce n but with different round inside). As the rule is that when two VALID blocks have the same nonce, the one with the lower round will be selected, they will try to request the first block with nonce n. The problem is that the only one who got this block is the slower proposer, but because he also received the new block with nonce n + 1 which has been broadcast by the proposer of that round, this caused a fail of validity when it verified it against his own block with nonce n, and for this reason he considered that he is also on the fork and rolled back to the final block with nonce n - 1, and removed its own proposed block with nonce n. From now nobody will have this block anymore and all the consensus group from that specific shard is stuck, as nobody can resolve the request of this first block with nonce n. The solution for this case is that nobody from network should accept blocks, even they are valid, with the round inside them, lower than the current round - k finality. In this case any slow proposer has time (its round + next one) to broadcast the block to the network as this block to be taken into consideration.
Note on k finality

Block finality: block N is final only if block N-1, block N-2 … block N-K are signed. Metachain only notarized final blocks. Currently we have chosen K = 1;
You can read more about the K-finality here

Problem with the meta blocks which are added into node's shard pool but not notarised until the first cross shard transaction will happen. (this will cause that if there are many meta headers not notarised between two cross shard transactions with destination in node's shard, there is no enough time (in 4 seconds) to compute the validity checks for all of them and for this reason no more cross shard txs could be validated form that point (all will remain in state pending). The solution is that all the hashes from the validated metablocks, until that point, to be added in the creation of each shard block even if there are no cross shard transactions with destination in self shard. In this way the pool is emptied each round and the validation of the next cross shard transaction has to be fast as there are no many meta headers remained to be validated.

Enhancements

When a boostrap from storage starts it should feed again the elastic from the first block if the node is launch with a specific flag in this sense
Change "Insufficient founds" print on transaction validation process from info to debug

Join the validator community chat on Riot: https://riot.im/app/#/room/#elrond:matrix.org

v1.0.19 - shard 0 - constant cpu 100% - logs attached

logs.zip

header in metablock is not final (on branch, commit 9f4e529)

Observer connected to shard 2, using the code from branch "elrond-go-issue-248-Wrong-nonce-in-block".

This error is repeating, and it did not happen when I was running the observer on shard 0, same node version:

Total txs in pool: 566

header in metablock is not final
header in metablock is not final
requested header with nonce 20646 from network

received requested header with nonce 20646 from network and probable highest nonce is 49259

6702c52b8486-2019-07-07-09-39-15.zip

Wrong Nonce in Block (Forked)

logs.zip

Node Halted v1.0.11

logs.zip

Error building logger package from aarch64

I'm using this repo to run elrond-node in docker:
https://github.com/mrz1703/elrond-node

My current test machine is a Rasp py 4 with a 64 bits kernel (aarch64). When launching the first "docker-compose build" I get this error:

../../core/logger/redirectStderrLinux.go:12:9: undefined: syscall.Dup2

As stated on another github issue, dup2 is not supported on armv8:
rfjakob/gocryptfs#121

The file on "master/core/logger/redirectStderrLinux.go" should have a case for arm64 kernel in order to support dup3

Latency increased?

When playing CryptoBubbles.io, I noticed that the start time was delayed for up to 6 seconds, after having waited for 15 seconds after the previous round.
I guess this is due to a higher latency in the CB testnet? I also noticed on the CB testnet Explorer that the most recent transactions were sometimes visible after more than 20 seconds.
Any clues about this?

Edit: Latency now 7 seconds in the CB Testnet Explorer, so today, no extra delays! But... also just a small amount of CB players. ;)

node v1.0.15 build fail with go1.13 on Linux

I've connected remotely to someone who was having issues setting up the node on Linux.

I've recommended the _golang.sh script from elrond-go-scripts, but there's a problem, it downloads go1.13, and this is not compatible with v1.0.15, it fails at compile time.

In the end I had to remove go1.13 and install go1.12.9 and the problem goes away

mkdir access denied

In Windows, if you run the node with the parameter --storage-cleanup, you get the following error:
"error creating trie: error creating accountsTrieStorage: mkdir 8c1f16875103: Access is denied."
if the folder 8c1f16875103 already exists.

elrond-go fails to build on arm64 / aarch64 (redirectStderrLinux.go:12:9: undefined: syscall.Dup2)

Problem:
ubuntu@ubuntu:~/go/src/github.com/ElrondNetwork/elrond-go/cmd/node$ go build github.com/ElrondNetwork/elrond-go/core/logger ../../core/logger/redirectStderrLinux.go:12:9: undefined: syscall.Dup2

Solution: changes in the redirectStderrLinux.go
//+build linux darwin package logger import ( "os" "syscall" ) // redirectStderr redirects the output of the stderr to the file passed in func redirectStderr(f *os.File) error { err := syscall.Dup2(int(f.Fd()), int(os.Stderr.Fd())) if err != nil { return err } return nil }

Fix: new vestion of redirectStderrLinux.go
//+build linux darwin package logger import ( "os" "syscall" ) // redirectStderr redirects the output of the stderr to the file passed in func redirectStderr(f *os.File) error { err := syscall.Dup3(int(f.Fd()), int(os.Stderr.Fd()), os.O_CREATE|os.O_APPEND|os.O_WRONLY) if err != nil { return err } return nil }

pull request will follow shortly

Too Many Open Files

Ulimit is unlimited.

log.zip

Time is Out & Forking (v1.0.16)

Experiencing time is out, forking and/or no/slow syncing issues this run (via Docker on Ubuntu 18.04 LTS). Shard 4 logs will encompass the new node after the patch fix.

shard1.zip
shard2.zip
shard2b.zip
shard3.zip
shard3b.zip
shard4.zip
shard5.zip

Metachain does not output block in each round on high load

It appears that the metachain misses rounds if there is a high load on the shard chains. This might be caused by the fact that finality shard headers are requested by metachain validators in shards and the responses take too long.

panic: runtime error: slice bounds out of range

After 24 hours of running, my validator node has died with this message, not contained in the log file:

2019-07-04 22:04:06.355331617 Step 2: signature has been sent

panic: runtime error: slice bounds out of range

goroutine 24130 [running]:
github.com/libp2p/go-msgio.(*reader).ReadMsg(0xc0056c39f0, 0x0, 0x0, 0x0, 0x0, 0x0)
C:/Users/Sorin/go/pkg/mod/github.com/libp2p/[email protected]/msgio.go:216 +0x25f
github.com/libp2p/go-libp2p-secio.(*etmReader).fill(0xc00645cd20, 0x634, 0x0)
C:/Users/Sorin/go/pkg/mod/github.com/libp2p/[email protected]/rw.go:132 +0x3f
github.com/libp2p/go-libp2p-secio.(*etmReader).Read(0xc00645cd20, 0xc005fd9c20, 0xc, 0xc, 0x0, 0x0, 0x0)
C:/Users/Sorin/go/pkg/mod/github.com/libp2p/[email protected]/rw.go:171 +0x264
io.ReadAtLeast(0x2c919c8, 0xc00034d440, 0xc005fd9c20, 0xc, 0xc, 0xc, 0xc, 0x0, 0x0)
D:/Go/src/io/io.go:310 +0x8f
io.ReadFull(...)
D:/Go/src/io/io.go:329
github.com/libp2p/go-yamux.(*Session).recvLoop(0xc004a51860, 0x0, 0x0)
C:/Users/Sorin/go/pkg/mod/github.com/libp2p/[email protected]/session.go:499 +0xdb
github.com/libp2p/go-yamux.(*Session).recv(0xc004a51860)
C:/Users/Sorin/go/pkg/mod/github.com/libp2p/[email protected]/session.go:478 +0x32
created by github.com/libp2p/go-yamux.newSession
C:/Users/Sorin/go/pkg/mod/github.com/libp2p/[email protected]/session.go:128 +0x38f

Full log here:

1fbc7a0835b9-2019-03-07-21-07-26.zip

stats 1fbc7a0835b9-2019-03-07-21-07-26.zip

Feature request - random name when NodeDisplayName not set

I'd like to suggest that if NodeDisplayName isn't set, it gets a random name assigned, rather than everyone "N/A". (Can be random characters, based on say MAC address or public key, from a word list, etc.) This way we can help those who struggle to set it (and / or don't want to interrupt their uptime) find themselves on the validator list by
`cat $GOPATH/src/github.com/ElrondNetwork/elrond-go-node-*/config/config.toml | grep NodeDisplayName

Endless "Time is Out" Looping

"Time is out" errors is endlessly happening after upgrading to v1.07. It happens whether or not the db is wiped. Logs:

observer.zip
validator.zip

Invalid Key

Hey guys,
My v1.0.16 testnet node was running smooth, until I changed the name of the folder elrond-go to elrond-go-16. Then I run node.exe again and the firewall (Windows Defender) asked me if I should give it permission, which I answered "no" this time.

Running node.exe again, I got the Invalid Key error.

I run again node.exe with -use-log-view parameter and the last line before the app exited, were:
No views for current node
No AppStatusHandler used. Started with NilStatusHandler
could not create local data store: file missing [file=MANIFEST-000000]

Then I installed v1.0.17, built it and tried to reproduce this issue. The result is different, as the Invalid Key message appears only for a second, after which the app resumes normal operation, so it seems the issue is partially fixed.

One second later ...

Tried to reproduce it with v1.0.16 built from scratch and I get the same behaviour as in v1.0.17.
The only difference is that I disabled the AntiVirus (Avast).
After re-enabling it and trying to reproduce the issue again, node.exe crashed with the messages:
No AppStatusHandler used. Started with NilStatusHandler
closing the leveldb process loop
closing the timed batch handler
closing the timed batch handler
closing the leveldb process loop
closing the timed batch handler
closing the leveldb process loop
closing the timed batch handler
closing the leveldb process loop
could not create local data store: file missing [file=MANIFEST-000000]

The conclusion is that the issue is infact produced by the AntiVirus running node.exe in SandBox for a few seconds.

Problem after disconnecting the Internet

At night there was an unexpected shutdown of the Internet. After the restart, the node sync stuck on block 7695-7696.
On my machine two nodes work and both have problems with synchronization. One works in Windows 10, the second through VirtualBox in Windows 7. I wanted to test it. Each node has its own keys. One node is on shard 4, the second is " meta".

I decided to restart the node from the very first step by deleting the "go" folder and creating it again. I re-created the node on the manual from the site and ran it again. But that didn't solve my problem. At new synchronization I also got stuck on the block 7696

How to solve the problem ? My nickname in Riot: mystic999

logs and stats :
Node.zip

windows shutting down

windows 10 keep shutting down my nodes
04449deb2b39_fatalErrors-2019-10-10-09-28-27.log

loosing connection windows 10

node prob.pdf

could not create local data store: not supported cache type

Hi,

on windows with a laptop surface pro 6 (core i7 with 16GB of RAM) I have this error:

Starting as observer node...
Starting in shard: 0
closing the leveldb process loop
closing the timed batch handler
could not create local data store: not supported cache type

Connection to network suddenly lost

Just saw that one of multiple nodes on a VPS was suddenly offline.

So I checked and the node was still running, it just lost connection to the network somehow! Very strange. I stopped and restarted it and now it's synced again and connected to 332 peers and counting.

It was on shard 3. The machine has multiple nodes running on different REST-API ports, this one was on 8080.

Any idea what could've happened?

logs.zip
stats.zip

could not sync or create blocks

I cannot synchronize and create blocks

092823.log
https://matrix.org/_matrix/media/r0/download/matrix.org/LAkYDPPMgvVVSAwDjOXEWyXJ

Libp2p Build Error

The below happens occasionally while building. The only way to fix is to wipe the go/pkg dir (or possibly the libp2p dir in that).

Calling /node/stop twice gives incorrect responce

After curling http://localhost:8080/node/stop I get an "ok", which is expected.

However, when I call it again right after I still get and "ok" instead of the errors specified in api/node/routes.go (lines 110 - 119):


        if !ef.IsNodeRunning() {
		c.JSON(http.StatusOK, gin.H{"message": errors.ErrNodeAlreadyStopped.Error()})
		return
	}

	err := ef.StopNode()
	if err != nil && ef.IsNodeRunning() {
		c.JSON(http.StatusInternalServerError, gin.H{"error": fmt.Sprintf("%s: %s", errors.ErrCouldNotStopNode.Error(), err.Error())})
		return
	}

v1.0.23 linux connected 20 peers, lots of RAM used

logs.tar.gz

Feature request: Node uptime

It would be nice to have such a feature !

invalid public key string - arm64 / aarch64

After fixing the compilation problem for arm64 architecture, once the ./node is started, the following errors occurs:

`ubuntu@ubuntu:~/go/src/github.com/ElrondNetwork/elrond-go/cmd/node$ ./node -use-log-view
Starting node with version undefined/go1.12.9/linux-arm64

Process ID: 741

/home/ubuntu/go/src/github.com/ElrondNetwork/elrond-go/cmd/node/config/config.toml
Initialized with config from: ./config/config.toml
/home/ubuntu/go/src/github.com/ElrondNetwork/elrond-go/cmd/node/config/p2p.toml
Initialized with p2p config from: ./config/p2p.toml
/home/ubuntu/go/src/github.com/ElrondNetwork/elrond-go/cmd/node/config/genesis.json
Initialized with genesis config from: ./config/genesis.json
/home/ubuntu/go/src/github.com/ElrondNetwork/elrond-go/cmd/node/config/nodesSetup.json
Initialized with nodes config from: ./config/nodesSetup.json
NTP average clock offset: 0s
Start time formatted: Fri Sep 13 17:50:00 UTC 2019
Start time in seconds: 1568397000
/home/ubuntu/go/src/github.com/ElrondNetwork/elrond-go/cmd/node/config/initialNodesSk.pem
Starting with public key: 32854f7742a70f06b9a87ff0b62b1795a93418b65af3b2e557d2cd1ed5c0876f6ac341bdf39f1092771a7486ec03d905dd10036a4193a631c6e8a0d8958fab6334a0d6a97e2ffc3de456d8ddc52542445ed75c17944aedf2040ab22b18d8e7eb0cb9030ed38aeca03dc1131680fe0d89e1127c89e39a50167f716ed388dfacd8
Starting as observer node...
Starting in shard: 0
/home/ubuntu/go/src/github.com/ElrondNetwork/elrond-go/cmd/node/config/server.toml
No views for current node
No AppStatusHandler used. Started with NilStatusHandler
creatingShardDataPool from config
invalid public key string`

Raspberry Pi 4 - Ubuntu server 18.04 (arm64) + go1.12.9 linux/arm64

Docker Won't Detach

Not sure if this is a Docker or Elrond issue, but when running the node with --use-log-view in a Docker container and attaching to it, the ^P^Q detach command does not work.

server.toml indicates wrong parameter

After several trying to type node.exe, at least but not last i can connected again to server, i just tried the new update.bat script,,

but the problem is not in there, cause when i try again to do manual connection, the server.toml issue is back, it's prevent us to get connected to the server, after several try, at least but not last, i can get connected again,,

and then now is the time for trying to run the additional node, and it comes again, this time i just copied (just 1 file), server.toml from cmd/node/config, and overwritten it who were in Node1/config/ directory, and it did it, solve the issue by now, but i think it'll still remain, unless some one has fixing it,,

i found this issue on windows, i use windows 7 sp 1, didn't know how it goes on another os

Thank you for reading my report

multiversx / mx-chain-go Goto Github PK

mx-chain-go's People

Contributors

Stargazers

Watchers

Forkers

mx-chain-go's Issues

Bugs

Enhancements

Recommend Projects

Recommend Topics

Recommend Org