[BUG] 1. TypeError: 'Results' object is not iterable. 2. TypeError: a bytes-like object is required, not 'str' (evalne, closed)

dru-mara commented on July 21, 2024

Comments (7)

Dru-Mara commented on July 21, 2024

Hi Xikun,

First of all, thank you for the detailed bug report; it's very easy to follow. Regarding the errors/warnings you are encountering:

Error 1: This is indeed a bug in the example. We made some changes to the library and forgot to update this example accordingly, sorry about that. I'll push a fix for it in a few minutes. Your solution of removing that for loop is indeed correct :)

Error 2: This is also a bug. Your solution should work fine for py3; however, I'm not entirely sure it will on py2. I'll need a bit more time to look into it.

Error 3: Although it says Error, this should simply be a warning. It tells you that you have not selected a train/validation split, so the library will compute one for you. This train/validation split uses a fixed 90/10 ratio and otherwise the same parameters as your train/test split. It is needed in order to tune the hyperparameters of node2vec (as you can see, tune_params is set to tune p and q). In the next update, I will include an explicit train/validation edge split so the "error" disappears. I'll also make it a warning.
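To give a rough picture of what such an automatic 90/10 split amounts to, here is a minimal sketch in plain Python. It is not EvalNE's implementation: the function name `split_train_valid`, the `seed` argument and the toy edge list are assumptions made for illustration, and the sketch ignores anything else the library may do when it builds the split.

```python
import random

def split_train_valid(train_edges, train_frac=0.9, seed=0):
    """Shuffle the training edges and hold out a validation share.

    Hypothetical helper for illustration only, not part of EvalNE.
    """
    edges = list(train_edges)
    rng = random.Random(seed)      # fixed seed so the split is repeatable
    rng.shuffle(edges)
    cut = int(round(train_frac * len(edges)))
    return edges[:cut], edges[cut:]

# Toy data: 100 edges of a path graph
edges = [(i, i + 1) for i in range(100)]
tr, va = split_train_valid(edges)
print(len(tr), len(va))
```

The validation part is only used to pick hyperparameters such as p and q; the test edges stay untouched until the final evaluation.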

Warning 4: The warnings (which should be made a bit clearer) basically tell you the following:

WARNING:root:Output of method metapath2vec++ contains 2 more lines than expected. Will consider them part of the header and ignore them... Expected num_lines 703, obtained lines 705.

In this case, metapath2vec returned an embedding file with 705 lines. Of those, 703 lines were identified as embedding vectors corresponding to graph nodes (your graph contained 703 nodes after preprocessing). The remaining two lines were considered header lines and thus ignored by EvalNE. Most methods write header lines to their output files, so you will see that warning a lot. Finally, metapath does indeed return two header lines.
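The arithmetic behind that warning can be sketched as follows. This is only an illustration of the idea (lines in the file minus nodes in the graph), assuming one embedding row per node; the function name `count_header_lines` and the fake output are made up and this is not the library's own `infer_header`.

```python
def count_header_lines(lines, num_nodes):
    """Return how many leading lines to skip, assuming one row per node.

    Hypothetical helper for illustration only.
    """
    num_lines = sum(1 for _ in lines)
    extra = num_lines - num_nodes
    if extra < 0:
        # Fewer rows than nodes: some embeddings are missing
        raise ValueError(f"expected at least {num_nodes} lines, found {num_lines}")
    return extra

# Fake method output: 2 header lines followed by 703 embedding rows
fake_output = ["header 1", "header 2"] + [f"{i} 0.1 0.2" for i in range(703)]
skip = count_header_lines(fake_output, 703)
print(skip)  # 2
```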

WARNING:root:Output provided by method metapath2vec++ contains 129 columns, 128 expected! Taking first column as nodeID...

This warning tells you that the output embedding file of metapath contained one more column than expected. In this case, you asked for 128-dimensional embeddings but EvalNE found 129 columns in the file. The library will automatically take the first of those columns as the nodeID and the remaining ones as the actual embeddings of nodes. The reason for this warning is that NE methods behave in one of two ways. They return the embeddings either as:

NodeID0, x00, x01, ... x0d
NodeID1, x10, x11, ... x1d
...
NodeIDn, xn0, xn1, ... xnd

or as:

x00, x01, ... x0d
x10, x11, ... x1d
...
xn0, xn1, ... xnd

The warning basically tells you that metapath is a method that returns the data as in the first example and not as in the second one.
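A reader for both layouts could look roughly like the sketch below. It is an assumption-laden illustration, not EvalNE code: `parse_embedding_rows` is a made-up name, and for the layout without IDs the sketch assumes that row i belongs to node i, which the thread does not state.

```python
def parse_embedding_rows(rows, dim, delimiter=","):
    """Map node IDs to vectors, accepting rows with or without a leading ID.

    Hypothetical helper for illustration only.
    """
    embeddings = {}
    for index, row in enumerate(rows):
        fields = [f.strip() for f in row.split(delimiter)]
        if len(fields) == dim + 1:
            # One extra column: treat the first one as the node ID
            node_id, values = fields[0], fields[1:]
        elif len(fields) == dim:
            # No ID column: assume the row position is the node ID
            node_id, values = str(index), fields
        else:
            raise ValueError(f"row {index} has {len(fields)} columns, expected {dim} or {dim + 1}")
        embeddings[node_id] = [float(v) for v in values]
    return embeddings

with_ids = parse_embedding_rows(["7, 0.1, 0.2", "3, 0.3, 0.4"], dim=2)
without_ids = parse_embedding_rows(["0.1, 0.2", "0.3, 0.4"], dim=2)
print(sorted(with_ids), sorted(without_ids))
```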

Finally, the results you are getting seem correct to me. I hope this helps, and thanks again for pointing out those bugs!

Alex

XikunHuang commented on July 21, 2024

Hi, Alex
Thanks for your quick and detailed reply. It really helps!

I encounter a new error when I run examples/node2vec/conf_node2vec.ini
Describe the bug
FileNotFoundError: [Errno 2] No such file or directory: './emb.tmp'
OSError: Execution of method node2vec did not generate node embeddings file.
Possible reasons: 1) method is not correctly installed or 2) wrong method call or parameters...

To Reproduce

OS used: Ubuntu 18.04
EvalNE Version : 0.3.1
Snippet of code executed (for API) or conf file run (for CLI)
In file examples/node2vec/conf_node2vec.ini
- I have set the correct dataset paths and method paths
- I replaced EDGE_EMBEDDING_METHODS = average with EDGE_EMBEDDING_METHODS = hadamard
  Then I ran:
```
python3 evalne ./examples/node2vec/conf_node2vec.ini 
```
Full error output
This error is odd because "Repetition 0 of experiment" and "Repetition 1 of experiment" ran fine. The error only occurs during "Repetition 2 of experiment".

#################### Error message in file eval.log#############################

10-12-19 09:51:20 - INFO: ------ Repetition 2 of experiment ------
10-12-19 09:53:21 - WARNING: Output of method node2vec contains 1 more lines than expected. Will consider them part of the header and ignore them... Expected num_lines 4039, obtained lines 4040.
----------------------- many similar WARNING-----------------------------
10-12-19 10:00:55 - WARNING: Output provided by method node2vec contains 129 columns, 128 expected! Taking first column as nodeID...
10-12-19 10:01:00 - FileNotFoundError: [Errno 2] No such file or directory: './emb.tmp'
Traceback (most recent call last):
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/evaluation/evaluator.py", line 457, in _evaluate_ne_cmd
X = pp.read_node_embeddings(tmpemb, data_split.TG.nodes, self.dim, output_delim, method_name)
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/utils/preprocess.py", line 174, in read_node_embeddings
emb_skiprows = infer_header(input_path, len(nodes), method_name)
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/utils/preprocess.py", line 123, in infer_header
num_lines = sum(1 for _ in open(input_path))
FileNotFoundError: [Errno 2] No such file or directory: './emb.tmp'

During handling of the above exception, another exception occurred:

10-12-19 10:09:29 - WARNING: Output provided by method node2vec contains 129 columns, 128 expected! Taking first column as nodeID...
10-12-19 10:10:22 - ERROR: Exception occurred while evaluating param --p 2 --q 1 for method node2vec on Facebook.
Traceback (most recent call last):
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/evaluation/evaluator.py", line 457, in _evaluate_ne_cmd
X = pp.read_node_embeddings(tmpemb, data_split.TG.nodes, self.dim, output_delim, method_name)
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/utils/preprocess.py", line 174, in read_node_embeddings
emb_skiprows = infer_header(input_path, len(nodes), method_name)
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/utils/preprocess.py", line 123, in infer_header
num_lines = sum(1 for _ in open(input_path))
FileNotFoundError: [Errno 2] No such file or directory: './emb.tmp'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/evaluation/evaluator.py", line 313, in evaluate_cmd
write_weights, write_dir, verbose)
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/evaluation/evaluator.py", line 468, in _evaluate_ne_cmd
'\nSetting verbose=True can provide more information.'.format(method_name))
OSError: Execution of method node2vec did not generate node embeddings file.
Possible reasons: 1) method is not correctly installed or 2) wrong method call or parameters...
Setting verbose=True can provide more information.
10-12-19 10:11:12 - WARNING: Output of method node2vec contains 1 more lines than expected. Will consider them part of the header and ignore them... Expected num_lines 4039, obtained lines 4040.
----------------------- many similar WARNING-----------------------------
10-12-19 10:22:33 - WARNING: Output provided by method line contains 129 columns, 128 expected! Taking first column as nodeID...
10-12-19 10:22:37 - INFO: ====== Evaluating PPI network ======
10-12-19 10:22:38 - INFO: ------ Repetition 0 of experiment ------
10-12-19 10:24:26 - WARNING: Output of method node2vec contains 188 more lines than expected. Will consider them part of the header and ignore them... Expected num_lines 3852, obtained lines 4040.
10-12-19 10:24:26 - WARNING: Output provided by method node2vec contains 129 columns, 128 expected! Taking first column as nodeID...

#######################Error message in terminal########################
python3 ./node2vec_python3/src/main.py --input ./edgelist.tmp --output ./emb.tmp --dimensions 128 --walk-length 80 --num-walks 10 --window-size 10 --workers 8 --p 1 --q 1 --p 0.25 --q 0.25
Walk iteration:
1 / 10
2 / 10
3 / 10
4 / 10
5 / 10
6 / 10
7 / 10
8 / 10
9 / 10
10 / 10
Traceback (most recent call last):
File "/usr/lib/python3.6/runpy.py", line 193, in _run_module_as_main
"__main__", mod_spec)
File "/usr/lib/python3.6/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "evalne/main.py", line 329, in <module>
main()
File "evalne/main.py", line 42, in main
evaluate(setup)
File "evalne/main.py", line 148, in evaluate
lp_coef = eval_other(setup, nee, i, scoresheet, repeat, nw_outpath)
File "evalne/main.py", line 266, in eval_other
write_dir=setup.write_dir_other[j], verbose=setup.verbose)
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/evaluation/evaluator.py", line 313, in evaluate_cmd
write_weights, write_dir, verbose)
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/evaluation/evaluator.py", line 462, in _evaluate_ne_cmd
results.append(self.evaluate_ne(data_split=data_split, X=X, method=method_name, edge_embed_method=ee))
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/evaluation/evaluator.py", line 641, in evaluate_ne
tr_edge_embeds, te_edge_embeds = self.compute_ee(data_split, X, edge_embed_method)
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/evaluation/evaluator.py", line 677, in compute_ee
tr_edge_embeds = func(X, data_split.train_edges)
File "/home/huangxk/workspace_python/embedding/EvalNE/venv_for_evlne/lib/python3.6/site-packages/evalne-0.3.1-py3.6.egg/evalne/evaluation/edge_embeddings.py", line 59, in hadamard
edge_embeds[i] = X[str(edge[0])] * X[str(edge[1])]
KeyError: '1413'
Progress on lp task: 33%|███████████████████████████████████████████████▎ | 3/9 [1:17:39<2:35:18, 1553.13s/it]

By the way

Before testing the "hadamard" method, I tested the "average" method twice using conf_node2vec.ini. Strangely, I hit the above errors the first time and succeeded the second time, without changing anything.

Thanks again for your time.
Xikun

Dru-Mara commented on July 21, 2024

Hi,

I cannot replicate the error. Could you please try running the evaluation as shown below and let me know if it solves the issue?
python3 -m evalne ./examples/node2vec/conf_node2vec.ini

I found that without the -m parameter some strange errors might occur. Also, have you run multiple evaluations in the same directory? If so, that could explain the error you are getting. Or is it possible that the emb.tmp file generated by the library somehow got deleted?

Edit: Going over the log again I noticed a few things:

- The error is not related to EvalNE, but to node2vec.
- It seems that node2vec is not generating the correct embeddings file 'emb.tmp' and thus the library is not able to find it and read it.
- I noticed that you are running node2vec using python3. The original node2vec repo only mentions python2, so it is possible that the code is unstable when executed with python3 (especially the gensim library part). This might cause some executions to run successfully and others to fail. I would recommend creating a python2 virtualenv for node2vec and installing all the dependencies there. Then in the conf file you would just need to specify something like:
      python2 ./node2vec_python2/src/main.py .....
- Also, a quick explanation of what is going on in the conf file. The library will try to evaluate these methods:
      NAMES_OTHER = node2vec deepWalk line
  The command line calls corresponding to each method are, in order:
      METHODS_OTHER =
      python ../../../methods/node2vec/main.py ...
      python ../../../methods/node2vec/main.py ... --p 1 --q 1
      ../../../methods/LINE/linux/line ...
  Note that deepwalk is node2vec with p=1 and q=1, so we specify them directly in the second line of the METHODS_OTHER variable.
  For the first method listed in NAMES_OTHER, in this case node2vec, we will tune hyperparameters using grid search:
      TUNE_PARAMS_OTHER = --p 0.25 0.5 1 2 4 --q 0.25 0.5 1 2 4
  If there were a second line in TUNE_PARAMS_OTHER, it would be assumed to contain the parameters you want to tune for deepwalk. If there were a third line in the variable, it would be assumed to refer to LINE.
- Keep in mind that it's perfectly fine to call some methods with python2 and others with python3 in the EvalNE conf files.
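To make the grid search over p and q more concrete, the sketch below expands a TUNE_PARAMS_OTHER-style string into one flag string per combination, which could then be appended to the method's command line. This is an illustration only: `expand_grid` is a made-up helper, and whether EvalNE explores the full Cartesian product or tunes each parameter separately is not stated in this thread, so the sketch simply assumes the full product.

```python
import itertools

def expand_grid(tune_params):
    """Turn '--p 0.25 0.5 --q 1 2' into one flag string per combination.

    Hypothetical helper for illustration only, not part of EvalNE.
    Assumes the string starts with a '--name' token.
    """
    names, values = [], []
    for token in tune_params.split():
        if token.startswith("--"):
            names.append(token)        # new parameter name
            values.append([])
        else:
            values[-1].append(token)   # candidate value for the last name
    return [
        " ".join(f"{name} {value}" for name, value in zip(names, combo))
        for combo in itertools.product(*values)
    ]

grid = expand_grid("--p 0.25 0.5 1 2 4 --q 0.25 0.5 1 2 4")
print(len(grid))   # 25 combinations under the full-grid assumption
print(grid[0])     # --p 0.25 --q 0.25
```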

I've never tried to run the original node2vec code using py3, but I did have similar issues with other methods/libraries. Please, let me know if running the method with py2 solves the issues.
Alex

XikunHuang commented on July 21, 2024

Hi Alex,
Your answer really helps!

I will try running the evaluation with the -m parameter and tell you the result (it takes some time).
Yes, I did run multiple evaluations in the same directory when testing the "average" method. In directory EvalNE/, I ran two evaluations at the same time:
```
python3 evalne ./examples/node2vec/conf_node2vec.ini 
```
If I want to run two evaluations at the same time, I should run the commands in different directories, right?
i.e.
```
# In directory EvalNE/one_dir/
python3 -m evalne relative/path/to/conf_node2vec.ini

# In directory EvalNE/another_dir/
python3 -m evalne relative/path/to/conf_node2vec.ini
```
Yes, I am running node2vec using python3. I modified the original node2vec so that it works with py3, following this comment (aditya-grover/node2vec#35 (comment)).

> Keep in mind that it's perfectly fine to call some methods with python2 and others with python3 in the EvalNE conf files

This really helps! I will try to run the original node2vec using py2 and tell you the result later.

Thanks.
Xikun

Dru-Mara commented on July 21, 2024

Hello,

Yes, you should run the evaluations from different directories, as you mentioned. The library generates some temporary files which are used to communicate information to the methods it executes, so if several evaluations run in the same folder, the different processes will start interfering with each other's temporary files. I will change this behaviour in future versions of the library and make it possible to run many evaluations in the same folder.
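Until then, one way to keep concurrent runs apart is to launch each one from its own freshly created working directory. The sketch below is a generic Python pattern and not an EvalNE feature: `run_isolated` is a made-up helper, and the demo command is a stand-in that just writes `./emb.tmp` the way an embedding method would.

```python
import subprocess
import sys
import tempfile
from pathlib import Path

def run_isolated(cmd, prefix="evalne_run_"):
    """Run cmd in a freshly created directory and return that directory.

    Hypothetical helper for illustration only.
    """
    workdir = Path(tempfile.mkdtemp(prefix=prefix))
    # Relative paths such as ./emb.tmp now resolve inside workdir
    subprocess.run(cmd, cwd=workdir, check=True)
    return workdir

# Stand-in for a real evaluation: a command that writes ./emb.tmp
fake_eval = [sys.executable, "-c", "open('emb.tmp', 'w').write('demo')"]
first = run_isolated(fake_eval)
second = run_isolated(fake_eval)
print(first != second)  # each run gets its own emb.tmp
```

For a real run the command would be something like `[sys.executable, "-m", "evalne", "/absolute/path/to/conf_node2vec.ini"]`, where the path is a placeholder. Any relative dataset or method paths inside the conf file would need to be adjusted, since they would then be resolved from the new working directory.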

Alex

XikunHuang commented on July 21, 2024

Hi,

Running the evaluation with the -m parameter and py2 node2vec solved the issues.
Thanks for your help.

Xikun

Dru-Mara commented on July 21, 2024

Hi Xikun,

Great to hear! Since the original issue seems to be resolved I'll close it, but please let us know if you have any other issues or suggestions :)

Alex
