
neuralturingmachine's People

Contributors

joeran, markpkcollier, reneraab


neuralturingmachine's Issues

replicating results

Hi!

I have been trying to replicate the copy-task results from your paper; however, after running python3 run_tasks.py --experiment_name=trial01 the copy task does not seem to converge.

I obtained an error of 31.89375 per sequence and it gave the following results:
[attached results plot: trial01_20_0]

I have now run it multiple times, but I always get similar results (I believe you ran each experiment 10 times and plotted the mean).

I checked the default parameters, and everything seems to be in accordance with your paper.

Is there something that I am missing?

By the way, great job with the project :-)!

recurrence on all state items?

NeuralTuringMachine/ntm.py, lines 106 to 108 (commit 5644e3b):

    return NTM_output, NTMControllerState(
        controller_state=controller_state, read_vector_list=read_vector_list, w_list=w_list, M=M)

It looks like there is a recurrence happening on all elements in the state. Wouldn't you need to apply tf.stop_gradient() to each of these items (except controller_state) to be consistent with the paper? At the very least, there should be no recurrence on the memory M, right?
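
For reference, a minimal sketch (not the repository's actual code) of the change this question suggests: keep the recurrence through controller_state, but stop gradients from flowing back through the other state items carried to the next time step.

    # Hypothetical variant of the return statement quoted above (inside the
    # cell's call method). Gradients still flow through controller_state, but
    # not through the read vectors, the addressing weights, or the memory M.
    return NTM_output, NTMControllerState(
        controller_state=controller_state,
        read_vector_list=[tf.stop_gradient(r) for r in read_vector_list],
        w_list=[tf.stop_gradient(w) for w in w_list],
        M=tf.stop_gradient(M))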

'Tuple unpacking is not supported in Python 3' error

Hi @MarkPKCollier,

I wanted to run the code using Python 3.5 in PyCharm, but an error occurred in the file 'Ncross_analysis.py':

    C:\Users\Eduline\PycharmProjects\python\venv\Scripts\python.exe C:/Users/Eduline/Desktop/GEOMETRİ1/Ncross_analysis.py
      File "", line 15
        def getLine_pixelPoint((x1, y1), (x2, y2)):
                               ^
    SyntaxError: invalid syntax
and it seems the error occurs because tuple unpacking is not supported in Python 3.

Could you please help me solve this problem? How can I change the code shown in the photo so that it will not cause this error?
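
For reference, a Python 3-compatible rewrite of the function signature shown in the traceback might look like the sketch below. Only the signature appears in the error, so the body here is a placeholder; the rest of the original function can stay unchanged.

    # Python 3 removed tuple parameters (PEP 3113), so each point must be
    # accepted as a single argument and unpacked inside the function.
    def getLine_pixelPoint(p1, p2):
        (x1, y1) = p1  # first point
        (x2, y2) = p2  # second point
        # ... original function body goes here, unchanged ...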

Have a good day

'Tuple unpacking is not supported in Python 3' error

Hi @MarkPKCollier

I wanted to run the code using Python 3.6, but an error occurs in the file 'generate_data.py' at line 263:

File "D:\...\NeuralTuringMachine-master\generate_data.py", line 263
    outputs = map(lambda (source, dest, edge):
                         ^
SyntaxError: invalid syntax

and it seems that the error occurs because tuple unpacking is not supported in Python 3.

What can I do? Switch to Python 2?
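
A possible fix (rather than switching to Python 2) is to unpack the tuple inside the lambda body, or to replace the lambda with a generator expression. In the sketch below, process and edges are hypothetical placeholders for whatever the real map call uses, since only the start of the line appears in the error:

    # Option 1: index into the tuple inside the lambda.
    outputs = map(lambda sde: process(sde[0], sde[1], sde[2]), edges)

    # Option 2 (often clearer): unpack in a generator expression.
    outputs = (process(source, dest, edge) for (source, dest, edge) in edges)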

Thank you.

Sorting Task

Hi @MarkPKCollier

Thanks for your awesome implementation and making the code available in such a well packaged form!

You mentioned in the paper that you tried 3 of the 5 experiments. I was wondering whether you were also able to reproduce the other 2 experiments (in particular the sorting experiment), and whether you can share some of your thoughts on those as well.

Thanks

Could you add some examples?

I am really struggling to understand how to integrate your code into my TF/Keras projects. Could you provide some basic examples showing how to compile and train a network?
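
For what it's worth, the cell defined in ntm.py returns an (output, state) pair in the style of a TensorFlow 1.x RNNCell (see the snippet quoted earlier on this page), so one way to use it is to wrap it in tf.nn.dynamic_rnn. The sketch below is only illustrative: the class name NTMCell and every constructor argument are assumptions, so check ntm.py and run_tasks.py for the real signature and a full training setup.

    import tensorflow as tf
    from ntm import NTMCell  # class name assumed; check ntm.py

    batch_size, max_time, input_dim, output_dim = 32, 21, 9, 8  # example sizes only

    # Constructor arguments are illustrative guesses, not the verified signature.
    cell = NTMCell(controller_layers=1, controller_units=100,
                   memory_size=128, memory_vector_dim=20,
                   read_head_num=1, write_head_num=1,
                   output_dim=output_dim)

    inputs = tf.placeholder(tf.float32, [batch_size, max_time, input_dim])
    targets = tf.placeholder(tf.float32, [batch_size, max_time, output_dim])

    # dynamic_rnn unrolls the cell over time; the cell's outputs are treated as logits.
    logits, _ = tf.nn.dynamic_rnn(cell, inputs, dtype=tf.float32)
    loss = tf.reduce_mean(
        tf.nn.sigmoid_cross_entropy_with_logits(labels=targets, logits=logits))
    train_op = tf.train.AdamOptimizer(1e-4).minimize(loss)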

Implementing details of copy and repeated copy tasks

Say we are considering the copy task and we want to give a sequence of length 10 to our model. Following the code, we actually give our controller a sequence of length 21, where the 11th element is an end-of-sequence marker (a vector of 1s) and the last 10 elements are vectors of 0s. Where does the code specify that the output should start at the end-of-sequence marker?

Now, for the repeated copy task, it doesn't seem like we should provide the target output size the same way we do in the copy task. For example, given a sequence of length 10 that is to be copied 2 times, it wouldn't make sense to give as input a sequence of 31 vectors since we want the model to learn when to stop copying. So how should the input be structured in this case?
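
For reference, a minimal sketch (using numpy; the sizes and marker convention follow the description above, not necessarily the repo's exact generate_data.py) of how a single copy-task example can be laid out. The output "starts" at the marker simply because the loss is typically evaluated only on the time steps after it:

    import numpy as np

    seq_len, num_bits = 10, 8  # example sizes; both values are assumptions

    seq = np.random.randint(0, 2, size=(seq_len, num_bits)).astype(np.float32)
    eos = np.ones((1, num_bits), dtype=np.float32)          # end-of-sequence marker (all 1s)
    pad = np.zeros((seq_len, num_bits), dtype=np.float32)   # blank inputs while the model answers

    inputs = np.concatenate([seq, eos, pad], axis=0)         # length 2 * seq_len + 1
    targets = np.concatenate(
        [np.zeros((seq_len + 1, num_bits), dtype=np.float32), seq], axis=0)

    # The loss is usually masked so that only the last seq_len time steps (the
    # part after the marker) contribute, which ties the output to the marker.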

Problem on associative recall

Using the NTM with the default settings works fine on the copy task, but it doesn't work on the associative recall task.

I ran it for more than 20,000 training steps (about 5 hours on 1 GPU), and the sequence error doesn't drop.

Does the associative recall task need different settings?

Where can I find the formulas that generate the head parameters?

Sorry for the silly question. I am an engineer, not a researcher (I don't have a strong background in theory and math).
I am trying to apply this external memory idea in my work. I have read the original NTM paper (arXiv:1410.5401) and your paper, but I can't find the formulas that generate these head parameters:
k, beta, g, s, gamma.

Am I missing something, or is there some common knowledge about how this kind of parameter is produced?
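
For reference, in most NTM implementations the head parameters are just slices of the controller's output passed through activations that enforce each parameter's allowed range (the original paper describes the ranges but leaves the exact squashing functions open). Below is a hedged sketch in TensorFlow 1.x style; the sizes are example hyperparameters, not values taken from this repo:

    import tensorflow as tf

    memory_vector_dim, shift_range, batch_size = 20, 1, 16   # example hyperparameters
    num_params = memory_vector_dim + 1 + 1 + (2 * shift_range + 1) + 1

    # Stand-in for one head's slice of the controller output.
    head_params = tf.zeros([batch_size, num_params])

    k, beta, g, s, gamma = tf.split(
        head_params, [memory_vector_dim, 1, 1, 2 * shift_range + 1, 1], axis=1)

    k = tf.tanh(k)                        # key vector (bounded)
    beta = tf.nn.softplus(beta)           # key strength, >= 0
    g = tf.sigmoid(g)                     # interpolation gate, in (0, 1)
    s = tf.nn.softmax(s, axis=1)          # shift weighting, a distribution over shifts
    gamma = 1.0 + tf.nn.softplus(gamma)   # sharpening factor, >= 1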
