chengstone / cchess-zero Goto Github PK

View Code? Open in Web Editor NEW

464.0 18.0 165.0 3.28 MB

AlphaZero implemented Chinese chess. AlphaGo Zero / AlphaZero实践项目，实现**象棋。

License: MIT License

Python 60.24% Jupyter Notebook 39.76%

cchess-zero's Introduction

cchess-zero

AlphaZero implemented Chinese chess. AlphaGo Zero / AlphaZero实践项目，实现**象棋。

Author chengstone

e-Mail [email protected]

代码详解请参见文内jupyter notebook和↓↓↓

知乎专栏：https://zhuanlan.zhihu.com/p/34433581

博客：http://blog.csdn.net/chengcheng1394/article/details/79526474

欢迎转发扩散 ^_^

这是一个AlphaZero的实践项目，实现了一个**象棋程序，使用TensorFlow1.0和Python 3.5开发，还要安装uvloop。

因为我的模型训练的不充分，只训练了不到4K次，模型刚刚学会用象和士防守，总之仍然下棋很烂。

如果您有条件可以再多训练试试，我自从收到信用卡扣款400美元通知以后就把aws下线了：D 贫穷限制了我的想象力O(∩_∩)O

我训练的模型文件下载地址：https://pan.baidu.com/s/1dLvxFFpeWZK-aZ2Koewrvg

解压后放到项目根目录下即可，文件夹名叫做gpu_models

现在介绍下命令如何使用：

命令分为两类，一类是训练，一类是下棋。

训练专用：

--mode 指定是训练（train）还是下棋（play），默认是训练
--train_playout 指定MCTS的模拟次数，论文中是1600，我做训练时使用1200
--batch_size 指定训练数据达到多少时开始训练，默认512
--search_threads 指定执行MCTS时的线程个数，默认16
--processor 指定是使用cpu还是gpu，默认是cpu
--num_gpus 指定gpu的个数，默认是1
--res_block_nums 指定残差块的层数，论文中是19或39层，我默认是7

下棋专用：

--ai_count 指定ai的个数，1是人机对战，2是看两个ai下棋
--ai_function 指定ai的下棋方法，是思考（mcts，会慢），还是直觉（net，下棋快）
--play_playout 指定ai进行MCTS的模拟次数
--delay和--end_delay默认就好，两个ai下棋太快，就不知道俩ai怎么下的了：）
--human_color 指定人类棋手的颜色，w是先手，b是后手

训练命令举例：

python main.py --mode train --train_playout 1200 --batch_size 512 --search_threads 16 --processor gpu --num_gpus 2 --res_block_nums 7

下棋命令举例：

python main.py --mode play --ai_count 1 --ai_function mcts --play_playout 1200 --human_color w

许可

Licensed under the MIT License with the 996ICU License.

cchess-zero's People

Contributors

Stargazers

Watchers

Forkers

liviclee xiaoanshi femto yhcharles aspire130 shbfeng zhuzhenping zhengzhengzhj telesoho sduxzh yutaoxxx liucunyi1893 zqx15920 liujh168 jedz5 guoshengxu virtualpeer wplo muyili addingding mrhaozi soongxueyong lianpx dreamscape9999 jackylee1 schabc shubhampachori12110095 eugeneyilia jingruilea lostevangel dhuer jeffli678 1715509415 kkshsh eaglezzb feifeiloveu roughsoft yeahlouis ktpdpro jfcai2008 oy520 adkevinchang x12311231 johnparkerlee kelvict right0808 thurday yumjelly rie-long ianfan jamp78 lucianzhong wy-mazon gjlper ma-chengcheng xiaogaogaoxiao argog vizxu cccxy5 linzmin1927 tomas89wang willsmith2016 yclxf z475546259 xaoyaoo magic7 sherdencooper ylzf0000 xiecunjing zbw7415604 teaandzebra phenylazide dshwei xiniuniu clickme8 njustlcx cysun0226 zhuhaipeng7 chensul yaojh01 wsccurry wenhuacao xiaobin0860 citymap yueshenghou lingxuehan riseinrose michaelnix guoyi0 skyseezhang321 sonivre limeiyang dxxhjk sy15120 maxwellchiang zhengyajun0620 wwrssg jiankehtt stevetsin liuansheng0312

cchess-zero's Issues

如果我想自己打造一个自己的alphazero**象棋，该怎么入手。

怎麼辦

kenny@wade-X750JB:~/Desktop/cchess-zero$ python2 main.py
File "main.py", line 261
def Q(self, move) -> float:
^
SyntaxError: invalid syntax

帮助翻译成英文，以便机器学习的非中文爱好者阅读

Help translate into English for non-Chinese enthusiasts of machine learning to read

[X] - add README.en.md for English
[X] - translate cchess-zero.ipynb
[X] - translate Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm.ipynb

I did not remove the Chinese text in case you want to merge. It is up to you if you want to merge but I will leave this here.
如果你想合并，我没有删除中文文本。如果你想合并取决于你，但我会把它留在这里。

绘图时出错

运行下棋指令时，报错：_tkinter.TclError: no display name and no $DISPLAY environment variable
请问您有遇到过吗？

cpu和gpu用不满改哪里

我在我的5900x+rtx4090上跑起来了，可是cpu和gpu利用率都很低。如何增加cpu并发核心数量以及gpu利用率，加快下棋速度。
我的是ubuntu22.04版本。

系统是ubuntu18.04
以下是错误讯息：
(venv) kenny@wade-X750JB:~/Desktop/cchess-zero$ python main.py --mode play --ai_count 2 --ai_function mcts --play_playout 1200
Loading Models...
2018-08-29 15:55:02.524177: I tensorflow/core/platform/cpu_feature_guard.cc:141] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
WARNING:tensorflow:From /home/kenny/Desktop/cchess-zero/policy_value_network.py:78: softmax_cross_entropy_with_logits (from tensorflow.python.ops.nn_ops) is deprecated and will be removed in a future version.
Instructions for updating:

Future major versions of TensorFlow will allow gradients to flow
into the labels input on backprop by default.

See @{tf.nn.softmax_cross_entropy_with_logits_v2}.

Traceback (most recent call last):
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1278, in _do_call
return fn(*args)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1263, in _run_fn
options, feed_dict, fetch_list, target_list, run_metadata)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1350, in _call_tf_sessionrun
run_metadata)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Assign requires shapes of both tensors to match. lhs shape= [1] rhs shape= [128]
[[Node: save/Assign_66 = Assign[T=DT_FLOAT, _class=["loc:@conv2d_16/bias"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:CPU:0"](conv2d_16/bias, save/RestoreV2:66)]]

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/training/saver.py", line 1725, in restore
{self.saver_def.filename_tensor_name: save_path})
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 877, in run
run_metadata_ptr)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1100, in _run
feed_dict_tensor, options, run_metadata)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1272, in _do_run
run_metadata)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1291, in _do_call
raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Assign requires shapes of both tensors to match. lhs shape= [1] rhs shape= [128]
[[Node: save/Assign_66 = Assign[T=DT_FLOAT, _class=["loc:@conv2d_16/bias"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:CPU:0"](conv2d_16/bias, save/RestoreV2:66)]]

Caused by op 'save/Assign_66', defined at:
File "main.py", line 1583, in
args.search_threads, args.processor, args.num_gpus, args.res_block_nums, args.human_color) # * args.num_gpus
File "/home/kenny/Desktop/cchess-zero/ChessGame.py", line 50, in init
processor=processor, num_gpus=num_gpus, res_block_nums=res_block_nums, human_color=human_color)
File "/home/kenny/Desktop/cchess-zero/main.py", line 1142, in init
self.policy_value_netowrk = policy_value_network(res_block_nums) if processor == 'cpu' else policy_value_network_gpus(num_gpus, res_block_nums)
File "/home/kenny/Desktop/cchess-zero/policy_value_network.py", line 148, in init
self.saver = tf.train.Saver()
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/training/saver.py", line 1281, in init
self.build()
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/training/saver.py", line 1293, in build
self._build(self._filename, build_save=True, build_restore=True)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/training/saver.py", line 1330, in _build
build_save=build_save, build_restore=build_restore)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/training/saver.py", line 778, in _build_internal
restore_sequentially, reshape)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/training/saver.py", line 419, in _AddRestoreOps
assign_ops.append(saveable.restore(saveable_tensors, shapes))
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/training/saver.py", line 112, in restore
self.op.get_shape().is_fully_defined())
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/ops/state_ops.py", line 216, in assign
validate_shape=validate_shape)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/ops/gen_state_ops.py", line 60, in assign
use_locking=use_locking, name=name)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/framework/op_def_library.py", line 787, in _apply_op_helper
op_def=op_def)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/util/deprecation.py", line 454, in new_func
return func(*args, **kwargs)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 3155, in create_op
op_def=op_def)
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/framework/ops.py", line 1717, in init
self._traceback = tf_stack.extract_stack()

InvalidArgumentError (see above for traceback): Assign requires shapes of both tensors to match. lhs shape= [1] rhs shape= [128]
[[Node: save/Assign_66 = Assign[T=DT_FLOAT, _class=["loc:@conv2d_16/bias"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:CPU:0"](conv2d_16/bias, save/RestoreV2:66)]]

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "main.py", line 1583, in
args.search_threads, args.processor, args.num_gpus, args.res_block_nums, args.human_color) # * args.num_gpus
File "/home/kenny/Desktop/cchess-zero/ChessGame.py", line 50, in init
processor=processor, num_gpus=num_gpus, res_block_nums=res_block_nums, human_color=human_color)
File "/home/kenny/Desktop/cchess-zero/main.py", line 1142, in init
self.policy_value_netowrk = policy_value_network(res_block_nums) if processor == 'cpu' else policy_value_network_gpus(num_gpus, res_block_nums)
File "/home/kenny/Desktop/cchess-zero/policy_value_network.py", line 149, in init
self.train_restore()
File "/home/kenny/Desktop/cchess-zero/policy_value_network.py", line 170, in train_restore
self.saver.restore(self.sess, tf.train.latest_checkpoint(self.save_dir))
File "/home/kenny/tensorflow/venv/lib/python3.6/site-packages/tensorflow/python/training/saver.py", line 1759, in restore
err, "a mismatch between the current graph and the graph")
tensorflow.python.framework.errors_impl.InvalidArgumentError: Restoring from checkpoint failed. This is most likely due to a mismatch between the current graph and the graph from the checkpoint. Please ensure that you have not altered the graph expected based on the checkpoint. Original error:

Assign requires shapes of both tensors to match. lhs shape= [1] rhs shape= [128]
[[Node: save/Assign_66 = Assign[T=DT_FLOAT, _class=["loc:@conv2d_16/bias"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:CPU:0"](conv2d_16/bias, save/RestoreV2:66)]]

InvalidArgumentError (see above for traceback): Restoring from checkpoint failed. This is most likely due to a mismatch between the current graph and the graph from the checkpoint. Please ensure that you have not altered the graph expected based on the checkpoint. Original error:

(venv) kenny@wade-X750JB:~/Desktop/cchess-zero$

little problem

您好，感谢您的分享现在想问您关于ma.py的程序中
if (x if abs(dx) ==1 else x+dx/2, y if abs(dy) ==1 else y+ (dy/2)) in board.pieces:
#print 'blocked'
return False
这块是什么意思呢

你好，我restore您百度网盘的权重时报错

NotFoundError (see above for traceback): Key BatchNorm/moving_mean not found in checkpoint
[[Node: save/RestoreV2 = RestoreV2[dtypes=[DT_FLOAT, DT_FLOAT, DT_FLOAT, DT_FLOAT, DT_FLOAT, ..., DT_FLOAT, DT_FLOAT, DT_FLOAT, DT_FLOAT, DT_INT32], _device="/job:localhost/replica:0/task:0/device:CPU:0"](_arg_save/Const_0_0, save/RestoreV2/tensor_names, save/RestoreV2/shape_and_slices)]]

能标注一下tensorflow的版本吗？？

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.