PBVI_verilog

homework for DI

Implementation

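Implements PBVI with 2 states (Left, Right), 3 actions (Open-left, Open-right, Listen), and 16 belief points (stored as fixed-point numbers, at $1/32, 3/32, \dots, 31/32$), with a decision depth of 5.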
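The fixed-point belief grid can be sketched in a few lines of Python as a software reference. This is only an illustration: the 5-bit fractional width, the names `FRAC_BITS`, `ONE`, `belief_points`, and `to_float`, and the reading of each stored value as the probability of the Left state are assumptions, not details taken from the Verilog source.

```python
# Assumption: a belief is an unsigned fixed-point number with FRAC_BITS
# fractional bits, so the integer 32 stands for probability 1.0.
FRAC_BITS = 5
ONE = 1 << FRAC_BITS  # 32


def belief_points(n_points=16):
    """Return the odd numerators 1, 3, ..., 2*n_points - 1."""
    return [2 * k + 1 for k in range(n_points)]


def to_float(fixed):
    """Convert a fixed-point belief back to a real-valued probability."""
    return fixed / ONE


points = belief_points()
# Assumption: the stored value is b(Left); b(Right) is its complement.
pairs = [(p, ONE - p) for p in points]
```

With 16 points the denominator is 32, so every belief fits in 5 bits and the complement `ONE - p` needs no division.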

Possible extensions: increase the number of belief points, or increase the decision depth.

Division of work

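The code is implemented first, with the work divided according to the steps of the solution procedure. One person then implements the final integration.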

Step 1

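Construct the intermediate vectors $\alpha_i^{a,o}(s)$ for each action-observation pair:

$$ \Gamma^{a,o} \leftarrow \alpha_i^{a,o}(s) = \gamma\sum_{s'\in S}T(s,a,s')\Omega(o,s',a)\alpha_i'(s') $$

A possible concrete implementation: 8 parallel computation groups, each performing a two-term multiply-accumulate for the 16 belief points.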
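A floating-point software reference for Step 1 might look like the sketch below. The function name `backproject`, the nested-list layout of `T` and `Omega`, the discount value, and the "listen" model numbers (identity transition, 0.85 observation accuracy) are all assumptions made for illustration; the README does not specify them.

```python
GAMMA = 0.95  # assumed discount factor; the README gives no value


def backproject(alpha_prev, T, Omega, gamma=GAMMA):
    """Return {(a, o): [alpha_i^{a,o}, ...]}, each vector indexed by state s.

    T[a][s][s2]       : transition probability T(s, a, s')
    Omega[a][s2][o]   : observation probability Omega(o, s', a)
    alpha_prev[i][s2] : previous alpha vectors alpha'_i(s')
    """
    n_states = len(alpha_prev[0])
    result = {}
    for a in T:
        n_obs = len(Omega[a][0])
        for o in range(n_obs):
            result[(a, o)] = [
                [
                    gamma * sum(
                        T[a][s][s2] * Omega[a][s2][o] * alpha[s2]
                        for s2 in range(n_states)
                    )
                    for s in range(n_states)
                ]
                for alpha in alpha_prev
            ]
    return result


# Hypothetical model for a single "listen" action (values are assumptions).
T = {"listen": [[1.0, 0.0], [0.0, 1.0]]}
Omega = {"listen": [[0.85, 0.15], [0.15, 0.85]]}
gamma_ao = backproject([[1.0, 0.0]], T, Omega, gamma=1.0)
```

With two states, the inner sum is exactly the two-term multiply-accumulate mentioned above, evaluated once per state and per previous vector.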

Step 2

Simulate the decision at the observation points, reducing the decision to the action level:

$$ i(b,o) = \arg\max_{i}\left(\{\alpha^{a,o}_i(s_0),\alpha^{a,o}_i(s_1)\}\cdot b\right) $$

$$ \Gamma^a_b = \{R(s_0,a) + \sum_{o\in O}\alpha_{i(b,o)}^{a,o}(s_0),\; R(s_1,a) + \sum_{o\in O}\alpha_{i(b,o)}^{a,o}(s_1)\} $$
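The two formulas of Step 2 can be checked against a small Python sketch. The names `dot` and `action_vector`, the dictionary layout of `gamma_ao`, and the numbers in the example are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))


def action_vector(b, reward_a, gamma_ao):
    """Compute Gamma^a_b for one fixed action a.

    b        : belief over states, e.g. [b(s0), b(s1)]
    reward_a : [R(s0, a), R(s1, a)]
    gamma_ao : {o: [alpha_i^{a,o}, ...]} candidates per observation
    """
    total = list(reward_a)
    for candidates in gamma_ao.values():
        # i(b, o): the candidate with the largest value at belief b
        best = max(candidates, key=lambda alpha: dot(alpha, b))
        total = [t + x for t, x in zip(total, best)]
    return total


# Made-up numbers for illustration only.
b = [0.5, 0.5]
example = action_vector(
    b,
    reward_a=[-1.0, -1.0],
    gamma_ao={0: [[1.0, 0.0], [0.0, 2.0]], 1: [[3.0, 0.0], [0.0, 1.0]]},
)
```

Here observation 0 selects `[0.0, 2.0]` and observation 1 selects `[3.0, 0.0]`, so `example` is `[2.0, 1.0]` after the reward is added.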

Step 3

Given the $\Gamma^a_b$ computed so far, choose the best action and obtain the final $\alpha_i(s)$:

$$ a(b) = \arg\max_{a\in A}(\Gamma^a_b\cdot b) $$

$$ \alpha_i(s) = \Gamma^{a(b)}_b(i,s) $$
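Step 3 reduces to a single arg-max per belief point, as in the following minimal sketch. The function name `select_action`, the action labels, and the vector values are assumptions used for illustration.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))


def select_action(b, gamma_b):
    """Return (a(b), alpha) where alpha = Gamma^{a(b)}_b.

    gamma_b : {action: Gamma^a_b vector} for one belief point b
    """
    best_a = max(gamma_b, key=lambda a: dot(gamma_b[a], b))
    return best_a, gamma_b[best_a]


# Made-up vectors for illustration only.
b = [0.25, 0.75]
action, alpha = select_action(
    b,
    {
        "listen": [1.0, 1.0],
        "open-left": [4.0, 0.0],
        "open-right": [0.0, 2.0],
    },
)
```

The values at `b` are 1.0, 1.0, and 1.5 respectively, so `"open-right"` is selected and its vector becomes the new alpha vector for this belief point.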

Step 4

Run a simulated decision process based on the final results at the observation points.
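One way to read Step 4 is as a loop that picks the action of the best alpha vector at the current belief and then applies a standard Bayesian belief update. The sketch below shows those two pieces under that assumption; the names `choose_action` and `update_belief`, the pairing of each alpha vector with an action, and the "listen" model values are hypothetical and not taken from the README.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))


def choose_action(b, alphas):
    """alphas is a list of (action, alpha_vector) pairs from the last iteration."""
    action, _ = max(alphas, key=lambda pair: dot(pair[1], b))
    return action


def update_belief(b, a, o, T, Omega):
    """Bayes update: b'(s') is proportional to Omega(o, s', a) * sum_s T(s, a, s') b(s).

    Assumes the observation o has non-zero probability under b and a.
    """
    n = len(b)
    unnorm = [
        Omega[a][s2][o] * sum(T[a][s][s2] * b[s] for s in range(n))
        for s2 in range(n)
    ]
    norm = sum(unnorm)
    return [u / norm for u in unnorm]


# Hypothetical "listen" model (values are assumptions).
T = {"listen": [[1.0, 0.0], [0.0, 1.0]]}
Omega = {"listen": [[0.85, 0.15], [0.15, 0.85]]}
b0 = [0.5, 0.5]
a0 = choose_action(b0, [("listen", [1.0, 1.0]), ("open-left", [-5.0, 2.0])])
b1 = update_belief(b0, a0, 0, T, Omega)
```

In hardware the updated belief would have to be snapped back onto the fixed-point grid; that quantization step is left out here.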

Integration

Connect the outputs of all steps and build the test framework.
