Code Monkey home page Code Monkey logo

arcprize's Introduction

Hi there ๐Ÿ‘‹ I'm Moon ๐ŸŒ™

Full-stack engineer, entrepreneur and product designer with a passion for building things that make a difference.

Website / Portfolio

Projects

Simian

  • https://simian.mov
  • Synthetic data generator for image and video generation
  • pip install simian

Bgent

  • https://bgent.org
  • why bgent? because 'agent' was already taken on npm`
  • npm install bgent

Character Studio

The Alliance

Upstreet

Magick

TinyAGI

Autocoder

agentmemory

easycompletion

Groups I Support

arcprize's People

Contributors

lalalune avatar pharaouk avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

arcprize's Issues

Refinement Network

Instead of trying to predict all of the tokens at once, we should predict some, keep the ones with high confident, and try again with a modify attention mask.

Data difficulty thresholding

The data needs to get harder incrementally. It's pointless to train a fresh model on something a 100 IQ person probably couldn't solve. Starting with ultra simple tasks until loss is down, and then introducing more and more complex tasks into the mix. Eventually taking out the tasks which are under the threshold difficulty of the real problem.

Add evals + wandb

We should evaluate every 100 epochs to get a sense of improvement

rope

sinusoidal position encoding has some flaws that rope could solve

WFC synthesis

Wave Function Collapse has a knowable pattern. Discovering this patten from the output image should teach our model the kind of program search we really want.

Add dropout

Right now we are training with no dropout, which is guaranteed to cause some trouble

Add additional data augmentation and visualize

We are kind of blindly just matrix transforming, color swapping and augmenting.

  • Implement visualization of current augmentation to verify that it makes sense
  • Add noise patterns underneath black as an additional augmentation time
  • Add some augmentation that replaces token 0 with one of the colors not used in the puzzle

Teacher Forcing

Review our teacher forcing strategy.

One idea that might be interesting is to set it to 1 - loss. So we force until out loss below 0, then start to back off. By the end the model shouldn't care about sequence order.

data leakage?

src = src.to(device)
output = model(src)
target = src[:, model.num_context_tokens:].reshape(-1)
loss = criterion(output.view(-1, num_tokens + 1), target.view(-1))

Your model looks at the entire source?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.