ShakespeareToTrumpNLP

Files

The main results are in src/model_hw2/data/t_to_s/trump_fake_2

The model that produced these results is src/model_hw2/saved_models/model_TS_2

.
├── data
│    ├── shakespeare  
│    │   ├── alllines.txt.txt                         # All Shakespeare Lines 
│    │   ├── clean_modern.txt                         # Modern Shakespeare lines before tokenizing
│    │   ├── clean_original.txt                       # Original Shakespeare lines before tokenizing
│    │   └── processed
│    │        ├── all_proc.txt                        # All Lines after cleaning and tokenizing
│    │        ├── modern_proc.txt                     # Modern Shakespeare lines after tokenizing
│    │        └── original_proc.txt                   # Original Shakespeare lines after tokenizing
│    └── trump
│         ├── speeches
│         │    ├── cln_speech.txt
│         │    └── speech.txt
│         └── tweets
│              ├── cln_tweets.txt
│              ├── tweets.json.tx
│              └── tweets_01-08-2021.csv.txt
└── src
     ├── baseline 
     │    └── baseline.py                              # Creates baseline model
     ├── cleaning   
     │    ├── shake_cleaning.py                        # Cleans and tokenizes Shakespeare lines
     │    └── trump_cleaning.py                        # Cleans and tokenizes Trump tweets and speeches
     └── model_hw2
         ├── data
         │    ├── s_to_t
         │    │    ├── shake_fake_1                     # Pseudo Shakespeare data produced by IBT
         │    │    ├── shake_fake_2                     # Pseudo Shakespeare data produced by IBT
         │    │    └── shake_fake_3                     # Pseudo Shakespeare data produced by IBT
         │    ├── t_to_s
         │    │    ├── trump_fake_1                     # Pseudo Trump data produced by IBT
         │    │    ├── trump_fake_2                     # Pseudo Trump data produced by IBT
         │    │    └── trump_fake_3                     # Pseudo Trump data produced by IBT
         │    ├── temp_combine.txt                      # Temporary total combined data of (source, target) (100%)
         │    ├── temp_dev.txt                          # Temporary combined dev data of (source, target) (20%)
         │    ├── temp_train.txt                        # Temporary combined training data of (source, target) (80%)
         │    └── truth
         │        ├── all_proc.txt                     # Copied from data/shakespeare/processed/
         │        ├── modern_proc.txt                  # Copied from data/shakespeare/processed/
         │        ├── original_proc.txt                # Copied from data/shakespeare/processed/
         │        └── trump.txt                        # Combined Trump tweets and speeches
         ├── driver.py                                 # Main file to run IBT process
         ├── layers.py                                 # HW2 Layers for NMT
         ├── output.txt                                # Trainer output for some of the epochs
         ├── transformer.py                            # Modified HW2 Transformer for GPU
         └── saved_models   
             ├── model_ST_1                            # IBT Process Shakespeare to Trump Model --> first
             ├── model_ST_2                            # IBT Process Shakespeare to Trump Model --> best
             ├── model_ST_3                            # IBT Process Shakespeare to Trump Model --> diverge
             ├── model_TS_1                            # IBT Process Trump to Shakespeare Model --> seed
             ├── model_TS_2                            # IBT Process Trump to Shakespeare Model --> best
             └── model_TS_3                            # IBT Process Trump to Shakespeare Model --> seed
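
The temp_combine.txt, temp_train.txt and temp_dev.txt entries above describe a combined set of (source, target) pairs divided into a larger and a smaller share. A minimal sketch of such a split follows. It is not taken from driver.py: the function name split_pairs, the tab-separated pair format, the fixed seed, and the choice to give the 80% share to training are all assumptions made for illustration.

```python
import random


def split_pairs(lines, train_frac=0.8, seed=0):
    """Shuffle (source, target) lines and split them into train and dev lists.

    Hypothetical helper: the real split in driver.py may differ.
    """
    if not 0.0 < train_frac < 1.0:
        raise ValueError("train_frac must be strictly between 0 and 1")
    shuffled = list(lines)                 # copy so the caller's list is untouched
    random.Random(seed).shuffle(shuffled)  # seeded shuffle for reproducibility
    cut = int(len(shuffled) * train_frac)  # index separating train from dev
    return shuffled[:cut], shuffled[cut:]


if __name__ == "__main__":
    # Assumed format: one "source<TAB>target" pair per line.
    pairs = [f"source {i}\ttarget {i}" for i in range(10)]
    train, dev = split_pairs(pairs)
    print(len(train), len(dev))
```

Every input line ends up in exactly one of the two lists, so writing them out would give files that together cover the combined data.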

Instructions

To start the iterative back-translation (IBT) process, run: python3 src/model_hw2/driver.py

Contributors

billyporter
