Code Monkey home page Code Monkey logo

Hi there πŸ‘‹

I am Jude Capachietti. I am a creative data scientist and design thinker with experience in python, statistical analysis, machine learning, and time series analysis. With a background in math education and non-profit management, I use data storytelling to help make numbers, data, and abstract ideas make sense to people and organizations so that they can make informed data-driven decisions.

Highlighted Projects

I built this webapp using Flask as a webapp server; python, numpy, pytorch for machine learning; and as a way to showcase my programming abilities. I have a text analysis classifier which shows metrics on the tone of text. These are whether the tone sounds positive or negative, whether the gender of the author sounds male, or female, and the age it sounds like the author is. I built the classiers using both Deep Averaging Neural Networks and using a Random Forest, and trained them on the Blog Authorship Corpus. I am happy with the way this classifier works, but I believe it tends to be overconfident. In the future, I plan to expand the text data I use to train these classifiers and also not to overfit the classifiers to the dataset as well. I believe they currently may be a little overfit to these specific text examples. Additionally, I have a name gender classifier. It is under portfolio, and "Name Gender Classifier." It estimates the gender of an entered name. I built this classifier using a bayesian predictive model and numpy. I have it running in python backend of my webpage. In my Jupyter Notebook it had 80% accuracy on my test set, which is 30% better than a 50/50 guess. I would love any feedback on this project. It has many parts, and I have designed the architecture and implemented it all myself. You can see the project in its current state down below!

Check the actual webpage and ML webapps here: My personal webpage

==============================================================================================

I built this webapp using a basic LAMP stack, Linux, Apache, MySQL, and PHP. I built with data vizualation and time saving in mind. I wanted to make it easy for my friends to read the surf forecasts, so I created a place where all the information they need to make a decision is in one place, and the ideal times to surf are colored in green (or yellow if the surf is really good). The goal was to minimized clicking and maximize checking speed of many different locations. This was acoomplished by having many graphs of various spots around New England in one page with color coding by surf quality. These spot names can then be clicked on for more details, and even wind maps. Godaddy says this domain name is worth about $700, so it does have some users still. Let me know if you are interested in buying it XD.

Check the site here: Surf forecasting site

==============================================================================================

This is a project I did as a Machine Learning Student at Cornell where I had to implement a neural network from scratch using NumPy. I learned the mechanics of the Relu activation function and how that leads to 'kinks' or 'hinges' in the linear approximatin of the data. I know there are other activations funvtions as well such as the sigmoid function too.

==============================================================================================

I created a google chrome extension that analyzes text in a similar manner to the text analysis classifier on my webpage. It tells the user what highlighted text sounds like in terms of the tone, gender and age of the author. I use the same RESTful API endpoint that I created for my personal webpage to do the analysis for this extension. I simply send over the highlighted text to the endpoint, then parse and display the response. I submitted this extension to the Google Chrome Webstore, and I hope they approve it so people can use this!

==============================================================================================

This notebook is a place to try out topic modeling and classification. I trained Word Embeddings with Word2Vec and used an ensemble of Random Forests Classifiers to classify the data. The reason it was necessary to use several Random Forest Classifiers is because the data set has multi label data, that samples may have more than one label. The dataset I am using is of research paper titles and abstracts and they fall into one or more of 5 categories: 'Computer Science', 'Physics', 'Mathematics', 'Statistics', 'Quantitative Biology', or 'Quantitative Finance'. Interestingly, the title seems to be a better indicator of the category of paper than the abstract. The data set can be found here.

==============================================================================================

  • πŸ”­ I’m currently working on fine tuning my text analysis classifiers.
  • 🌱 I’m currently learning devops and networking things!
  • πŸ‘― I’m looking to collaborate on a machine learning project
  • πŸ€” I’m looking for help with devops and networking things!
  • πŸ˜„ Pronouns: He/Him/His
  • ⚑ Fun fact: I love surfing!

Jude Capachietti's Projects

adventofcode icon adventofcode

This is my code for doing the https://adventofcode.com/ 2021 challenge.

blackjack_simulation icon blackjack_simulation

This repository is for code around a blackjack simulation project I am creating. I want to measure the expected returns on varying blackjack strategies.

deep-averaging-neural-network icon deep-averaging-neural-network

This is the last project I completed for my course in Neural Networks. It creates a deep averaging Neural Network for sentiment analsys.

js_ocr_digits icon js_ocr_digits

this is the project folder for creating an OCR digit reader in JS... Converting from python

massachusetts-surf icon massachusetts-surf

This website is going to display all the information I need to figure out where & when to surf (around Massachusetts) without clicking

mediapipes-hand-exploration icon mediapipes-hand-exploration

This is an example of using Googles mediapipe framework to detect hands in images and video. This mapping is then used as the starting point for ML models. In other words, it is the encoding process for further use.

multilayer-perceptron icon multilayer-perceptron

This was a project I did as a Machine Learning Student at Cornell where I had to implement a neural network from scratch using NumPy. The MLP jupyter notebook file is where my code can be seen.

nlp-age-gender-classifier icon nlp-age-gender-classifier

This is a classifier trained the Blog Authorship Corpus, a 2004 dataset from blogger including many entire blogs. It's found here https://u.cs.biu.ac.il/~koppel/BlogCorpus.htm I am using bag of words vectorization to convert the blogs into a form machine learning algorithms can work on. I am using an ensemble CART Tree approach at the moment although I may change that later.

posenet-react-native icon posenet-react-native

This is a react native app with a posenet model running on the camera image. It currently outputs the predictions to the console. When running an app it will display the overall confidence in the model at the bottom of the screen.

python-functions icon python-functions

This repo as of now mainly consists of a function `find_table_names_from_sql_file()` that will output the names of tables in an .sql file, assuming there are no syntax errors in that file. If .sql file is empty, it will return an empty list.

pythonbackendmodelapi icon pythonbackendmodelapi

This is a back end for a web app using flask that will make it easy to run AI models on a webpage

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.