Code Monkey home page Code Monkey logo

Projects I've worked on during my studies (and links to their repositories)

NOTE: The projects are ordered from most recently completed to oldest.
DISCLOSURE: My Data Science course puts heavy focus on groupwork, therefore most of these projects are collaborative where I have worked on them in varying degrees. Only the data visualization is fully mine.

👽 Bachelor Project - Measuring polarization in Subreddits

For my bachelor project I worked with two other students to measure how polarized subreddits are. We used a Python library (PRAW) to scrape user data within subreddits, then we created networks of the users based on which users replied to other users posts. We used pre-trained word embeddings to characterize the users based on their posts, and a pre-trained sentiment classifier to measure the sentiments of comments made to posts. Using the network structure, these interactions, and some clever algorithms, we measured whether users with differing characteristics interacted more positively or negatively than users with similar characteristics. In this way, we found that some subreddits were more polarized than others, and that some were much more similar than we initially thought. The graph below shows our main results. The further right, and the further away from the y-axis 0, the more polarized the subreddits are.

Bachelor Project Showcase

🐦 DevOps MiniTwit Project

My group of 5 students was given an outdated Twitter-like website, where we had to work collaboratively to update it to modern standards. To do this we used GitHub and its CI/CD, secrets, kanban board and other features. We also had quality gates through SonarCloud, Snyk, and MegaLinter. We used Django (Python) for the backend of the website, and Docker and Docker Hub to containerize it. We hosted it on DigitalOcean (An IaaS cloud service), but sadly due to the cost of hosting it we have since "destroyed" our "droplets".

📈 Data Visualisation Project

Using Tableau I created an interactive visualisation showcasing ammunition statistics from the videogame "Escape from Tarkov".

EfT Showcase

Feel free try a (less functional) public version here!

For more details read the paper I wrote about it here.

📊 Cross-platform sentiment analysis (in depth project)

For this project, we investigated a method of training a model on multiple domains to then predict on an unseen domain. For this we used review data from multiple different domains such as Amazon. We wanted to see how we could choose different amounts of each training domain to train on in order to get better performance on the test domain, and we ended up finding a couple different methods of preprocessing that effectively boosted the performance of our model by up to a 5% increase in accuracy.

🍸 Forensic Identification of Glass Fragments

We used machine learning techniques to analyse various properties of glass as an input, and make predictions of what type of glass the sample was. We wrote two classifiers from scratch, one a feed forward neural network (which I did), and the other a decision tree classifier. Furthermore we implemented an ensemble method, and then compared all 3 approaches to the problem.

🌐 Airport Network Analysis

For this project we had to choose a network dataset and analyse its various properties. We chose a dataset of the worlds airports and the connections between them. A fun part of our project was to analyse the unrealistic hypothetical scenario of "which airports would still be above sea level should all of the ice in the world melt". I used the python libraries "networkx" and "cartopy" to create maps that showed which airports would be underwater at different sea-levels:

Network Analysis Showcase

Notice how many of the coastal airports become submerged (red) and how many of the connections are lost!

😠😄😢 Tweet Sentiment Classification

We used machine learning and natural language processing techniques to automate sentiment classification of tweets. We learned two models to predict whether a tweet contained hate-speech or not, and to predict whether a tweet was of the emotion of anger, joy, optimist or sadness.

👩‍⚕️👨‍⚕️ Skin Lesion Detection in Medical Treatment Using Machine Learning

We used machine learning and image processing techniques to learn a KNN algorithm to be able to analyse images of skin lesions and predict whether the lesions were Melanoma (a type of skin cancer) or benign (not harmful).

⛅ How weather possibly influences the spread of Covid-19

We investigated the effects of weather on the spread of Covid-19 in Germany by analysing related data using python. We discovered a negative correlation between the amount of ultra violet light (UV-index) and humans being infected with Covid-19, suggesting that sunnier days leads to less infections. Whether it is due to the ultra violet light killing Covid-19, or due to changes in human social interaction during sunnier days remained inconclusive.

🚴💥🚗💨 Road Collision Analysis of Leeds

We investigated the overall road safety in Leeds by analysing traffic data using python. We made visualisations and drew the conclusion that many bicycle-vehicle collisions occur at junctions, and so measures should be taken to improve safety particularly in those areas.

Aidan Stocks's Projects

aml-pokemon icon aml-pokemon

For using vision transformers and auto-encoders with pokemon.

itu-minitwit-ci icon itu-minitwit-ci

A repository for the fourth session in "DevOps, Software Evolution and Software Maintenance" at IT University of Copenhagen

outplayed icon outplayed

Final work for the 2023 "Advanced Applied Statistics" Course as ITU

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.