Code Monkey home page Code Monkey logo

Hi there! 馃憢

As a data journalist, I focus on data-driven investigations that expose abuses of power. My work includes scraping and cleaning data, creating data memos, conducting research and fostering understanding of data work within the team.

About me

  • 馃捇 Data analysis at The Examination
  • 馃殌 NLP enthusiast
  • 馃搶 Always interested in collaborating on data-driven projects
  • 馃摣 How to reach me: [email protected]

Contents


NLP

Repository Description
discursos-milei Scraper y an谩lisis de discursos de Javier Milei
ai4foia Proof-of-concept to recommend recipients for FOIA requests
hackathon-somos-nlp-2023 Fine-tuning LLMs for detecting hate speech categories in Spanish
customized-headlines Proof-of-concept to create customized headlines from news content based on demographic data
explained-recommendations API for a system recommendation explained using generative AI
opportunities-db Scraper to extract data from opportunity-related websites (e.g. funds, scholarships, etc.) and convert them into structured data
ner-spanish A repository for extracting Named Entity Recognition (NER) in Spanish data
pmdm Fine-tuned pre-trained language model that detects hate speech against women in Spanish and Portuguese
attackdetector Research for hate speech on Twitter against journalists and environmental activists in Mexico and Brazil
topicos-discursos-amlo Analysis with topic modeling to AMLO's speeches
bad-bunny Analysis of Bad Bunny's songs

Data Analysis

Repository Description
travesticidios-argentina Data analysis on court decisions on transvesticides in Argentina from 2018 to 2023
elecciones-argentina-2023 Data analysis of attacks against journalists in Twitter during the elections in Argentina in 2023
recomendaciones-escritoras Recommendation system for Latin American women writers
cancilleria-colombia Data analysis of public servants of Foreign Affairs in Colombia
gptzero-ai-articles Data analysis of articles talking about ChatGPT that were created with generative AI models
capir-transfronteriza2-2023 Data analysis and topic modeling of anti-rights groups from Brazil, Ecuador and Colombia
migrantes-desaparecidos-eeu Data analysis on missing migrants en route to the U.S.
covid19-venezuela Data analysis on covid-19 deaths in Venezuela
violencia-obstetrica-cuba Data analysis of obstetric violence in Cuba

Data Visualization

Repository Description
ping-pong-caba Mapa con ubicaciones de mesas de ping pong en lugares p煤blicos de CABA
comision-revision-bolivia Map showing the rate of femicides in Bolivia per 100,000 women from 2013 to 2020
escritoras-latinas Web scraping of Wikipedia entries for Latin American women writers and network graph visualization
wifi-gratuito-cdmx Map showing locations of public free internet service in Mexico City [ARCHIVED]
mapa-huertos Map with locations of urban orchards in Mexico City [ARCHIVED]
maps-examples Maps examples using folium and prettymaps modules in Python [ARCHIVED]
directorix-disidente Digital directory of professions to build networks among the queer community of Mexico City [ARCHIVED]

Web Scraping

Repository Description
cij-argentina Scraper to convert PDF files from the CIJ website in Argentina into structured data
pdf-2-ner Web application to convert scanned PDF files to text-based data and apply Named Entity Recognition (NER) to extract entities in Spanish

Tools

Repository Description
pubmed-scraper A python command-line tool which scrapes PubMed based on keywords search and URL extraction
oportunidades-perioidstas-latam Sitio web para difundir oportunidades para periodistas en Latinoam茅rica
meta-threat-disruptions Track updates on Meta鈥檚 threat disruptions website
numerical-expressions A python command-line tool which describes the change between two numerical values
data-annotator Web application for text-based data labeling [ARCHIVED]

Project Templates

Repository Description
cookiecutter-data-analysis-extensive A cookiecutter template for data analysis projects using Python
cookiecutter-data-analysis-lite A starter template for data analysis projects that offers a simplified and beginner-friendly structure
cookiecutter-data-journalism A cookiecutter template for data journalism projects using Python

Learning Resources

Repository Description
csvconf-nlp Sesi贸n de introducci贸n a NLP en la csv,conf,v8 de Puebla, M茅xico en 2024
taller-cookiecutter Taller sobre c贸mo crear plantillas de proyectos para an谩lisis de datos
taller-python Jupyter notebooks for learning the basics of Python
learn-python Collection of Python scripts organized by topics
learn-react-d3 Examples for data visualization with React and D3.js
learn-scrollama Examples for scrollytelling with scrollama
twitter-python Examples for Twitter data collection with Tweepy in Python [ARCHIVED]

Fer Aguirre's Projects

opportunities-db icon opportunities-db

Scraper to extract data from opportunity-related websites (e.g. funds, scholarships, etc.) and convert them into structured data (work-in-progress)

pdf-2-ner icon pdf-2-ner

Web application for information extraction and named entity recognition for PDF files (work-in-progress).

ping-pong-caba icon ping-pong-caba

Mapa con ubicaciones de mesas de ping pong en lugares p煤blicos de CABA

pmdm icon pmdm

Political Misogynistic Discourse Monitor team from the 2021 JournalismAI Collab Challenges

pubmed-scraper icon pubmed-scraper

A python command-line tool which scrapes PubMed based on keywords search and URL extraction

taller-python icon taller-python

Cuadernos de Jupyter para aprender los fundamentos de Python 馃悕

twitter-python icon twitter-python

Examples for Twitter data collection with Tweepy in Python 馃惁

wifi-gratuito-cdmx icon wifi-gratuito-cdmx

Mapa con locaciones de servicio p煤blico de internet gratuito en Ciudad de M茅xico 馃寪

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    馃枛 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 馃搳馃搱馃帀

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google 鉂わ笍 Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.