Luis Yamada | Sr Staff Data Engineer @ Nubank

About me

✨ Creating bugs since 2008 - 10 years working with data projects

🎯 Goals: become a black-belt Jiu Jitsu fighter 🥋

Certifications

AWS Certified Solutions Architect - Associate

Technical Expertise

  • AWS: S3, EC2, EKS, CloudFront, Kinesis (Data Streams, Firehose, Analytics), Lambda, EMR, RDS, Aurora, Redshift, Glue, Athena, DynamoDB, SNS, SQS, VPC, IAM, SSM, API Gateway, ELB, CloudFormation, QuickSight, ECR, CodeCommit, CodeBuild, CodePipeline, OpenSearch, Lake Formation
  • Hadoop: HDFS, Kafka, Hive, HBase, Spark (Batch/Streaming), Flume, Sqoop, Storm, NiFi
  • Open Source: Pandas, AWS Data Wrangler, AWS Lambda Powertools, Flask
  • Elastic: Elasticsearch, Kibana, Logstash, Beats
  • Backend/REST API: Spring Boot, Flask, Quarkus, RabbitMQ, gRPC
  • Frontend: HTML, CSS, React (JavaScript), Figma design
  • Databases: MS SQL Server, MySQL, PostgreSQL, Oracle, MongoDB, Redis, Prometheus, Apache Cassandra
  • Business Intelligence: MS SQL Integration/Reporting Services, MS Power BI, Tableau, Einstein Analytics, Grafana
  • Development: Python, Java, Scala
  • DevOps: Git, Jenkins, Airflow, Terraform, Splunk
  • Virtualization/Containers: Docker Hub, Kubernetes, OpenShift

I'm interested in many forms of technology:

  • Data Engineering
  • Data Science / ML
  • DevOps
  • Observability
  • Cloud

I'm experienced in a couple of programming languages:

Python, Java, Node.js, Scala

Luis Yamada's Projects

aws-cost-explorer-data-collector

A serverless stack that collects data from the AWS Cost Explorer API and publishes it as Parquet to Amazon S3, exposing it as an AWS Glue table for further queries

commons-lambda-layers

This repo stores zipped Python packages and a CloudFormation script to deploy them as AWS Lambda layers

data-app-on-eks

This is a DevOps guide on how to quickly create an Amazon EKS cluster with all the dependencies needed to run a data app, deployed by a CI/CD pipeline built on top of AWS services.

docker-rastreio-correios

(PT-BR) The goal of this project is to run a Python application inside a Docker container (or a Kubernetes container using the Docker image) to track and monitor shipments sent through Correios, the Brazilian postal service

got-sentence-analysis

This project is a series of data analyses of the world-famous TV show 'Game of Thrones', specifically of all the sentences spoken across its 8 seasons

hs-decks-analysis

This project is a POC of data analysis that uses Streamlit (https://www.streamlit.io/) for data exploration, reading data from decks built on https://www.hearthpwn.com/ with the help of the open API https://hearthstonejson.com/

ipynb-to-py

Exports the Python code of a Jupyter Notebook to a runnable .py file
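
The idea can be sketched with the standard library alone. This is a minimal illustration, not the code from the ipynb-to-py repo: the names `notebook_to_script` and `convert_file` are hypothetical, and the sketch assumes the usual notebook JSON layout (a top-level `cells` list whose entries carry `cell_type` and `source`). Commenting out lines that start with `%` or `!` is a simple heuristic for IPython magics and shell escapes.

```python
import json


def notebook_to_script(notebook: dict) -> str:
    """Join the code cells of a parsed .ipynb document into one script."""
    chunks = []
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue  # skip markdown and raw cells
        source = cell.get("source", "")
        # The notebook format stores source as a string or a list of lines.
        text = source if isinstance(source, str) else "".join(source)
        # Heuristic: magics and shell escapes are not valid Python,
        # so keep them only as comments.
        lines = [
            "# " + line if line.lstrip().startswith(("%", "!")) else line
            for line in text.splitlines()
        ]
        if lines:
            chunks.append("\n".join(lines))
    return "\n\n".join(chunks) + "\n"


def convert_file(ipynb_path: str, py_path: str) -> None:
    """Read a notebook file and write its code cells to a .py file."""
    with open(ipynb_path, encoding="utf-8") as src:
        notebook = json.load(src)
    with open(py_path, "w", encoding="utf-8") as dst:
        dst.write(notebook_to_script(notebook))
```

Calling `convert_file("analysis.ipynb", "analysis.py")` (both file names are placeholders) would write the code cells to the script in notebook order, separated by blank lines.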

jmx_exporter

A process for exposing JMX Beans via HTTP for Prometheus consumption

manga-chapter-watcher

A web scraper and news watcher that monitors new chapters of your favorite manga and notifies you by e-mail

marvel-br-api

(PT-BR) This API was developed to serve data from the characters endpoint of the global Marvel API, translating the text into Portuguese to better serve Brazilian users. It aims to offer a better user experience than the translation data from the official Marvel API, and it persists the results in MariaDB to reduce the number of calls to the Marvel API

monitor-fiis

A Python web scraping and data analytics project used to identify key metrics and BI insights about Brazilian Real Estate Investment Funds (aka FIIs)

msk-logs-parser

This AWS Lambda function decompresses the broker logs generated by Amazon MSK clusters (.gz format), parses the data, and saves it to Amazon S3 as an AWS Glue table.
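
The decompress-and-parse step could look roughly like the sketch below. It is an illustration under stated assumptions, not the function from the msk-logs-parser repo: `parse_broker_logs` is a hypothetical name, and the regular expression assumes lines shaped like `[timestamp] LEVEL message`, as in Kafka's default log4j pattern. Reading from and writing to Amazon S3 and registering the Glue table are left out.

```python
import gzip
import re
from typing import Dict, List

# Assumed line layout: "[timestamp] LEVEL message".
LOG_LINE = re.compile(
    r"^\[(?P<timestamp>[^\]]+)\]\s+(?P<level>[A-Z]+)\s+(?P<message>.*)$"
)


def parse_broker_logs(compressed: bytes) -> List[Dict[str, str]]:
    """Decompress a .gz payload and split it into structured log records."""
    text = gzip.decompress(compressed).decode("utf-8", errors="replace")
    records: List[Dict[str, str]] = []
    for line in text.splitlines():
        match = LOG_LINE.match(line)
        if match:
            records.append(match.groupdict())
        elif records:
            # A line without the expected prefix (for example a stack trace
            # frame) is treated as a continuation of the previous record.
            records[-1]["message"] += "\n" + line
    return records
```

Each record is a plain dictionary with `timestamp`, `level`, and `message` keys, which maps naturally onto the columns of a table.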

myanimelist-data-collector

This project covers the whole process of anime data collection, preparation, and delivery as a data app, powered by technologies like Pandas, Jupyter, and Streamlit

pivs-streamer-core

A Python microservice that runs in a loop gathering data from the Twitch and Twitter APIs about streams, followers, and videos. As new events are detected, the application sends messages to RabbitMQ queues for other applications to consume.

realtime-analytics-elastickibana

This project combines multiple components to build and run a fast data application: a message broker, a streaming service, an indexed database, and a real-time analytics dashboard

spark-on-k8s

Presenting 3 ways to run Spark on containers, this project is recommended for those who want to explore Big Data outside a Hadoop cluster.

valorant-matches-event-producer

An application built in Java with Quarkus at its core that produces events about fake Valorant matches to Apache Kafka topics with high performance, for future use in data pipelines.
