About me

I am a passionate and skilled data engineer 👨‍💻 with three years of experience in the field. I work with tools such as GCP, Python, Spark, SQL, Talend Open Studio, Tableau, Power BI, Docker, Terraform, dbt, and Airflow. I have designed and developed robust data pipelines and architectures, ensuring data integrity and accuracy ✔️. Using cloud computing platforms like GCP ☁️, I have implemented scalable solutions. I love transforming complex data sets into actionable insights using visualization tools like Tableau and Power BI 📈. I have also mastered containerization 📦, infrastructure provisioning ⚙️, and workflow orchestration 🎼. I am driven to deliver excellence in data engineering and thrive on pushing boundaries 💪. Request resume.

I also write about data engineering on

🌐 Communities: @DataTalksClub, @TechUp Africa

📫 Let's talk: mail me at [email protected] or reach me through my socials

𝕏.com, LinkedIn, Spotify

Skills


• Data Modelling
• DataOps
• Data versioning and source control
• Cloud data engineering
• Extract, Transform, Load (ETL)
• Data Warehousing
• Data streaming and real-time processing
• Orchestration & workflow automation
• Data governance and quality management

Latest Articles

Title | Link
Running Transformations on BigQuery using dbt Cloud: step by step | https://dev.to/wachuka_james/running-transformations-on-bigquery-using-dbt-cloud-step-by-step-11bo
Debugging Python Data Pipelines | https://dev.to/wachuka_james/debugging-python-data-pipelines-a-step-by-step-guide-11g7
Using pyspark to stream data from coingecko API and visualise using dash | https://dev.to/wachuka_james/using-pyspark-to-stream-data-from-coingecko-api-and-visualise-using-dash-5g43

James Mwangi's Projects

coingecko-streamapp

A streaming app and a dashboard for visualizing cryptocurrency data fetched from the CoinGecko API. The streaming app retrieves real-time cryptocurrency information using Spark Streaming and stores it in a PostgreSQL database.

event-driven-microservices

This project demonstrates an event-driven microservices architecture that uses Apache Kafka for event streaming and webhooks for integration with external services.

mapreduce

MapReduce techniques in Hadoop: joins, job counters, inputs/outputs.

mysql_gcp

Using Airflow to extract data from MySQL, transform it, and load it into BigQuery.

ploomber

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

podcasts_pipeline

Building a four-step data pipeline using Airflow to download podcast episodes.

prefect-postgresql-sensors

The prefect_postgres_sensors package provides Prefect sensors for monitoring changes or conditions within a PostgreSQL database.

python_etl

Using python-sql to create an ETL process between MySQL and PostgreSQL, with the Windows scheduler automating the job.

python_tweepy

Using Python and Tweepy to follow back friends on Twitter. The task is run by the Windows scheduler, which follows back every 5 minutes.

shell_

First attempt at Windows task scheduling.

soda-core

⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

weatherbot

A weatherbot built using a weather map API and the Telegram API.
