Code Monkey home page Code Monkey logo

Welcome

Jacquelyn Harland's Portfolio

Data Engineer/Scientist - Technical Proficiencies

* Areas of Expertise: Statistical Concepts, Information Management, Process Improvement, Software Documentation, Systems Planning, Project & Program Management, Customer Interactions, User Acceptance Testing, Leadership. Team collaporation on Git
* Tools: SQL/NoSQL, R, Python (Pandas, NumPy, Flask), ETL, Airflow Support, Tableau, Power BI Visualization, D3.js, Plotly, Matplotlib, Machine Learning-Supervise, Unsupervised, Neural Netork Creation and management of Docker containers, Warehousing, Probability, Statistics.
* Databases: Azure Databricks, Data Factory, SQLAlchemy, MongoDB, PostgreSQL, AWS, SQL Server.

SKILLS PROGRAMMING LANGUAGES & TOOLS:

MACHINE LEARNING Natural Language Processing, Scikit-learn, PyTorch, Keras ,TensorFlow

BIG DATA ANALYTICS Hadoop, PySpark, AWS, Google, Cloud, S3

STATISTICS Modeling & Forecasting

PYTHON PROGRAMMING Python, Pandas, NumPy ,Matplotlib, Seaborn, Flask, API, Interactions, Social ,Media Mining

DATABASES MySQL, MongoDB, ETL, SQL

FRONT-END WEB VISUALIZATION HTML, CSS, Bootstrap, Dashboarding, JavaScript, Charting D3.js, Plotly, Geomapping, with Leaflet.js

BUSINESS INTELLIGENCE SOFTWARE Tableau Dashboard Creation & Geo Mapping

LEADERSHIP, Team Managment

Project 1: Belly Button Biodiversity Dashboard using Plotly

Resources

Data Source: BellyButton_bar_chart_starter_code.js, BellyButton_bubble_chart_starter_code.js, BellyButton_bubble_chart_starter_code.js and index.html • Data Tools: ECMAScript, JavaScript, JSON and IO (Web Server) • Software: ES6+, ECMAScript and Visual Studio Code 1.50.0\

Overview

Plotly provides graphing, analytics, and statistics tools, as well as scientific graphing libraries for Python, R, MATLAB, Perl, Julia, Arduino, and REST. Plotly is an open-source library that provides a list of chart types as well as tools with callbacks to make a dashboard. The charts we have constructed and embedded are all made in chart studio of plotly. When Mrs. Roza came to us, she had partially completed a dashboard. A completed panel for demographic information allows for a user-friendly convenience. We have constructed a chart to visualize the bacterial data for each volunteer allowing them to identify the top 10 bacterial species in their belly buttons. Furnishing this information will allow Improbable Beef the ability to identify a species, as a candidate, to manufacture synthetic beef. Roza's volunteers will be able to identify whether that species is found in their navel. The interactive dashboard explores the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels. The dataset reveals that a small handful of microbial species (also called operational taxonomic units, or OTUs, in the study) were present in more than 70% of people, while the rest were relatively rare.

Elements Constructed:

  1. Horizontal Bar Chart
  2. Bubble Chart
  3. Gauge Chart
  4. Customize the Dashboard
  5. Report on the Belly Button Biodiversity Dashboard analysis

Interactive Dashboard

1: Horizontal Bar Chart

Using JavaScript, Plotly, and D3.js, we created a horizontal bar chart to display the top 10 bacterial species (OTUs) when an individual’s ID is selected from the dropdown menu on the webpage. The horizontal bar chart displays the sample_values as the values, the otu_ids as the labels, and the otu_labels as the hover text for the bars on the chart.

Code and Image

 1. Code is written to create the arrays when a sample is selected from the dropdown menu.
 2. Code is written to create the trace object in the buildCharts() function, and it contains the following:
     o	The y values are the otu_ids in descending order.
     o	The x values are the sample_values in descending order
     o	The hover text is the otu_labels in descending order.
 3.	Code is written to create the layout array in the buildCharts() function that creates a title for the chart.
 4.	When the dashboard is first opened in a browser, ID 940’s data should be displayed in the dashboard, and the bar chart has the              following:
      o	The top 10 sample_values are sorted in descending order
      o	The top 10 sample_values as values
      o	The otu_ids as the labels

2: Bubble Chart

Using your knowledge of JavaScript, Plotly, and D3.js, create a bubble chart that will display the following when an individual’s ID is selected from the dropdown menu webpage:

  •	The otu_ids as the x-axis values.
  •	The sample_values as the y-axis values.
  •	The sample_values as the marker size.
  •	The otu_ids as the marker colors.
  •	The otu_labels as the hover-text values.
  
1. The code for the trace object in the buildCharts(); function does the following:
   o	Sets the otu_ids as the x-axis values
   o	Sets the sample_values as the y-axis values
   o	Sets the otu_labels as the hover-text values
   o	Sets the sample_values as the marker size
   o	Sets the otu_ids as the marker colors
2. The code for the layout in the buildCharts(); function does the following:
   o	Creates a title
   o	Creates a label for the x-axis
   o	The text for a bubble is shown when hovered over
3. When the dashboard is first opened in a browser, ID 940’s data should be displayed in the dashboard. All three charts should also          be working according to their            requirements when a sample is selected from the dropdown menu.

Code and Image

3: Gauge Chart

Finally we created a gauge chart that displays the weekly washing frequency's value, and display the value as a measure from 0-10 on the progress bar in the gauge chart when an individual ID is selected from the dropdown menu.

Code and Image

 1.	The code to build the gauge chart does the following:
      o	Creates a title for the chart.
      o	Creates the ranges for the gauge in increments of two, with a different color for each increment.
      o	Adds the washing frequency value on the gauge chart.
      o	The indicator shows the level for the washing frequency on the gauge.
      o	The gauge is added to the dashboard.
      o	The gauge fits in the margin of the <div> element.
  2.	When the webpage loads, the bar and bubble chart are working according to the requirements in Deliverable 1 and 2, respectively,            and the gauge chart is working according to the requirements listed for this Deliverable

Jacquelyn Harland's Projects

360degree-view-of-sales-statistics icon 360degree-view-of-sales-statistics

Sales Interactive Dashboarding, Power BI, A 360 degree view of what's happening within your sales data: KPIs indicated in the top left corner, highest level sales total number Bar Chart that looks at sales by category (cell phones, electronics, hardware, and televisions) Geomap provides sales by location Line Graph provides sales by time, a data model which allows for interactive metrics with a click. Let's say, electronics and it will change all the other visualizations. This interactive element is very valuable and empowers others to start digging for their own answers to their own questions. Leveraging this tool allows for the building out of an organization's reporting infrastructure.

amazon_vine_analysis icon amazon_vine_analysis

Analysis of Vine vs. Non-Vine Member Reviews on sporting goods. Using Amazon’s cloud service AWS, Google Colab and Pyspark to analyze Amazon’s reviews for outdoor products. Determining if there is any bias toward favorable reviews from Vine members.

analyzing-movie-rentals icon analyzing-movie-rentals

Connected to a PostgreSQL database with data around a DVD rental business and produced visualize the data in Python.

biodiversity-dashboard-using-plotly icon biodiversity-dashboard-using-plotly

Dynamic Dashboard of Belly Button Sample Data: an interactive dashboard, a creative visualization that offers interactivity which can help the audience better understand the data and draw the same conclusions as researchers and data analyst. The purpose of this research is to gather, identify, and analyze the biological footprint of individuals’ naval and their unique naval bacteria.

credit_risk_analysis icon credit_risk_analysis

Supervised ML - Credit Risk on Loan Applications: using several models on credit loan data in order to predict credit risk. Python Sickit-learn library and models (Supervised Machine Learning Models - Logistic Regression, Random Forest, AdaBoost Classfier, Cluster Centroids, Oversampling & Undersampling)

cryptocurrencies icon cryptocurrencies

Unsupervised Machine Learning- CyrptoCurrency Analysis, using several models on a cryptocurrency data in order to discover patterns and groups in data. Analysis done to create a report that includes what cryptocurrencies are on the trading market and how they could be grouped in order to create a classification system for potential new investments into the cryptocurrency market.

diamonds-price-acceptable icon diamonds-price-acceptable

Exploratory Analysis of diamonds, predictive price model developed, vendor analysis. I used Python, Excel, R, Tableau to demonstrate my diversity in technology. Python was used for ml, excel pivot tables to work toward choosing diamonds, R for graphs, and exploratory analysis, and Tableau to make a few graphs.

election_analysis icon election_analysis

Developed a Python script that generates a vote count per candidate/per county and creates a report for a U.S. congressional race in a Colorado precinct.

hottest-topics-in-machine-learning icon hottest-topics-in-machine-learning

Neural Information Processing Systems is one of the top machine learning conferences in the world where groundbreaking work is published. In this Project, we analyzed a large collection of NIPS research papers from the past decade to discover the latest trends in machine learning. The techniques used here to handle large amounts of data can be applied to other text datasets as well.Familiarity with Python and pandas is required to complete this Project, as well as experience with Natural Language Processing in Python, sklearn specifically.

kickstarter-analysis icon kickstarter-analysis

A captivating plan to bring success to your endeavors. We have viewed other successful campaigns, assessed the keys to their progress, combined with your brilliant ideas to bring you a statistically supported plan. Performing analysis of Kickstarter data to uncover trends

mapping_earthquakes icon mapping_earthquakes

An interactive earthquake map with JavaScript and Leaflet: created an interactive map that shows the latest earthquake activity around the world. Maps allow us to explore, understand and make decisions about our world. Providing data-driven storytelling on disasters around the world and building insightful data visualization.

mechacar_statistical_analysis icon mechacar_statistical_analysis

Statistical Analysis with R - Summary Statistics, T-Tests, ANOVA. Performed statistical testing in programming language R for a car company. Statistical tests provided data-based insight on the company performance and suggested additional testing for comparison of Car company against its competition.

mission-to-mars icon mission-to-mars

Web-scraping with HTML and CSS: Performed web scraping from active NASA websites in order to retrieve information about NASA’s latest article, featured picture, Mars facts and images of Mars Hemispheres. All retrieved data are put together in a single web application to showcase the gathered information.

movies-etl icon movies-etl

ETL to Collect, Import, and Process Data: The Amazing Prime, a video streaming company, decided to sponsor a hackathon, where participants try to predict which low budget movies being released will become popular. Participants of a hackathon need clean data. Provided an organized and clean dataset using ETL or Extract, Transform, and Load process

neural_network_charity_analysis icon neural_network_charity_analysis

Neural Network and Deep-learning models. Using Neural Networks Machine Learning algorithms, artificial neural networks, as well as Python TensorFlow library in order to create a binary classifier that is capable of predicting whether applicants will be successful if funded by the nonprofit foundation called Alphabet Soup. This ML model will help ensure that the foundation’s money is being used effectively.

nyc-bike-sharing-program-dashboard- icon nyc-bike-sharing-program-dashboard-

Tableau Dashboard for CitiBike NYC Ride Sharing Program. Created data visualization with Tableau for bike sharing program in New York City. Analyzed the data, see the mechanics of the business and figure out how the bike share business works in NYC.

pewlett-hackard-analysis icon pewlett-hackard-analysis

Creating ERD Diagrams, perform data modeling, complete analysis on employee database pgAdmin using SQL: QuickDBD and Schemas to design databases and writing intermediate-level SQL queries to answer important business questions for the company’s HR department. Utilizing PostreSQL a data base system to load, build, and host company data and pgAdmin and the result is a well-designed database with reporting capabilities.

pipeline-with-singer- icon pipeline-with-singer-

Singer is a specification that describes how data extraction scripts and data loading scripts should communicate using a standard JSON-based data format over stdout.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.