data-scientist-roadmap-main's Introduction

Data-Science / Data-Engineer-Roadmap

Fundamentals

Data science uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from structured and unstructured data. It draws on techniques and tools from statistics, machine learning, data mining, and computer science to analyze and interpret data. The goal is to help organizations make better decisions and predictions by uncovering patterns and trends in data. To that end, data scientists build predictive models, often with machine learning algorithms.
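As a rough illustration of what a "predictive model" can look like at its simplest, the sketch below fits a straight line to a handful of points by ordinary least squares and uses it to predict a new value. The function names (`fit_line`, `predict`) and the hours/scores data are hypothetical and chosen only for this example; real projects would normally rely on a library rather than hand-written math.

```python
from statistics import mean


def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Sum of squared deviations of x from its mean.
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("xs must contain at least two distinct values")
    # Sum of products of the x and y deviations.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


def predict(model, x):
    """Apply a fitted (slope, intercept) pair to a new input."""
    slope, intercept = model
    return slope * x + intercept


# Hypothetical training data: hours studied vs. exam score.
hours = [1, 2, 3, 4, 5]
scores = [52, 54, 56, 58, 60]

model = fit_line(hours, scores)
print(predict(model, 6))  # prediction for an unseen input
```

The same fit-then-predict shape carries over to the machine learning libraries listed later in the roadmap, only with richer models and more data.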

Data engineering is the design, construction, and maintenance of the systems and infrastructure used to collect, store, process, and analyze large data sets. It covers data warehousing, data modeling, data integration, data quality assurance, and data security. Data engineers work closely with data scientists and analysts to ensure that data is accurate, accessible, and usable for business decisions. They also design and implement the architecture that supports big data technologies such as Hadoop and Spark. The field is evolving rapidly as new technologies and approaches emerge.


Data-Science


Why


Note

It is advised to follow the path in the order given below.

Skills required by Data Scientist

Python

Statistics

Here are some key things that a data scientist should learn in statistics:


Machine Learning


Deep Learning


Natural Language Processing (NLP)


Data Engineers

Skills required by Data Engineer

SQL (Structured Query Language)


Basics Of Linux


Core Data Engineering Concepts


Data Warehouse Fundamentals


Building Batch/Real-Time Streaming Pipelines


Data Orchestration (Airflow)


Cloud Computing


Kubernetes


Docker


data-scientist-roadmap-main's People

Contributors

gulshanpr, yashdev9274

data-scientist-roadmap-main's Issues

Add resources in Docker Folder

Description

What can you add:

  • YouTube lectures
  • Documentation for learning Docker
  • Blogs
  • Projects on Docker

PS: Create separate folders for projects, and Markdown files for other content.

Screenshots

Here are the files/folders you can update

image

Additional information

Keep the format of the files/folders like this

image

Add Resources in Deep Learning Section

Description

Add resources in Deep Learning Folder

What can you add:

  • YouTube lectures
  • Documentation for learning Deep Learning
  • Blogs
  • Projects on Deep Learning

PS: Create separate folders for projects, and Markdown files for other content.

Screenshots

Here is the file/folder you need to update

image

Additional information

Try to create files/folders in this format

image
