Code Monkey home page Code Monkey logo

datascience's Introduction

datascience

Pandas

import pandas as pd
df=pd.read_csv('Employees')
//top 5 records
df.head()
//top n records
df.head(n)
//last 5 records
df.tail()
//last n records
df.tail(n)
//(columns,rows)
df.shape
//total size of the data frame cols*rows
df.size
//num of rows
len(df)
//list all columns
df.columns
//extract single column
df['Hired']
//you can also extract given range of rows from the coulmn
df['Hired'][:5]
//or even single value in that row
df['Hired'][5]
//extract more than one column
df[['Experience']['Hired']]
//extract specific ranges of rows in those columns
df[['Experience']['Hired']][:5]
//sorting the data frame with specific column
df.sort_values(['Experience'])
//you can break down unique values in a given columns into rows
degree_counts = df['EducationLevel'].value_counts()
//pandas even make it into a histogram of those counts by simply calling plot
degree_counts.plot(kind='bar')

Exercise
Try extracting rows 5-10 of our DataFrame, preserving only the "Previous Employers" and "Hired" columns. Assign that to a new DataFrame, and create a histogram plotting the distribution of the previous employers in this subset of the data.
import pandas as pd
df=pd.read_csv('PastHires.csv')
rows5to10 = df[5:11]
only2=rows5to10[['Previous employers', 'Hired']]
print only2
only2['Previous employers'].plot(kind='bar')

datascience's People

Contributors

venkatachadalawada avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.