To read the given data and perform Feature Scaling and Feature Selection process and save the data to a file.
STEP 1:Read the given Data.
STEP 2:Clean the Data Set using Data Cleaning Process.
STEP 3:Apply Feature Scaling for the feature in the data set.
STEP 4:Apply Feature Selection for the feature in the data set.
STEP 5:Save the data to the file.
- Standard Scaler: It is also called Z-score normalization. It calculates the z-score of each value and replaces the value with the calculated Z-score. The features are then rescaled with x̄ =0 and σ=1
- MinMaxScaler: It is also referred to as Normalization. The features are scaled between 0 and 1. Here, the mean value remains same as in Standardization, that is,0.
- Maximum absolute scaling: Maximum absolute scaling scales the data to its maximum value; that is,it divides every observation by the maximum value of the variable.The result of the preceding transformation is a distribution in which the values vary approximately within the range of -1 to 1.
- RobustScaler: RobustScaler transforms the feature vector by subtracting the median and then dividing by the interquartile range (75% value — 25% value).
Feature selection is to find the best set of features that allows one to build useful models. Selecting the best features helps the model to perform well. The feature selection techniques used are: 1.Filter Method 2.Wrapper Method 3.Embedded Method
import pandas as pd
from scipy import stats
import numpy as np
df=pd.read_csv("/content/bmi.csv")
df.head()
![image](https://private-user-images.githubusercontent.com/142209319/322836895-2aea6717-ef53-42bb-b94c-5228a301c51b.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM2ODk1LTJhZWE2NzE3LWVmNTMtNDJiYi1iOTRjLTUyMjhhMzAxYzUxYi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1lODYwM2JhYTM3NjNkNTU5OTA3YzAzOWMwZDA4MmEzYTlhYzc5NGZkMmYyZDlhOWExNjgwNDAyY2IzNjQwNTY1JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.G3xYGImt5VH1p9ZvAu6lGbXTeWpInxhy9JCjYBjCmWc)
df.dropna()
![image](https://private-user-images.githubusercontent.com/142209319/322837093-8b146d52-1bff-4afe-95ad-1fd1b34c1890.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM3MDkzLThiMTQ2ZDUyLTFiZmYtNGFmZS05NWFkLTFmZDFiMzRjMTg5MC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0wZTIzM2ZjOTg5MjczMDhiOTcwYWVlNjVjMTIxY2QxNjY2MDE0YTFmOGNlNzllY2M2MzZlNTZhZDViNzZmNTRhJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.zo5Y5YdYz_RJRnIjkhZrcx5j4vs3Z5yomDYAtz6cbOg)
max_vals=np.max(np.abs(df[['Height','Weight']]))
max_vals
![image](https://private-user-images.githubusercontent.com/142209319/322837681-76a9d44b-b7de-43b0-b4c5-4fa2369743cd.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM3NjgxLTc2YTlkNDRiLWI3ZGUtNDNiMC1iNGM1LTRmYTIzNjk3NDNjZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT02ZTYwZjZhMTg1NGFjZWUyNzJmNjY2YTJjYWQ3NDljN2Q0Njk3N2UzYTRkM2MyN2E1YTVmOWFkYWM0ZWVmMjM5JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.lPzEFIbSD_u5yOiA-sso_F30f7cRDng5Pxr8WdlpRQ0)
from sklearn.preprocessing import StandardScaler
sc=StandardScaler()
df[['Height','Weight']]=sc.fit_transform(df[['Height','Weight']])
df.head(10)
![image](https://private-user-images.githubusercontent.com/142209319/322838096-c1e1b5ab-3592-4f38-9696-9e394eb38a08.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM4MDk2LWMxZTFiNWFiLTM1OTItNGYzOC05Njk2LTllMzk0ZWIzOGEwOC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT05MmMyMDg4ZGJjZjRkZGJlYTdlNTQyNmFjZDRhZjAyNDBhYTYxNGMyN2I4MzFlZWM1MTZlOTQ2NWNiZDVhZjM2JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.6QrNvGO6PGvLsF4txDmWE10KESj-UIDG8a_-gYgVm7g)
from sklearn.preprocessing import MinMaxScaler
scaler=MinMaxScaler()
df[['Height','Weight']]=scaler.fit_transform(df[['Height','Weight']])
df.head(10)
![image](https://private-user-images.githubusercontent.com/142209319/322838526-cc1b1e0b-0a00-455f-b3aa-2820d2eaea4c.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM4NTI2LWNjMWIxZTBiLTBhMDAtNDU1Zi1iM2FhLTI4MjBkMmVhZWE0Yy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT01ZmE5YzBiMTdmYzZlZmM5Y2VjMjhkOTgwMWYwYjM3YWQxN2E3OGRiNzE3MjdkNjVmNTQ2NjMwZjYzNjk3NTE3JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.GwcJzVMlRdDr-_2yeap6QmsMCmYc2YBw3JrtGwjbfPY)
from sklearn.preprocessing import Normalizer
scaler=Normalizer()
df[['Height','Weight']]=scaler.fit_transform(df[['Height','Weight']])
df
![image](https://private-user-images.githubusercontent.com/142209319/322839820-d55c9d08-42a1-4106-9fb0-dad172a28cfd.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM5ODIwLWQ1NWM5ZDA4LTQyYTEtNDEwNi05ZmIwLWRhZDE3MmEyOGNmZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT00MWI1MDkyOWJiZDM1M2FmOWJkNjVkZjcxNmZjYmZmMmRkOTViN2UxZjAzYzE0Mjg5NmNlOTM4YTk5NTg0ODNhJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.fm9ZJJoZTdn0DXHSBGE6iScaAzlut4Kgmlfs2XUH3Fo)
df1=pd.read_csv("/content/bmi.csv")
from sklearn.preprocessing import MaxAbsScaler
scaler=MaxAbsScaler()
df1[['Height','Weight']]=scaler.fit_transform(df1[['Height','Weight']])
df1
![image](https://private-user-images.githubusercontent.com/142209319/322840052-1a21a595-3a30-43de-9dd9-abc59ff86dd8.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQwMDUyLTFhMjFhNTk1LTNhMzAtNDNkZS05ZGQ5LWFiYzU5ZmY4NmRkOC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0wZDc4YmM4NDgwM2RjOTNlN2RjNjhiYmE3M2NhYjFhOGViYmM4YmFiOTAyNmFlZjNkZDdlOGU1OWQ1ODk0NDFjJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.jkMzSkDaTixCU2B5oauxnocxdZp6xFRuywRSWDKKfF8)
df2=pd.read_csv("/content/bmi.csv")
from sklearn.preprocessing import RobustScaler
scaler=RobustScaler()
df2[['Height','Weight']]=scaler.fit_transform(df2[['Height','Weight']])
df2.head()
![image](https://private-user-images.githubusercontent.com/142209319/322840324-df58bc4e-e7c9-49d4-ba54-c61123297ee1.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQwMzI0LWRmNThiYzRlLWU3YzktNDlkNC1iYTU0LWM2MTEyMzI5N2VlMS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT01ZmQ5Zjc3OWNiNmZjNDNmMjk3MzFmMTcyODNlMjBlZTBhMmJiMWI0YTkwYzA1YWQ5OWU5NjdmMDlkYjliNWM4JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.y9vAQ7tJs8V4T5KLpH45nCP1_YSJuHilbrNJyPgOZl4)
import pandas as pd
import numpy as np
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score,confusion_matrix
data=pd.read_csv('/content/income(1) (1).csv',na_values=[" ?"])
data
![image](https://private-user-images.githubusercontent.com/142209319/322840753-1eb413c2-b5a1-4d89-a654-d96a5cbc64b0.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQwNzUzLTFlYjQxM2MyLWI1YTEtNGQ4OS1hNjU0LWQ5NmE1Y2JjNjRiMC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kNDhmMTczMGJjOWZlMGMzNTEyZGZiNDdhOGQ5YjkwMGRjNzQ1ZjI1N2Y1Mzg1NmRjYzk3NmQwZGRkOGVmYTkwJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.13nyQWgFK5wDgVuC9De5ARNDCMaP03vJYtHdoBsxVnc)
data.isnull().sum()
![image](https://private-user-images.githubusercontent.com/142209319/322840962-9a0b6c2c-4c0c-46b9-91ad-475f04ce3562.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQwOTYyLTlhMGI2YzJjLTRjMGMtNDZiOS05MWFkLTQ3NWYwNGNlMzU2Mi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0xODEwZGZmZDQ0YjQ3MTIzN2NjYmE4NzRmNmIxOTQ5YjQ3MWZiZjdhYWI2MWNiMmVhMjA3YjBhMzEyZWJhOTZhJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.zi0b05jMAHCG549Sl0ZnhDMQm2n1YoxdnpyaYjeZrc8)
missing=data[data.isnull().any(axis=1)]
missing
![image](https://private-user-images.githubusercontent.com/142209319/322841183-2b5bbfb3-9315-472c-a097-8adee8f4578f.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQxMTgzLTJiNWJiZmIzLTkzMTUtNDcyYy1hMDk3LThhZGVlOGY0NTc4Zi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT03MzM5MDJhNDEwOWM5YjdhNzhlNzYxZGMwYjc0ODllNWRmZTNkMDZkODA0MTA1MTU1NjZiYjMxYTIwYjg0NjdiJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.8pXCC84roVLdZAUGcClpPxDZVEAcIVzbd3eLXAa5VTs)
data2 = data.dropna(axis=0)
data2
![image](https://private-user-images.githubusercontent.com/142209319/322841391-e0209e9d-dfa7-4245-8d6a-6c695e6728bc.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQxMzkxLWUwMjA5ZTlkLWRmYTctNDI0NS04ZDZhLTZjNjk1ZTY3MjhiYy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0yMTVhNjBkZjBiNGVkNTdiM2VjNWMyZTYwOWM1ZjFmOGMyYjkyOGVkODJjYzA1MTg0NjgxZGEyYzE4NmRiNDBiJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.uSYSZRIWq-JQTvm3rIQrBca5SMEEGwNOn5AXcRt6Mwk)
sal=data['SalStat']
data2['SalStat']=data2['SalStat'].map({' less than or equal to 50,000':0,' greater than 50,000':1})
print(data2['SalStat'])
![image](https://private-user-images.githubusercontent.com/142209319/322841611-c3919808-6ed9-432e-b83e-9aff94a35994.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQxNjExLWMzOTE5ODA4LTZlZDktNDMyZS1iODNlLTlhZmY5NGEzNTk5NC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT05NzJlMTEyMGU0NGVhNmI5OTA3ZjE2ZDA3YTE4MGM3OGVlNmI5ODAzMTJmM2FhN2Y2YWZmODQ1NWE5ZDQzZTc2JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.aSE83adBllyiR37_MtZ4ppVNkJqgXXvTtZKbA8LUtWQ)
sal2=data2['SalStat']
dfs=pd.concat([sal,sal2],axis=1)
dfs
![image](https://private-user-images.githubusercontent.com/142209319/322841890-a2ce5564-7e32-481d-9547-e5943afb5c15.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQxODkwLWEyY2U1NTY0LTdlMzItNDgxZC05NTQ3LWU1OTQzYWZiNWMxNS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zNTY1N2MwN2MzNGU4YzhiNTAxODgxM2Y4ZTU5ZjA3ZGZmMGRiYTdlNDhkYjlkNGRmZWQwZjExMWM5OTU5NDQ5JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.zu4zHIkkWeqoKGSVqgbv8qO9ARqWtmMOMSSdHHR_wRU)
data2
![image](https://private-user-images.githubusercontent.com/142209319/322842106-a4fc1de3-8051-4743-81c6-c054bcb95f7d.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQyMTA2LWE0ZmMxZGUzLTgwNTEtNDc0My04MWM2LWMwNTRiY2I5NWY3ZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT01NmQzY2EwYzY3YmJhMzMzZTZiYTA4ODU4MDJhZjZmNjZlYmI1YzY3ZDdlMDQ5NGJjZmE0NDZiZTVlZDExZjAxJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.wd5HMSgTmo9yqg9eC8dHUXTF_Y_YRmg6BrC0OYBZtz4)
new_data=pd.get_dummies(data2, drop_first=True)
new_data
![image](https://private-user-images.githubusercontent.com/142209319/322843145-7eef042e-6822-4ac5-a574-b19476d68a9a.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQzMTQ1LTdlZWYwNDJlLTY4MjItNGFjNS1hNTc0LWIxOTQ3NmQ2OGE5YS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT03YTMyNzZkMjM5NTBkYjQ5YzZjZWExOTg4OWE4ODhmNGEwYmJjOWIwMzliNGI2OWI0YzQ0MWQwOTg1NDJmYjFkJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.-n7LInv3bSBibxrQa4AUUpm0oj4T3HLBVrJJJQBlDNs)
columns_list=list(new_data.columns)
print(columns_list)
![image](https://private-user-images.githubusercontent.com/142209319/322844071-2958db03-4ee4-4d69-8a09-2154b7756727.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ0MDcxLTI5NThkYjAzLTRlZTQtNGQ2OS04YTA5LTIxNTRiNzc1NjcyNy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0xMjgwMWNhNGFjODM4ZTgxNTRiYTg1OGQ3MTkyYjQ0Y2E5NGY5YjRmYTg2ODgzMzgyMGQxYzIwYmYwNmUwODBlJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.PoVvqBtDcMDc10CRvTOzRDlIB0fpqNpV0-ih6nYPYi4)
features=list(set(columns_list)-set(['SalStat']))
print(features)
![image](https://private-user-images.githubusercontent.com/142209319/322844567-fd4a58e0-39ef-426f-89d3-ff7af0fffd98.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ0NTY3LWZkNGE1OGUwLTM5ZWYtNDI2Zi04OWQzLWZmN2FmMGZmZmQ5OC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1hYTZmMjJmM2Q0NmNjMmY5Yzc3NTYxYjk1Zjc5NWIxYTZlOTQ3NTdiMzA0ZmNlNTA4NjhiZTc5M2MzZWE2ZDVmJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.azBwx4NF1WcSwxRtW97xHsJoDN9L9PxkKWnFNpnGuWU)
y=new_data['SalStat'].values
print(y)
[0 0 1 ... 0 0 0]
x = new_data[features].values
print(x)
![image](https://private-user-images.githubusercontent.com/142209319/322844918-6ad70fd2-6946-4995-be0e-801b3b2f0212.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ0OTE4LTZhZDcwZmQyLTY5NDYtNDk5NS1iZTBlLTgwMWIzYjJmMDIxMi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT01MGFlMmFhMzA3NWJjOTMwZGQ5OGRjZDViYzI1YWE1OGZiZjJhMWZiZTNlOWNiN2UzNjg3Njk3YzFiOWNlMjMxJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.uOAzS0c0oe4Sw2rU9t_HhWNWQhTPT6kqz7wVtHjojgQ)
train_x,test_x,train_y,test_y = train_test_split(x,y,test_size=0.3, random_state=0)
KNN_classifier=KNeighborsClassifier(n_neighbors = 5)
KNN_classifier.fit(train_x,train_y)
![image](https://private-user-images.githubusercontent.com/142209319/322845594-4c854a00-85d9-4d76-b57e-292d13ba84ee.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ1NTk0LTRjODU0YTAwLTg1ZDktNGQ3Ni1iNTdlLTI5MmQxM2JhODRlZS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1jYTJjNzM0YjI0YTY1ODJmZDY4OWUyNjgzZTBmZmQwZmFkOGY3ODIyOGQzNmI2ZjIwYTQwYmE2ZjJiM2VmNTllJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.aRptv3n_B-gYfPuxD1RKFAqiyfhG3UszgDqQm1Y1dno)
prediction = KNN_classifier.predict(test_x)
confusionMmatrix = confusion_matrix(test_y, prediction)
print(confusionMmatrix)
![image](https://private-user-images.githubusercontent.com/142209319/322845909-2aae088f-279f-4f33-8137-b712dc580d9e.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ1OTA5LTJhYWUwODhmLTI3OWYtNGYzMy04MTM3LWI3MTJkYzU4MGQ5ZS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0xYzc2MDQyOTFiN2E5ODg4MWVhNTliN2NmYmJmN2E4YWVmZjM4NDc3ZjRhZjczNWMxNjI2NWViNjRjZTMxYTVmJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.q86BklFmqB5ux-24rFQPHBUNocI39xvsDCW64t-CpWY)
accuracy_score=accuracy_score(test_y, prediction)
print(accuracy_score)
0.8392087523483258
print('Misclassified samples: %d' % (test_y != prediction).sum())
Misclassified samples: 1455
data.shape
(31978, 13)
import pandas as pd
import numpy as np
from scipy.stats import chi2_contingency
import seaborn as sns
tips=sns.load_dataset('tips')
tips.head()
![image](https://private-user-images.githubusercontent.com/142209319/322846817-ee1738f1-e407-493b-a162-dc1ef444d0e4.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ2ODE3LWVlMTczOGYxLWU0MDctNDkzYi1hMTYyLWRjMWVmNDQ0ZDBlNC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0yNDNlZTUzOTE3MTk4NjFhYmQzNTA1YTJhNGNlMzc0OTQ0MTI5N2UwM2Y5MWQxODk1YzBiZTM4YTEwOGUxMTY1JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.07OLer5b_-WbztM9kJyAhroTHFw3FVgOKODlQU9vlEU)
contingency_table=pd.crosstab(tips['sex'],tips['time'])
print(contingency_table)
![image](https://private-user-images.githubusercontent.com/142209319/322847027-51bcc338-6dad-4de6-92c9-bfbbcfaafeb0.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ3MDI3LTUxYmNjMzM4LTZkYWQtNGRlNi05MmM5LWJmYmJjZmFhZmViMC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zOWE3NWZiNDVlODBkNWEwYzIwNGI3NjkxNjM1MWQwY2Q5NzMxNWZmMDRmMWNmOWQ4YjQ5MjE4Mjk1MTA5MmQ1JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.G5RSEfh03OFbrwCxe1rlvEU0-C2vRrOR8C0QEiMJGH0)
chi2, p, _, _ = chi2_contingency(contingency_table)
print(f"Chi-Square Statistic: {chi2}")
print(f"P-value: {p}")
![image](https://private-user-images.githubusercontent.com/142209319/322847205-72746275-4a20-4305-9e80-3bbf1ed0a806.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ3MjA1LTcyNzQ2Mjc1LTRhMjAtNDMwNS05ZTgwLTNiYmYxZWQwYTgwNi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1mOWJiZmJkNzI4MjZjYWRjNjI1ZjU5YjZlNTZjYWZhNGI3MGY4MjJjMjI0ZTJiYzdkYTQzMTAxYWVhYmFkNjZlJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.rB-ncyztgy4xlZr8zIXOKcyDiBZBw72HhSm2p_-zjT0)
import pandas as pd
from sklearn.feature_selection import SelectKBest, mutual_info_classif, f_classif
data={
'Feature1':[1,2,3,4,5],
'Feature2': ['A','B','C','A','B'],
'Feature3':[0,1,1,0,1],
'Target' :[0,1,1,0,1]
}
df=pd.DataFrame(data)
X=df[['Feature1','Feature3']]
y=df['Target']
selector=SelectKBest(score_func=mutual_info_classif, k=1)
X_new = selector.fit_transform (X,y)
selected_feature_indices = selector.get_support(indices=True)
selected_features = X.columns[selected_feature_indices]
print("Selected Features:")
print(selected_features)
![image](https://private-user-images.githubusercontent.com/142209319/322847917-66eac5dc-973c-4cee-833a-21a3ceb5fc61.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIxNzcxNzIsIm5iZiI6MTcyMjE3Njg3MiwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ3OTE3LTY2ZWFjNWRjLTk3M2MtNGNlZS04MzNhLTIxYTNjZWI1ZmM2MS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOFQxNDI3NTJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT02YTA2NzY1NGI2OWRlNWY1YTlmMWI0ZTRiNzc0Yzc5ZGVjMTVkYWZlOWEyY2UzMjZiOTFhZDIzNzQ1YTljNzM5JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.8PyW-oqDO-HgEZyixvOCpy6tUoEmSqKGaZiEquzUMd8)
To read the given data and perform Feature Scaling and Feature Selection process and save the data to a file is successful.