To read the given data, perform feature scaling and feature selection, and save the data to a file.
STEP 1: Read the given data.

STEP 2: Clean the data set using the data cleaning process.

STEP 3: Apply feature scaling to the features in the data set.

STEP 4: Apply feature selection to the features in the data set.

STEP 5: Save the data to a file.
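The five steps above can be sketched end to end as follows. This is only an illustrative outline, not the exact procedure used in this exercise: the inline CSV text, the column names (`Height`, `Weight`, `Age`, `Label`), the choice of `StandardScaler` and `SelectKBest`, and the output file name `processed.csv` are all assumptions made for the example.

```python
import io
import os
import tempfile

import pandas as pd
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.preprocessing import StandardScaler

# Hypothetical stand-in for an input file; the columns are illustrative only.
raw_csv = io.StringIO(
    "Height,Weight,Age,Label\n"
    "170,65,30,0\n"
    "160,,25,0\n"
    "180,90,41,1\n"
    "175,85,38,1\n"
    "165,55,22,0\n"
    "185,95,45,1\n"
)

# STEP 1: read the data.
df = pd.read_csv(raw_csv)

# STEP 2: clean the data (here: drop rows that contain missing values).
df = df.dropna().reset_index(drop=True)

# STEP 3: scale the numeric features.
feature_cols = ["Height", "Weight", "Age"]
df[feature_cols] = StandardScaler().fit_transform(df[feature_cols])

# STEP 4: keep the k features with the highest ANOVA F-score against the label.
selector = SelectKBest(score_func=f_classif, k=2)
selector.fit(df[feature_cols], df["Label"])
selected = [c for c, keep in zip(feature_cols, selector.get_support()) if keep]

# STEP 5: save the selected, scaled features and the label to a file.
out = df[selected + ["Label"]]
out_path = os.path.join(tempfile.gettempdir(), "processed.csv")  # hypothetical location
out.to_csv(out_path, index=False)
print(out)
```

Writing with `index=False` keeps the row index out of the saved file, so reading it back with `pd.read_csv` returns the same columns.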
- Standard Scaler: It is also called Z-score normalization. It calculates the z-score of each value and replaces the value with the calculated z-score. The features are then rescaled to have mean x̄ = 0 and standard deviation σ = 1.
- MinMaxScaler: It is also referred to as Normalization. Each feature is rescaled to the range 0 to 1 by computing (x - min) / (max - min). Unlike Standardization, the mean is not fixed at 0; it falls somewhere between 0 and 1 depending on the data.
- Maximum absolute scaling: Maximum absolute scaling divides every observation by the maximum absolute value of the variable. The resulting values lie within the range -1 to 1.
- RobustScaler: RobustScaler transforms the feature vector by subtracting the median and then dividing by the interquartile range (75th percentile minus 25th percentile), which makes it less sensitive to outliers.
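The four scalers described above can be checked against their formulas with a few lines of NumPy. The sketch below uses a small made-up column (one value is a deliberate outlier) and compares each hand-computed result with the corresponding scikit-learn transformer at its default settings; the array values are hypothetical.

```python
import numpy as np
from sklearn.preprocessing import (
    MaxAbsScaler,
    MinMaxScaler,
    RobustScaler,
    StandardScaler,
)

# Illustrative single-feature column (hypothetical values, one outlier).
x = np.array([[1.0], [2.0], [3.0], [4.0], [100.0]])

# Standardization: subtract the mean, divide by the population standard deviation.
z_manual = (x - x.mean(axis=0)) / x.std(axis=0)

# Min-max scaling: map the minimum to 0 and the maximum to 1.
mm_manual = (x - x.min(axis=0)) / (x.max(axis=0) - x.min(axis=0))

# Max-abs scaling: divide by the largest absolute value.
ma_manual = x / np.abs(x).max(axis=0)

# Robust scaling: subtract the median, divide by the interquartile range.
q1, median, q3 = np.percentile(x, [25, 50, 75], axis=0)
rb_manual = (x - median) / (q3 - q1)

# Compare each manual result with the scikit-learn transformer.
checks = {
    "StandardScaler": (z_manual, StandardScaler().fit_transform(x)),
    "MinMaxScaler": (mm_manual, MinMaxScaler().fit_transform(x)),
    "MaxAbsScaler": (ma_manual, MaxAbsScaler().fit_transform(x)),
    "RobustScaler": (rb_manual, RobustScaler().fit_transform(x)),
}
for name, (manual, from_sklearn) in checks.items():
    print(name, "matches formula:", np.allclose(manual, from_sklearn))
```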
Feature selection finds the best set of features for building useful models. Selecting the best features helps the model perform well. The feature selection techniques used are:
1. Filter Method
2. Wrapper Method
3. Embedded Method
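As a rough illustration of how the three families differ, the sketch below applies one representative of each to a synthetic data set. The specific choices are assumptions made for the example, not the methods prescribed by this exercise: `SelectKBest` with `f_classif` stands in for a filter method, `RFE` around a logistic regression for a wrapper method, and `SelectFromModel` around a random forest for an embedded method.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE, SelectFromModel, SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression

# Synthetic data: 8 features, of which 3 carry signal (illustrative only).
X, y = make_classification(
    n_samples=200,
    n_features=8,
    n_informative=3,
    n_redundant=0,
    random_state=0,
)

# Filter: score each feature independently of any model, keep the top k.
filter_sel = SelectKBest(score_func=f_classif, k=3).fit(X, y)

# Wrapper: repeatedly fit a model and discard the weakest feature.
wrapper_sel = RFE(LogisticRegression(max_iter=1000), n_features_to_select=3).fit(X, y)

# Embedded: use importances learned while the model itself is trained.
embedded_sel = SelectFromModel(
    RandomForestClassifier(n_estimators=100, random_state=0)
).fit(X, y)

for name, sel in [
    ("Filter", filter_sel),
    ("Wrapper", wrapper_sel),
    ("Embedded", embedded_sel),
]:
    print(name, "selected column indices:", sel.get_support(indices=True))
```

Filter methods are the cheapest because no model is trained; wrapper methods cost the most because the model is refit many times; embedded methods sit in between, since selection falls out of a single training run.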
import pandas as pd
from scipy import stats
import numpy as np
df=pd.read_csv("/content/bmi.csv")
df.head()
![image](https://private-user-images.githubusercontent.com/142209319/322836895-2aea6717-ef53-42bb-b94c-5228a301c51b.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM2ODk1LTJhZWE2NzE3LWVmNTMtNDJiYi1iOTRjLTUyMjhhMzAxYzUxYi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1lOTU4MWU4NDMyZTlmYTI3MzUxNTM3OTNmZjhjY2VlZDkwMjQ3ZDc4YWU0ZjAzYmFhZmIyMzAyZDQzMWQ4MGYzJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.Bt4VmVmvAlkrvKrO6anDuwaEugy8Ic9Zu7kroAZ1jK8)
df=df.dropna()  # assign the result; dropna() does not modify df in place
df
![image](https://private-user-images.githubusercontent.com/142209319/322837093-8b146d52-1bff-4afe-95ad-1fd1b34c1890.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM3MDkzLThiMTQ2ZDUyLTFiZmYtNGFmZS05NWFkLTFmZDFiMzRjMTg5MC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1mYThjOWI3MjMxYzM1Y2MxMDA0MzdhOTU2ZDVlNWJlMTRkY2FlNjQxMTQwNzNiODI3MTNkNjRmYzM5M2VmNjU1JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.54UhZy6dLNrYui35aEFDTNO16dPm7RZiDguGQnVX8vk)
max_vals=df[['Height','Weight']].abs().max()  # per-column maximum absolute value
max_vals
![image](https://private-user-images.githubusercontent.com/142209319/322837681-76a9d44b-b7de-43b0-b4c5-4fa2369743cd.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM3NjgxLTc2YTlkNDRiLWI3ZGUtNDNiMC1iNGM1LTRmYTIzNjk3NDNjZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0wZGUzNzNiYjM5OWQxMzYzODgyOGEwMDViMGFiNjEyMTM5M2Q4NzkwZTFjNTk3MWM1ZWU1MDI0YTYxMDg0YWJjJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.4Ji3CUy9MwzUPFhsq0eYLc04-3IFj9GOkLuSehBqOJg)
from sklearn.preprocessing import StandardScaler
sc=StandardScaler()
df[['Height','Weight']]=sc.fit_transform(df[['Height','Weight']])
df.head(10)
![image](https://private-user-images.githubusercontent.com/142209319/322838096-c1e1b5ab-3592-4f38-9696-9e394eb38a08.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM4MDk2LWMxZTFiNWFiLTM1OTItNGYzOC05Njk2LTllMzk0ZWIzOGEwOC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT04ODg0MGY2ZTk4Y2VlZGU4OGY1NDNiNTdiZWU3OWFhYTkxMjAwZjZjODhjMDUwMTI0ZTgyMzM5MTRjYjMwYTU2JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.6M_0tkDDcPrqQxsZnyT2x0x6bVNw7THdhGoWaANHLSk)
from sklearn.preprocessing import MinMaxScaler
scaler=MinMaxScaler()
df[['Height','Weight']]=scaler.fit_transform(df[['Height','Weight']])
df.head(10)
![image](https://private-user-images.githubusercontent.com/142209319/322838526-cc1b1e0b-0a00-455f-b3aa-2820d2eaea4c.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM4NTI2LWNjMWIxZTBiLTBhMDAtNDU1Zi1iM2FhLTI4MjBkMmVhZWE0Yy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT02OGI2MzdjYWZiMWJiZjg4N2E1OGUxNGE0YjU4MjI0ZDdhZDQxNTdiNjY2OTRkZDU3MGVjODM4ZGE3ZmNjYmYwJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.aW1GH66f_fw1hxyIujCvuKsW69deUq-wJhY7ava3wUk)
from sklearn.preprocessing import Normalizer
scaler=Normalizer()
df[['Height','Weight']]=scaler.fit_transform(df[['Height','Weight']])
df
![image](https://private-user-images.githubusercontent.com/142209319/322839820-d55c9d08-42a1-4106-9fb0-dad172a28cfd.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODM5ODIwLWQ1NWM5ZDA4LTQyYTEtNDEwNi05ZmIwLWRhZDE3MmEyOGNmZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kODg1MTBkY2I2NWY5NmY0MDIxMGQ5YTNjNWEyYmRkZTBjNjAyNWI1MWJmZWI4ZGM1ZGZiOTFjYjg1ZGQ1YWM0JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.gn6d33CObWjuFEuFfnXjNu1ifoC9kt7ToQxK-D42zww)
df1=pd.read_csv("/content/bmi.csv")
from sklearn.preprocessing import MaxAbsScaler
scaler=MaxAbsScaler()
df1[['Height','Weight']]=scaler.fit_transform(df1[['Height','Weight']])
df1
![image](https://private-user-images.githubusercontent.com/142209319/322840052-1a21a595-3a30-43de-9dd9-abc59ff86dd8.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQwMDUyLTFhMjFhNTk1LTNhMzAtNDNkZS05ZGQ5LWFiYzU5ZmY4NmRkOC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zZGNkZDQ2MWUwN2M2ODU2NTU2OTI0ZTcxZTBiZTY3MjkwZDUwNzM4ZDlmNTVkNTVlMDMyY2RiNjUxYmY3ZDhjJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.nLDNrtTu1rkk0FkL5gcdk8gZKJjZnZusuW0utp_BSdk)
df2=pd.read_csv("/content/bmi.csv")
from sklearn.preprocessing import RobustScaler
scaler=RobustScaler()
df2[['Height','Weight']]=scaler.fit_transform(df2[['Height','Weight']])
df2.head()
![image](https://private-user-images.githubusercontent.com/142209319/322840324-df58bc4e-e7c9-49d4-ba54-c61123297ee1.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQwMzI0LWRmNThiYzRlLWU3YzktNDlkNC1iYTU0LWM2MTEyMzI5N2VlMS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1mNTc1NDM1MzhkOTRiMWIyYWU2MTJhOTYzY2VjZjUyOTU4ODAxNTM2MDBhODQ5YzlkMjg1MDU2MmRjZGJkZWIyJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.99dEa6ALVD5dqJS50IZtH_MGiiitadd-TSI320iD5Vg)
import pandas as pd
import numpy as np
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score,confusion_matrix
data=pd.read_csv('/content/income(1) (1).csv',na_values=[" ?"])
data
![image](https://private-user-images.githubusercontent.com/142209319/322840753-1eb413c2-b5a1-4d89-a654-d96a5cbc64b0.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQwNzUzLTFlYjQxM2MyLWI1YTEtNGQ4OS1hNjU0LWQ5NmE1Y2JjNjRiMC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1lYTM1MmU3NTU2ZDIyOTNjMWM4ZWEwZTZiNDgyNmZkOWMyMzY1YWM5ZGVlNGQzZDJjNGFhNzczNmYxMjZmMWFiJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.h1fV9CAxCP3zHiUfj5eAFh90Rj_pRcyiHIEd5P7D41U)
data.isnull().sum()
![image](https://private-user-images.githubusercontent.com/142209319/322840962-9a0b6c2c-4c0c-46b9-91ad-475f04ce3562.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQwOTYyLTlhMGI2YzJjLTRjMGMtNDZiOS05MWFkLTQ3NWYwNGNlMzU2Mi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT02ZGM0ZDBiYWVkNjMyZTlkZTM0YTE4NTAyZjVkYzFlZjVjNjUwMGRmZmMwN2M2ZGJhYmYyOWMwMTA1ZDQ3MDFmJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.vuDoged_6O25bzXrIyETSbMRExl1HP5KmiTtAyawft0)
missing=data[data.isnull().any(axis=1)]
missing
![image](https://private-user-images.githubusercontent.com/142209319/322841183-2b5bbfb3-9315-472c-a097-8adee8f4578f.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQxMTgzLTJiNWJiZmIzLTkzMTUtNDcyYy1hMDk3LThhZGVlOGY0NTc4Zi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0wNGJkMjNiNTNiMjIzNGMwNDEwYzdmN2MwODY3NjNhY2Y1OWU5ZTJlNjRmZDhiNzcwNDM5NGViYmNhMzMwYTU3JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.mKMcxovY4xqmLIx1GlXQF6NZGrAfU9rLqlsmecMuZwA)
data2 = data.dropna(axis=0).copy()  # copy so later column assignments do not warn
data2
![image](https://private-user-images.githubusercontent.com/142209319/322841391-e0209e9d-dfa7-4245-8d6a-6c695e6728bc.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQxMzkxLWUwMjA5ZTlkLWRmYTctNDI0NS04ZDZhLTZjNjk1ZTY3MjhiYy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zNTM0OTI2OTgyNjdhZjNkMGE0ODYzNDU1OGI2YmYyMDNkNTcxN2MwMzg3ZDJmNDY1OWEzODJlY2FlYmMyZjk0JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.9bjFCieq7zQrIfx4iNAFHnAI7zYP3C5KE-t0EtTz-m4)
sal=data['SalStat']
data2['SalStat']=data2['SalStat'].map({' less than or equal to 50,000':0,' greater than 50,000':1})
print(data2['SalStat'])
![image](https://private-user-images.githubusercontent.com/142209319/322841611-c3919808-6ed9-432e-b83e-9aff94a35994.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQxNjExLWMzOTE5ODA4LTZlZDktNDMyZS1iODNlLTlhZmY5NGEzNTk5NC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kNDVmZmJmNTdhZTgwOTJmOGQ1OGJiNmNjN2FhZmVhMGNmNTYxOWE0NmM0NjY1M2RhZDUyNDcyYzFhM2Q5YjE2JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.y_j54z6FxVK4MINuYgTRHeO3HusxVu4iogqwC8Cwtqo)
sal2=data2['SalStat']
dfs=pd.concat([sal,sal2],axis=1)
dfs
![image](https://private-user-images.githubusercontent.com/142209319/322841890-a2ce5564-7e32-481d-9547-e5943afb5c15.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQxODkwLWEyY2U1NTY0LTdlMzItNDgxZC05NTQ3LWU1OTQzYWZiNWMxNS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT04YTgwMzA5MzA3MzAxZWY1MDM5ODFhZmQzNjVjZWNhMjQ2YWI4MTk3YzE4NmU2NDk2MTNmMGQwYzU4YjVlZGU2JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.1i3YmLnmv0xCh5b6T6jm7F1m4dYoBaDPPbYn5_0gymc)
data2
![image](https://private-user-images.githubusercontent.com/142209319/322842106-a4fc1de3-8051-4743-81c6-c054bcb95f7d.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQyMTA2LWE0ZmMxZGUzLTgwNTEtNDc0My04MWM2LWMwNTRiY2I5NWY3ZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1iMTkzNTI3MTllYmQ3ZTg3OGNiNjQ1YmYzYzhkNGMwNTU2MTQzMzI3MjJkNmNmNTJkMjY5ZWU1MjlmOGEyMjc3JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.YCj_6vSJrCZ2r3H_wTeg7ySJgH_NafkboGxMhYwVgR0)
new_data=pd.get_dummies(data2, drop_first=True)
new_data
![image](https://private-user-images.githubusercontent.com/142209319/322843145-7eef042e-6822-4ac5-a574-b19476d68a9a.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQzMTQ1LTdlZWYwNDJlLTY4MjItNGFjNS1hNTc0LWIxOTQ3NmQ2OGE5YS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1jMDYzNTEzNzBmNGI2YjUxMjY4OGQ2NzkzYmMwMGZhNTBjMTY1YTNjNGVkZDgzOTk3ZTE3ODlhNjBjYjJhNjcwJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.fv5oT7NA44-CPX922i6GM2NB-jgOpPuQSs-p2Drnkhs)
columns_list=list(new_data.columns)
print(columns_list)
![image](https://private-user-images.githubusercontent.com/142209319/322844071-2958db03-4ee4-4d69-8a09-2154b7756727.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ0MDcxLTI5NThkYjAzLTRlZTQtNGQ2OS04YTA5LTIxNTRiNzc1NjcyNy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT00Njc3YmI5NzQ3OWY3NjYxOWQ2ZjA1NjY5Y2JhN2FjMzEyODk4NGZkODUxM2YxNDE0MjliNGM3Mzk0ZTgyOWIwJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.sbqMMNTSRZALZkwQbg7qvCWQQNIiPHqRI4wye54B90E)
features=list(set(columns_list)-set(['SalStat']))
print(features)
![image](https://private-user-images.githubusercontent.com/142209319/322844567-fd4a58e0-39ef-426f-89d3-ff7af0fffd98.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ0NTY3LWZkNGE1OGUwLTM5ZWYtNDI2Zi04OWQzLWZmN2FmMGZmZmQ5OC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0xM2ExNWZkZTk2YzA1ODJlZWQyNjBiMDkxNTAzZmExNzI1Y2VlZTcyNWIyYjAzYjBiNTlmNmRkMjFiNzEzYzcwJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.zE4GYoe85RVSepO5sxMluawlEAQDlR5q67lUB3MqxR4)
y=new_data['SalStat'].values
print(y)
[0 0 1 ... 0 0 0]
x = new_data[features].values
print(x)
![image](https://private-user-images.githubusercontent.com/142209319/322844918-6ad70fd2-6946-4995-be0e-801b3b2f0212.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ0OTE4LTZhZDcwZmQyLTY5NDYtNDk5NS1iZTBlLTgwMWIzYjJmMDIxMi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1lMmFmYjEwMDQ0OTRhZDM3ZjYxMTNlZGI3YTY3MzVmOWE4M2M3MjZlMDZmMTEyNmU1MDBhYjkxMDM5NzE3Y2Q4JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.SRx-Ij1-FoUa-cA1fhTUtu_1CE6gJfGSFid48g1awK4)
train_x,test_x,train_y,test_y = train_test_split(x,y,test_size=0.3, random_state=0)
KNN_classifier=KNeighborsClassifier(n_neighbors = 5)
KNN_classifier.fit(train_x,train_y)
![image](https://private-user-images.githubusercontent.com/142209319/322845594-4c854a00-85d9-4d76-b57e-292d13ba84ee.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ1NTk0LTRjODU0YTAwLTg1ZDktNGQ3Ni1iNTdlLTI5MmQxM2JhODRlZS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0wYzc4Y2M1NTkzMGZjYWVhM2Q4MzdlMDVmZmVkNjFiYzkxOTVlYWFhY2ExMGM5ZGVhNDI5ZDJlZTc5YWE5YmQ3JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.0KqPES3D8jWd5x4RdUqvDjuiVkpWaz9yn3h8lXHiYiw)
prediction = KNN_classifier.predict(test_x)
confusionMatrix = confusion_matrix(test_y, prediction)
print(confusionMatrix)
![image](https://private-user-images.githubusercontent.com/142209319/322845909-2aae088f-279f-4f33-8137-b712dc580d9e.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ1OTA5LTJhYWUwODhmLTI3OWYtNGYzMy04MTM3LWI3MTJkYzU4MGQ5ZS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT00ZjI1Y2ZhMDU4Yzk2M2E2YzMyYjMwMmEyMzYyMWRlNDMwNDZlYjExMzdiZmE1N2RhMmVjOGNlMWI3ZTBjNTk2JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.ZODZUp503Khkq4emvhgkfoKZ7W52FiAmPQ-4s0uDqS4)
accuracy=accuracy_score(test_y, prediction)  # separate name avoids shadowing the imported function
print(accuracy)
0.8392087523483258
print('Misclassified samples: %d' % (test_y != prediction).sum())
Misclassified samples: 1455
data.shape
(31978, 13)
import pandas as pd
import numpy as np
from scipy.stats import chi2_contingency
import seaborn as sns
tips=sns.load_dataset('tips')
tips.head()
![image](https://private-user-images.githubusercontent.com/142209319/322846817-ee1738f1-e407-493b-a162-dc1ef444d0e4.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ2ODE3LWVlMTczOGYxLWU0MDctNDkzYi1hMTYyLWRjMWVmNDQ0ZDBlNC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1mMDI1MmNjZTZiMjk0ZmYyNDczYWQ1OTE2YzA5ZGY4OTczNDM0OTMwNmE5YzM2OWM0ODUyOTNhZmU2ZGI3YjFiJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.ont6hgXjrgV6ngU3QPkgI02IlNcBqaB2iWQYmyY1RWM)
contingency_table=pd.crosstab(tips['sex'],tips['time'])
print(contingency_table)
![image](https://private-user-images.githubusercontent.com/142209319/322847027-51bcc338-6dad-4de6-92c9-bfbbcfaafeb0.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ3MDI3LTUxYmNjMzM4LTZkYWQtNGRlNi05MmM5LWJmYmJjZmFhZmViMC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1iZDk0OWNlYzdhYTNhN2U4OTg3ODcyZDJmMmQ4NDgxNDM1YmM4MzE4NDc3Y2M2MDg1NGVmNmFkNWVlZTQ5Y2IyJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.2-Vzz6qtOEI3llD3unvT0vxxeq9RgEtvVNYODSjRYiw)
chi2, p, _, _ = chi2_contingency(contingency_table)
print(f"Chi-Square Statistic: {chi2}")
print(f"P-value: {p}")
![image](https://private-user-images.githubusercontent.com/142209319/322847205-72746275-4a20-4305-9e80-3bbf1ed0a806.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ3MjA1LTcyNzQ2Mjc1LTRhMjAtNDMwNS05ZTgwLTNiYmYxZWQwYTgwNi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT03NTcwNjhkMmM1NTZkOTcxYjZiMzkxMmNlZjkxZWNmYWU2NmMwZDA4ZWUyZTUzYTA4MjIwMmI3MmE4Y2Q5M2Q2JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.AW0NV7T_PlQMq7JQV8AWKViqZezhOUn0X__Ki1K1ILc)
import pandas as pd
from sklearn.feature_selection import SelectKBest, mutual_info_classif, f_classif
data={
'Feature1':[1,2,3,4,5],
'Feature2': ['A','B','C','A','B'],
'Feature3':[0,1,1,0,1],
'Target' :[0,1,1,0,1]
}
df=pd.DataFrame(data)
X=df[['Feature1','Feature3']]
y=df['Target']
selector=SelectKBest(score_func=mutual_info_classif, k=1)
X_new = selector.fit_transform(X,y)
selected_feature_indices = selector.get_support(indices=True)
selected_features = X.columns[selected_feature_indices]
print("Selected Features:")
print(selected_features)
![image](https://private-user-images.githubusercontent.com/142209319/322847917-66eac5dc-973c-4cee-833a-21a3ceb5fc61.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjIyMTI5OTQsIm5iZiI6MTcyMjIxMjY5NCwicGF0aCI6Ii8xNDIyMDkzMTkvMzIyODQ3OTE3LTY2ZWFjNWRjLTk3M2MtNGNlZS04MzNhLTIxYTNjZWI1ZmM2MS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzI5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcyOVQwMDI0NTRaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kODUyNTc0ODY4NDYwNDE4NDdjMGMwYWU3MDExZDk2YmUwOWNkNzRlMGY5MGM2YTlmZDYwNDQ4OTAzNDBkNWZiJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.yRiSKtv_jKQ7LVwpaAir1zXfRezF_GGjfIo4B7055Is)
Thus, reading the given data, performing feature scaling and feature selection, and saving the data to a file was completed successfully.