Feature Scaling Techniques¶
Why?¶
- Speeds up convergence in gradient-based algorithms.
- Ensures that no single feature dominates the model due to its scale.
- Improves the performance of regularized models, since L1/L2 penalties treat all coefficients on the same scale.
- Enhances numerical stability.
- Makes model coefficients more interpretable.
Common techniques¶
Min-Max normalisation¶
\[
X' = \frac{X - \min(X)}{\max(X) - \min(X)}
\]
- Values are transformed to the range [0, 1]
- When to use:
  - The data distribution is unknown: min-max normalization makes no assumptions about the distribution, so it is suitable when the data is not Gaussian (normal); note, however, that it is sensitive to outliers (see disadvantages below)
  - The algorithm is sensitive to the scale of the data, e.g. KNN, SVM, K-Means, PCA, neural networks
\[
X' = a + (b - a) \cdot \frac{X - \min(X)}{\max(X) - \min(X)}
\]
- Values are transformed to the range [a, b]
- Advantages:
  - Simple interpretation: transforms data to a uniform range
  - Retains relationships: does not distort the shape or distribution of the data
  - Improved convergence: for gradient-based models like neural networks, scaled features help avoid exploding or vanishing gradients
- Disadvantages:
  - Sensitive to outliers: since min-max normalization uses the minimum and maximum values of a feature, a single extreme value can compress all other values into a narrow band; if outliers are present, consider robust alternatives (e.g., standardization or robust scaling)
  - Feature dependence: if new data is added, \(\min(X)\) and \(\max(X)\) might change, requiring re-scaling
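As a minimal sketch of both formulas above (assuming scikit-learn is available; the data values are made up for illustration), `MinMaxScaler` implements the \([0, 1]\) mapping by default and the general \([a, b]\) mapping via its `feature_range` parameter:

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

# Toy feature matrix with two columns on very different scales (hypothetical values)
X = np.array([[1.0, 200.0],
              [2.0, 300.0],
              [3.0, 600.0]])

# Default: map each column to [0, 1]
scaler = MinMaxScaler()
X_01 = scaler.fit_transform(X)

# General form: map each column to [a, b], here [-1, 1]
scaler_ab = MinMaxScaler(feature_range=(-1, 1))
X_ab = scaler_ab.fit_transform(X)

# Manual equivalent of the [0, 1] formula, computed per column
X_manual = (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0))
assert np.allclose(X_01, X_manual)
```

Note that `fit_transform` learns \(\min(X)\) and \(\max(X)\) from the data it is given; new data should be passed through `transform` so the same bounds are reused, which is exactly why the feature-dependence caveat above can force a re-fit.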
Z-score Normalisation or Standardization¶
\[
X' = \frac{X - \mu}{\sigma}
\]
where \(\mu\) is the feature mean and \(\sigma\) its standard deviation.
- Values are centred around mean 0 with a standard deviation of 1
- When to use:
  - The data follows a Gaussian distribution (algorithms like linear and logistic regression and LDA assume normally distributed features)
  - Outlier robustness is needed (less sensitive to outliers than min-max normalization)
  - The algorithm is sensitive to feature magnitude (PCA, SVM, KNN, gradient-descent-based models)
- Advantages:
  - Handles data centering and scaling: centers data around zero and adjusts for varying feature magnitudes
  - Improved performance for distance-based models: works well for models that rely on distances or gradients
  - Less sensitive to outliers: compared to min-max normalization, z-score scaling is less affected by outliers
- Disadvantages:
  - Not ideal for non-Gaussian data: if the data does not follow a normal distribution, z-score scaling might not be as effective as other methods (e.g., robust scaling)
  - Outliers can still influence the result: while more robust than min-max normalization, extreme outliers can still skew the mean and standard deviation
  - Dependent on the data: like min-max normalization, if new data is added, the mean and standard deviation might change, requiring re-scaling
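A matching sketch for standardization (again assuming scikit-learn; the data values are hypothetical), using `StandardScaler`:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Hypothetical feature matrix
X = np.array([[1.0, 200.0],
              [2.0, 300.0],
              [3.0, 600.0]])

scaler = StandardScaler()
X_std = scaler.fit_transform(X)  # per column: (X - mean) / std

# Each column now has mean ~0 and unit standard deviation
print(X_std.mean(axis=0))  # approximately [0., 0.]
print(X_std.std(axis=0))   # approximately [1., 1.]

# Manual equivalent (StandardScaler uses the population std, ddof=0)
X_manual = (X - X.mean(axis=0)) / X.std(axis=0)
assert np.allclose(X_std, X_manual)
```

In a train/test split, fit the scaler on the training data only and call `transform` on the test data, so the test set does not leak into \(\mu\) and \(\sigma\); the same data-dependence caveat applies as for min-max normalization.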