You're reading from scikit-learn Cookbook Over 80 recipes for machine learning in Python with scikit-learn

Product type Paperback

Published in Dec 2025

Publisher Packt

ISBN-13 9781836644453

Length 388 pages

Edition 3rd Edition

Languages

Python

Tools

Scikit-learn

Concepts

Machine Learning

Author (1):

John Sukup

Table of Contents (17) Chapters

Preface

1. Chapter 1: Common Conventions and API Elements of scikit-learn

2. Chapter 2: Pre-Model Workflow and Data Preprocessing

3. Chapter 3: Dimensionality Reduction Techniques

4. Chapter 4: Building Models with Distance Metrics and Nearest Neighbors

5. Chapter 5: Linear Models and Regularization

6. Chapter 6: Advanced Logistic Regression and Extensions

7. Chapter 7: Support Vector Machines and Kernel Methods

8. Chapter 8: Tree-Based Algorithms and Ensemble Methods

9. Chapter 9: Text Processing and Multiclass Classification

10. Chapter 10: Clustering Techniques

11. Chapter 11: Novelty and Outlier Detection

12. Chapter 12: Cross-Validation and Model Evaluation Techniques

13. Chapter 13: Deploying scikit-learn Models in Production

14. Chapter 14: Unlock Your Exclusive Benefits

Unlock this Book’s Free Benefits in 3 Easy Steps

15. Index

Why subscribe?

16. Other Books You May Enjoy

Distance Metrics Overview

Distance metrics measure the similarity or dissimilarity between data points and underpin many ML algorithms, including KNN. The choice of metric can significantly influence model performance because it affects how data points are classified and how clusters are formed, so it is worth getting comfortable with more than the Euclidean distance that most early data science practitioners default to. This recipe compares how different distance metrics behave on datasets with different properties.
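To make the idea concrete, here is a minimal sketch of how three common metrics score the same pair of points. The points are made-up values chosen for easy arithmetic, and the choice of Manhattan and Chebyshev as the alternatives to Euclidean is an assumption for illustration, not necessarily the set this recipe goes on to use.

```python
import numpy as np
from sklearn.metrics import pairwise_distances

# Two illustrative 2D points (hypothetical values, not from the recipe)
a = np.array([[0.0, 0.0]])
b = np.array([[3.0, 4.0]])

# Compute the distance between the same pair under each metric
metrics = ["euclidean", "manhattan", "chebyshev"]
distances = {
    m: float(pairwise_distances(a, b, metric=m)[0, 0]) for m in metrics
}

for name, d in distances.items():
    print(f"{name:>10}: {d:.1f}")
# euclidean: 5.0  (straight-line length, sqrt(3^2 + 4^2))
# manhattan: 7.0  (sum of absolute coordinate differences, 3 + 4)
# chebyshev: 4.0  (largest single coordinate difference, max(3, 4))
```

The same two points are 4, 5, or 7 units apart depending on the metric, which is why the nearest neighbors of a query point can change when the metric does.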

Getting ready

We’ll create two new datasets for illustrating the differences between distance metrics using scikit-learn’s built-in make_circles() function.

Load libraries:

```
import matplotlib.pyplot as plt
import numpy as np
from sklearn.datasets import make_circles
from sklearn.model_selection import train_test_split
```

Create two synthetic datasets that highlight metric differences:
```
n_samples ...
```
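The recipe's own dataset code is cut off in this excerpt, so the following is only a rough sketch of what such a comparison could look like, not the author's code. Every parameter value here (`n_samples=500`, the two `noise` levels, `factor=0.5`, `n_neighbors=5`, the 70/30 split) and the use of `KNeighborsClassifier` are assumptions made for illustration.

```python
from sklearn.datasets import make_circles
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical settings: the recipe's actual values are not shown in the excerpt.
datasets = {
    "low noise": make_circles(
        n_samples=500, noise=0.05, factor=0.5, random_state=42
    ),
    "high noise": make_circles(
        n_samples=500, noise=0.20, factor=0.5, random_state=42
    ),
}

results = {}
for name, (X, y) in datasets.items():
    # Hold out 30% of each dataset for evaluation
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, random_state=42
    )
    for metric in ("euclidean", "manhattan", "chebyshev"):
        # Fit the same KNN model, changing only the distance metric
        knn = KNeighborsClassifier(n_neighbors=5, metric=metric)
        knn.fit(X_train, y_train)
        results[(name, metric)] = knn.score(X_test, y_test)

for (name, metric), accuracy in results.items():
    print(f"{name:>10} | {metric:>9} | test accuracy = {accuracy:.3f}")
```

Holding the model and the split fixed while varying only `metric` isolates the effect of the distance function on each dataset.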
