Parameter Tuning

Parameter tuning, often referred to as hyperparameter optimization, is a critical step in machine learning. It involves configuring the settings of a model that are not learned from the data but are fixed before the learning process begins. These settings, known as hyperparameters, can significantly influence the performance of the model. Here are the key aspects of parameter tuning:

1. Understanding Hyperparameters

Hyperparameters are settings or configurations that govern the training process itself; a short sketch after the list below shows how they are typically set in code. For example:

  • Learning rate: Determines how much to change the model in response to the estimated error each time the model weights are updated.
  • Number of trees in a random forest: Controls the number of trees that will be built in the model.
  • Number of layers and neurons in a neural network: Sets the depth and width of the network.
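
To make this concrete, here is a minimal sketch (using scikit-learn; the specific values are illustrative, not recommendations) of how such hyperparameters are fixed when a model object is constructed, before any training takes place:

from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import SGDClassifier
from sklearn.neural_network import MLPClassifier

# Hyperparameters are chosen up front, when the model object is created.
forest = RandomForestClassifier(n_estimators=100)          # number of trees
sgd = SGDClassifier(learning_rate='constant', eta0=0.01)   # constant learning rate of 0.01
mlp = MLPClassifier(hidden_layer_sizes=(64, 32))           # two hidden layers of 64 and 32 neurons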

2. Techniques for Hyperparameter Tuning

Various strategies can be employed to find the optimal combination of hyperparameters:

  • Grid Search: This method involves specifying a set of candidate values for each hyperparameter and trying every possible combination of those values. It’s exhaustive but can be very time-consuming.
  • Random Search: Contrasts with grid search by sampling random combinations of hyperparameters to try (see the sketch after this list). This can be more efficient than grid search, especially when some hyperparameters do not influence performance significantly.
  • Bayesian Optimization: Uses a probabilistic model to predict the performance of different hyperparameters and intelligently chooses which hyperparameters to evaluate next based on past results.
  • Gradient-based Optimization: Some continuous hyperparameters can be tuned with gradient-descent-style methods, though this is less common in practice.
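
As an illustration of random search, scikit-learn's RandomizedSearchCV samples a fixed number of hyperparameter combinations from user-supplied lists or distributions. The following is a minimal sketch; the ranges and number of iterations are illustrative assumptions, not recommendations:

from scipy.stats import loguniform
from sklearn.datasets import load_iris
from sklearn.model_selection import RandomizedSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Candidate values: a continuous distribution for C, a categorical list for the kernel
param_distributions = {
    'C': loguniform(1e-2, 1e2),
    'kernel': ['linear', 'rbf']
}

# Try only 10 randomly sampled combinations rather than every combination
random_search = RandomizedSearchCV(SVC(), param_distributions, n_iter=10,
                                   cv=3, scoring='accuracy', random_state=42)
random_search.fit(X, y)
print("Best parameters:", random_search.best_params_)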

3. Cross-Validation

To reliably assess the performance of a model with a given set of hyperparameters, cross-validation is used. This technique divides the data into several subsets (folds) and trains multiple models, varying which fold is held out as a validation set. This helps ensure that the model performs well across different subsets of the data and reduces the risk of overfitting to a single split.
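
For example, scikit-learn's cross_val_score performs this procedure directly; a minimal sketch:

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# 5-fold cross-validation: the data is split into 5 folds, and each fold
# takes one turn as the held-out validation set
scores = cross_val_score(SVC(C=1.0), X, y, cv=5, scoring='accuracy')
print("Fold accuracies:", scores)
print("Mean accuracy:", scores.mean())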

4. Practical Tools and Libraries

Several Python libraries support hyperparameter tuning:

  • Scikit-learn offers tools like GridSearchCV and RandomizedSearchCV for grid and random search methods.
  • Hyperopt is a library for searching over a user-defined search space and supports Bayesian optimization.
  • Optuna is another modern library for hyperparameter optimization, offering efficient and flexible options for the task; a minimal sketch of its workflow follows this list.
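
As a brief illustration of the Optuna workflow, an objective function is defined and a study optimizes it; Optuna's default sampler performs a TPE-based, Bayesian-style search. This is a minimal sketch, and the parameter ranges and trial count are illustrative assumptions:

import optuna
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

def objective(trial):
    # Sample a candidate hyperparameter combination for this trial
    C = trial.suggest_float('C', 1e-2, 1e2, log=True)
    kernel = trial.suggest_categorical('kernel', ['linear', 'rbf'])
    model = SVC(C=C, kernel=kernel)
    # Score the candidate with 3-fold cross-validated accuracy
    return cross_val_score(model, X, y, cv=3, scoring='accuracy').mean()

study = optuna.create_study(direction='maximize')
study.optimize(objective, n_trials=20)
print("Best parameters:", study.best_params)
print("Best cross-validated accuracy:", study.best_value)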

Example: Using GridSearchCV in Scikit-learn

Here’s a basic example of how to use GridSearchCV for tuning hyperparameters of a support vector machine:

from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC
from sklearn.datasets import load_iris

# Load data
data = load_iris()
X, y = data.data, data.target

# Split data
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=42)

# Define model
model = SVC()

# Define parameter grid
param_grid = {
    'C': [0.1, 1, 10],
    'kernel': ['linear', 'rbf']
}

# Set up the grid search
grid_search = GridSearchCV(model, param_grid, cv=3, scoring='accuracy', verbose=1)

# Perform grid search
grid_search.fit(X_train, y_train)

# Best parameters and best score
print("Best parameters:", grid_search.best_params_)
print("Best score:", grid_search.best_score_)

Code Explanation

Import Necessary Libraries and Modules
  • GridSearchCV: This is a powerful tool from sklearn.model_selection that performs an exhaustive search over specified parameter values for an estimator.
  • SVC: Short for Support Vector Classifier, this class is imported from sklearn.svm and it implements the support vector machine learning algorithm.
  • load_iris: A function from sklearn.datasets that loads the Iris dataset, a classic and easy-to-use dataset for classification.
  • train_test_split: A utility from sklearn.model_selection to split data arrays into two subsets (training and testing datasets).

Load and Prepare Data
  • The Iris dataset is loaded into the variable data.
  • X and y are extracted from data, where X contains the dataset’s features (sepal length, sepal width, petal length, petal width), and y contains the target labels (species of the iris flowers).

Split the Data
  • train_test_split is used to divide the data into training (X_train, y_train) and testing (X_test, y_test) sets. test_size=0.25 specifies that 25% of the data is used as the test set. The random_state=42 parameter ensures that the split is reproducible; the same train-test split will occur every time the script is run.

Define the Model
  • model = SVC(): An instance of the Support Vector Classifier is created with default parameters.

Define the Parameter Grid
  • param_grid defines the parameters to test when tuning the model. It is a dictionary with the keys being the parameters and the values being the settings to test.
    • 'C': [0.1, 1, 10]: The regularization parameter. Lower values create a smoother decision boundary whereas higher values aim to classify all training examples correctly.
    • 'kernel': ['linear', 'rbf']: Specifies the type of kernel to be used in the algorithm. 'linear' uses a linear kernel; 'rbf' uses a radial basis function kernel.

Set Up the Grid Search
  • GridSearchCV is configured with the model, the parameter grid, and the method of cross-validation (cv=3 means three-fold cross-validation). scoring='accuracy' sets the performance measure to accuracy. verbose=1 will display more information during the fitting process.

Perform Grid Search
  • grid_search.fit(X_train, y_train): This command fits the GridSearchCV model with the data. The method goes through all possible combinations of parameter values (as specified in param_grid) and computes the model performance for each combination.

Retrieve Best Parameters and Score
  • After fitting, grid_search.best_params_ provides the best parameter combination found during the search, and grid_search.best_score_ gives the mean cross-validated accuracy achieved with that combination.
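
As a small follow-up sketch, the tuned model can then be evaluated on the held-out test set created earlier. Because GridSearchCV refits the best parameter combination on the full training set by default (refit=True), best_estimator_ is ready to use:

# Evaluate the refit best model on data it has never seen
best_model = grid_search.best_estimator_
print("Test set accuracy:", best_model.score(X_test, y_test))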