[Learn about machine learning from Keras] — 19. Use callbacks.ModelCheckpoint to find the best model

First, let's take a look at how the metrics recorded by model.fit change with different compile settings.
1. Example 1: the default metrics content
from tensorflow.keras.datasets import mnist
from tensorflow.keras.models import Sequential
from tensorflow.keras import layers

# Load MNIST, flatten each 28x28 image into a 784-length vector and scale pixels to [0, 1]
(train_images, train_labels), (test_images, test_labels) = mnist.load_data()
train_images = train_images.reshape((60000, 28 * 28))
train_images = train_images.astype("float32") / 255
test_images = test_images.reshape((10000, 28 * 28))
test_images = test_images.astype("float32") / 255

model = Sequential([
    layers.Dense(512, activation="relu"),
    layers.Dense(10, activation="softmax")
])
# No metrics are specified here, so only the loss will be recorded
model.compile(optimizer="rmsprop",
              loss="sparse_categorical_crossentropy")
historyTrain = model.fit(train_images, train_labels, epochs=1, batch_size=128)
After model.fit completes, check historyTrain.history.keys(); by default there is only one key, "loss".
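For instance, printing the keys after this one-epoch run should show only the loss entry (typical output, not captured from the original run):
print(historyTrain.history.keys())
# dict_keys(['loss'])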

2. Example 2: continuing from Example 1, model.compile specifies additional metrics
model.compile(optimizer="rmsprop",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
After re-running model.fit and checking historyTrain.history.keys(), the setting metrics=["accuracy"] adds an extra observation metric, "accuracy".
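For instance, repeating the fit call from Example 1 after recompiling (a sketch, assuming the same data is still loaded) would now show two keys:
historyTrain = model.fit(train_images, train_labels, epochs=1, batch_size=128)
print(historyTrain.history.keys())
# dict_keys(['loss', 'accuracy'])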

3. Example 3: continuing from Example 2, add the parameter validation_split=0.2 to model.fit
historyTrain = model.fit(train_images, train_labels, epochs=1, batch_size=128, validation_split=0.2)
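Reserving 20% of the training data for validation adds the val_ counterparts of the two metrics, so printing the keys now shows four entries (typical output, assuming the compile call from Example 2):
print(historyTrain.history.keys())
# dict_keys(['loss', 'accuracy', 'val_loss', 'val_accuracy'])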

The three examples above show how the observation metrics are generated. With the resulting four metrics (loss, accuracy, val_loss, and val_accuracy) available, we can use the callbacks.ModelCheckpoint class to declare which metric's best result should be used to save the trained model, which is the topic of this section.
So add callbacks.ModelCheckpoint as follows:
from tensorflow.keras import callbacks
# save_best_only=True keeps only the weights from the epoch where the monitored metric was best
Best_ValAcc_Model = callbacks.ModelCheckpoint(filepath="ModelValAacc", monitor="val_accuracy", mode="max", save_best_only=True)
Best_ValLoss_Model = callbacks.ModelCheckpoint(filepath="ModelValLoss", monitor="val_loss", mode="min", save_best_only=True)
historyTrain = model.fit(train_images, train_labels, epochs=50, batch_size=128, validation_split=0.2,
                         callbacks=[Best_ValAcc_Model, Best_ValLoss_Model])
import matplotlib.pyplot as plt

# Training loss (blue) versus validation loss (red) per epoch
plt.xlabel('epoch', fontsize=12)
plt.ylabel('loss_value', fontsize=12)
plt.plot(historyTrain.history['val_loss'], color='red', label='val_loss')
plt.plot(historyTrain.history['loss'], color='blue', label='loss')
plt.legend()
plt.show()

# Training accuracy (blue) versus validation accuracy (red) per epoch
plt.xlabel('epoch', fontsize=12)
plt.ylabel('accuracy_value', fontsize=12)
plt.plot(historyTrain.history['val_accuracy'], color='red', label='val_accuracy')
plt.plot(historyTrain.history['accuracy'], color='blue', label='accuracy')
plt.legend()
plt.show()
The first ModelCheckpoint monitors "val_accuracy" (mode="max") and is assigned to Best_ValAcc_Model; the other monitors "val_loss" (mode="min") and is assigned to Best_ValLoss_Model. Because save_best_only=True, each checkpoint overwrites its saved file only when the monitored metric improves. Both ModelCheckpoint objects are then passed to the callbacks argument of model.fit. After execution, two folders, ModelValAacc and ModelValLoss, are generated in the program directory:

These two folders hold the model weights from the epoch with the highest val_accuracy and the epoch with the lowest val_loss, respectively.
In this experiment, the best weights were obtained at an epoch somewhere between 10 and 20.
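To check in your own run which epoch actually produced the best validation results, you can look up the extremes in the history dictionary (a small sketch using NumPy):
import numpy as np
best_loss_epoch = np.argmin(historyTrain.history['val_loss']) + 1      # Keras numbers epochs from 1
best_acc_epoch = np.argmax(historyTrain.history['val_accuracy']) + 1
print("lowest val_loss at epoch", best_loss_epoch)
print("highest val_accuracy at epoch", best_acc_epoch)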


Reloading the model and making predictions
How do we retrieve the best model? Here is the sample code:
model.load_weights("./ModelValAacc")

This reads the weights back from the folder just saved. Once they are loaded successfully, you can go on to call the model's predict method.
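A minimal sketch of that prediction step, assuming the test_images and test_labels prepared in Example 1 are still in memory:
import numpy as np

predictions = model.predict(test_images)            # one row of 10 class probabilities per image
predicted_labels = np.argmax(predictions, axis=1)   # most likely digit for each image
print("test accuracy:", np.mean(predicted_labels == test_labels))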