1 00:00:01,020 --> 00:00:07,500 In the last couple of lectures, we learned how to train and create our models. 2 00:00:09,090 --> 00:00:17,370 One thing you may have noticed is that each time we opened the Jupyter notebook, we started creating and 3 00:00:17,370 --> 00:00:20,100 training our models from scratch. 4 00:00:21,500 --> 00:00:23,480 But we don't have to do that. 5 00:00:24,540 --> 00:00:29,410 There is a very simple way to save your model after you have trained it. 6 00:00:30,810 --> 00:00:38,760 Keras will save both the model architecture, including every layer and its parameters, and the values 7 00:00:38,760 --> 00:00:45,000 of all the model parameters for each layer, that is, the biases and weights, 8 00:00:45,040 --> 00:00:45,420 as well. 9 00:00:47,880 --> 00:00:54,690 So instead of training the model each time you open your Jupyter notebook or initiate a new session, 10 00:00:55,260 --> 00:01:02,890 you can just load the model that you have previously saved to get all the details of that 11 00:01:02,910 --> 00:01:03,270 model. 12 00:01:05,440 --> 00:01:07,630 Now, saving a model is very easy. 13 00:01:07,990 --> 00:01:10,070 You just have to write your model name. 14 00:01:11,370 --> 00:01:18,360 So in the last lecture, we created a model with the functional API, and the object name was model. 15 00:01:18,630 --> 00:01:24,320 So I'm saving that same object, so I can write model.save. 16 00:01:24,390 --> 00:01:25,370 Save is the keyword. 17 00:01:26,100 --> 00:01:31,800 And in the parentheses, you have to write the name that you want to give to that file. 18 00:01:33,180 --> 00:01:40,710 All these models are saved in the HDF5 format; the extension of the file is .h5. 19 00:01:42,660 --> 00:01:50,590 So if you just run this, you have saved your model in your present working directory. 20 00:01:52,590 --> 00:02:03,140 So if I open my directory, this is it, and you can see my model is here.
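The saving step just described can be sketched end to end. This is a minimal sketch, not the lecture's exact notebook: the toy data, the layer sizes, and the file name func_model.h5 are illustrative placeholders.

```python
import numpy as np
from tensorflow import keras

# Toy regression data standing in for the lecture's dataset.
X = np.random.rand(100, 4).astype("float32")
y = X.sum(axis=1)

# A small model built with the functional API, as in the previous lecture.
inputs = keras.Input(shape=(4,))
hidden = keras.layers.Dense(8, activation="relu")(inputs)
outputs = keras.layers.Dense(1)(hidden)
model = keras.Model(inputs, outputs)

model.compile(optimizer="adam", loss="mse")
model.fit(X, y, epochs=2, verbose=0)

# Saving with a .h5 name writes the architecture plus all weights and
# biases in HDF5 format into the present working directory.
model.save("func_model.h5")
```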
21 00:02:03,570 --> 00:02:10,860 And the name of the model, func_model, is the same as I have mentioned. 22 00:02:12,480 --> 00:02:15,360 You can see your present working directory with this 23 00:02:15,360 --> 00:02:15,750 command. 24 00:02:16,220 --> 00:02:16,750 You just have to 25 00:02:16,770 --> 00:02:17,020 write 26 00:02:18,700 --> 00:02:20,150 %pwd. 27 00:02:20,450 --> 00:02:24,030 PWD stands for present working directory. 28 00:02:24,440 --> 00:02:27,600 So if you run this, you will get the directory location. 29 00:02:28,500 --> 00:02:32,730 And in that directory you can find the model that you have saved. 30 00:02:34,440 --> 00:02:41,200 Now, in case you want to change your working directory, you can do it with this command. 31 00:02:41,820 --> 00:02:46,470 You have to write %cd and then the new directory name. 32 00:02:48,180 --> 00:02:52,740 So as we have seen, I have this model present in my folder. 33 00:02:53,550 --> 00:02:57,000 Now let's just delete the model we trained. 34 00:03:00,430 --> 00:03:02,930 So I have deleted the model now. 35 00:03:03,620 --> 00:03:05,870 I'm also clearing the Keras session. 36 00:03:08,860 --> 00:03:15,100 Now, again, I am loading this model with the load_model method. 37 00:03:16,450 --> 00:03:19,390 And here also you have to just provide the name of the file. 38 00:03:20,530 --> 00:03:24,640 And I'm saving this in another object named model. 39 00:03:25,680 --> 00:03:29,620 Let's just try loading the model that we have recently saved. 40 00:03:32,100 --> 00:03:33,850 Now we can call the summary method. 41 00:03:34,120 --> 00:03:41,840 This should give me the summary that we saw earlier while creating the regression neural network with the functional 42 00:03:41,840 --> 00:03:42,260 API. 43 00:03:42,690 --> 00:03:44,990 Just run this. 44 00:03:47,110 --> 00:03:50,770 You can see that we have the same summary.
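The delete-then-reload sequence just walked through can be sketched as follows. The file name func_model.h5 and the tiny model are stand-ins for whatever you saved in your own notebook.

```python
import numpy as np
from tensorflow import keras

# Save a tiny model first so there is a file to load
# (a stand-in for the model saved earlier in the lecture).
model = keras.Sequential([keras.Input(shape=(3,)),
                          keras.layers.Dense(1)])
model.compile(optimizer="sgd", loss="mse")
model.save("func_model.h5")

# Simulate a fresh session: drop the object and clear Keras state.
del model
keras.backend.clear_session()

# load_model restores the architecture, the weights and biases, and the
# compile settings, so the object is immediately usable again.
model = keras.models.load_model("func_model.h5")
model.summary()
```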
45 00:03:52,360 --> 00:03:54,170 Remember, we deleted our model. 46 00:03:55,060 --> 00:03:58,450 And then we loaded it using the saved version. 47 00:03:59,980 --> 00:04:02,820 Now, the summary is just the architecture of that model. 48 00:04:03,910 --> 00:04:12,320 You can also predict using this model, because it already has the optimized weights and biases as well. 49 00:04:13,630 --> 00:04:16,020 So you can use it for prediction also. 50 00:04:16,020 --> 00:04:17,320 So you can see, 51 00:04:17,380 --> 00:04:21,400 this is the same prediction that we were getting earlier 52 00:04:21,490 --> 00:04:22,780 with the functional API. 53 00:04:25,240 --> 00:04:30,940 But this way, you can save a model only after the completion of its training. 54 00:04:33,110 --> 00:04:43,490 But usually we have a very large dataset, which can take up to eight to ten hours just to train the model. 55 00:04:44,120 --> 00:04:45,500 For that scenario, 56 00:04:45,890 --> 00:04:47,420 we can use callbacks. 57 00:04:50,330 --> 00:04:58,820 You can think of callbacks as checkpoints, which will allow you to save your model after each epoch. 58 00:05:00,530 --> 00:05:03,380 So you don't have to wait for the complete training. 59 00:05:04,130 --> 00:05:07,850 You can save your model after each epoch as well. 60 00:05:09,800 --> 00:05:14,020 Let's just again delete this model and 61 00:05:16,010 --> 00:05:18,350 clear the session here. 62 00:05:18,580 --> 00:05:22,750 I'm just taking a small example of a sequential model with two dense layers 63 00:05:23,230 --> 00:05:24,550 and one output layer. 64 00:05:24,790 --> 00:05:26,260 I would be using the same data. 65 00:05:27,280 --> 00:05:28,850 Let's just compile it. 66 00:05:29,260 --> 00:05:31,860 We are using the same parameters as earlier. 67 00:05:34,040 --> 00:05:40,010 Now, you can provide this callback parameter while training the model. 68 00:05:40,510 --> 00:05:45,460 So earlier we were only providing training data, epochs, and the validation dataset.
69 00:05:46,420 --> 00:05:50,590 Now, along with these, we can also provide our callbacks. 70 00:05:51,550 --> 00:05:58,750 So you have to write callbacks, and then here you can provide the list of callbacks that you have created. 71 00:06:01,150 --> 00:06:02,860 This is how you create a callback. 72 00:06:03,370 --> 00:06:05,350 So this is the variable name, 73 00:06:05,430 --> 00:06:06,730 checkpoint callback. 74 00:06:07,890 --> 00:06:12,820 And you have to write keras.callbacks.ModelCheckpoint. 75 00:06:14,770 --> 00:06:16,590 Here I can write the file name. 76 00:06:18,310 --> 00:06:19,060 But since 77 00:06:20,350 --> 00:06:22,970 we are running this model for ten epochs, 78 00:06:23,150 --> 00:06:27,500 that means we want to generate ten files, one for each epoch. 79 00:06:30,170 --> 00:06:38,210 So if I provide a constant file name, it will just overwrite the file for 80 00:06:38,240 --> 00:06:39,890 each of these ten epochs. 81 00:06:40,960 --> 00:06:46,600 So, again, I would be only getting a single file for my ten epochs. 82 00:06:49,210 --> 00:06:50,980 Now, to resolve this problem, 83 00:06:51,670 --> 00:06:53,860 we can use these variables. 84 00:06:54,550 --> 00:07:05,460 So for epoch one, I want to save my file as model_01.h5; for the second epoch, 85 00:07:05,890 --> 00:07:08,150 I want to save it as model_ 86 00:07:08,230 --> 00:07:08,920 02 87 00:07:09,190 --> 00:07:09,710 .h5, 88 00:07:10,360 --> 00:07:11,140 and so on 89 00:07:11,220 --> 00:07:12,310 till the last, tenth epoch. 90 00:07:13,150 --> 00:07:17,380 So you can use such variables in your file name also. 91 00:07:18,040 --> 00:07:21,850 So you don't have to manually write each file name. 92 00:07:24,870 --> 00:07:28,590 So here, model is the name that I have given.
93 00:07:28,770 --> 00:07:36,160 And this is the variable name, which is kept in the curly brackets, so I can write epoch, and I want 94 00:07:36,160 --> 00:07:38,070 the two digits of each epoch. 95 00:07:38,310 --> 00:07:40,170 So for one, I want 01. 96 00:07:40,470 --> 00:07:44,750 So that's why 02d. If I wanted three digits — 97 00:07:44,790 --> 00:07:50,880 so suppose I was running for 300 epochs, I would want three digits: 98 00:07:50,910 --> 00:07:55,750 001, or 100, or 150, up to 299 — 99 00:07:56,280 --> 00:07:59,480 in that case, I would have written 03d. 100 00:08:02,380 --> 00:08:06,740 So let's just create this checkpoint callback again: 101 00:08:07,000 --> 00:08:12,210 keras.callbacks.ModelCheckpoint, and then the file name, and inside the file name 102 00:08:12,220 --> 00:08:14,410 also, we can give variables 103 00:08:15,560 --> 00:08:17,730 to mention different epoch values. 104 00:08:18,260 --> 00:08:18,520 Let's 105 00:08:18,570 --> 00:08:19,200 run this. 106 00:08:20,260 --> 00:08:22,180 And now, just train the model. 107 00:08:22,500 --> 00:08:28,240 We are using the same method, model.fit, with training data, epochs, validation data, 108 00:08:28,410 --> 00:08:30,090 and then the callbacks. 109 00:08:35,580 --> 00:08:40,890 So with each of these epochs, I'm also saving a model file for that epoch. 110 00:08:43,320 --> 00:08:50,310 So if we just go back to our directory, we should have ten model files. 111 00:08:50,640 --> 00:08:54,570 Here you can see we have model_01, 112 00:08:54,570 --> 00:08:56,430 model_02, 113 00:08:56,720 --> 00:08:57,660 and so on. 114 00:08:58,990 --> 00:09:00,150 This is for the first epoch, 115 00:09:00,260 --> 00:09:03,680 the second epoch, the third epoch, and so on till the tenth epoch.
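The per-epoch checkpoint naming just described can be sketched as below. This is a minimal example with made-up toy data and three epochs instead of ten; the file-name pattern model_{epoch:02d}.h5 is the point being illustrated.

```python
import numpy as np
from tensorflow import keras

# Toy data in place of the lecture's dataset.
X = np.random.rand(120, 4).astype("float32")
y = X.sum(axis=1)

model = keras.Sequential([
    keras.Input(shape=(4,)),
    keras.layers.Dense(8, activation="relu"),
    keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")

# {epoch:02d} is filled in by Keras at the end of every epoch, so the
# files become model_01.h5, model_02.h5, ... instead of one file being
# overwritten each time. For a 300-epoch run you would use {epoch:03d}.
checkpoint_cb = keras.callbacks.ModelCheckpoint("model_{epoch:02d}.h5")

model.fit(X, y, epochs=3, callbacks=[checkpoint_cb], verbose=0)
```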
116 00:09:04,660 --> 00:09:13,330 If I had used a single file name, such as model, then I would be getting only a 117 00:09:13,330 --> 00:09:15,710 single file, because at every epoch 118 00:09:15,760 --> 00:09:17,420 that file would be overwritten 119 00:09:17,450 --> 00:09:19,030 by the next epoch's value. 120 00:09:20,530 --> 00:09:25,760 So remember to use the epoch variable in the name while creating a model file per epoch. 121 00:09:27,400 --> 00:09:34,210 Now, I can load these files similar to the previous time. 122 00:09:34,640 --> 00:09:35,620 We just have to 123 00:09:35,620 --> 00:09:36,020 write 124 00:09:36,440 --> 00:09:37,680 keras.models 125 00:09:37,790 --> 00:09:38,250 .load_model 126 00:09:38,590 --> 00:09:41,290 and the file which we want to load. 127 00:09:43,120 --> 00:09:50,740 So just to demonstrate that, let's delete the model and clear the session now. 128 00:09:50,890 --> 00:09:57,620 So suppose I want the fifth of my ten epochs, so model_05. 129 00:10:02,290 --> 00:10:04,780 So this is the model I want to import. 130 00:10:09,330 --> 00:10:11,850 You can see we have imported that model. 131 00:10:12,990 --> 00:10:17,700 So in the future also, if I'm opening this Jupyter notebook after three or four days, 132 00:10:18,000 --> 00:10:19,920 I can just load this model: 133 00:10:20,400 --> 00:10:22,440 keras.models.load_model, 134 00:10:22,590 --> 00:10:25,230 and then the model file name, to load that model. 135 00:10:25,620 --> 00:10:32,580 There is absolutely no need of creating, compiling, and training the model once again. 136 00:10:33,690 --> 00:10:35,470 We can also use the evaluate method. 137 00:10:36,390 --> 00:10:39,660 You can see this model object is working. 138 00:10:41,250 --> 00:10:45,330 Now, there is no need of saving the model
139 00:10:45,630 --> 00:10:55,200 at each epoch. If you are using the validation set during the training, you can set save_best_ 140 00:10:55,230 --> 00:10:58,980 only equal to True while creating a checkpoint callback. 141 00:11:01,770 --> 00:11:07,450 It will only save your model when its performance on the validation set is the best so far. 142 00:11:10,210 --> 00:11:13,870 So suppose, out of these ten epochs, 143 00:11:15,070 --> 00:11:20,440 we were getting the best validation score at, suppose, epoch eight. 144 00:11:21,400 --> 00:11:26,520 Then in the final file, we will be getting the model corresponding to epoch eight. 145 00:11:27,100 --> 00:11:32,260 There is no need to create a separate model for all these epoch values. 146 00:11:34,590 --> 00:11:39,150 Because generally, we want the model corresponding to the best score on the validation set. 147 00:11:42,940 --> 00:11:47,030 So let us delete the model and create the structure again. 148 00:11:47,740 --> 00:11:56,440 And here, while creating the callback, I am also setting the parameter value save_best_only 149 00:11:56,650 --> 00:11:57,560 equal to True. 150 00:11:58,310 --> 00:12:03,640 And then I would be getting only a single model, with the name 151 00:12:03,760 --> 00:12:04,480 best_model, 152 00:12:05,550 --> 00:12:09,900 the model which corresponds to the best value of the validation score. 153 00:12:12,570 --> 00:12:15,760 So let's just create this checkpoint. 154 00:12:16,140 --> 00:12:24,360 And then we are training the model again, with the training data, epochs, the validation set, and then the callbacks. 155 00:12:36,030 --> 00:12:41,160 Now, to get the best model, we can just import the best model that we have saved. 156 00:12:41,820 --> 00:12:43,770 So if you see in the directory, 157 00:12:43,800 --> 00:12:47,720 we have saved this best_model file. 158 00:12:51,510 --> 00:12:53,950 We are just loading and using that model
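The save_best_only workflow just described can be sketched like this. The toy data, layer sizes, and the file name best_model.h5 are assumptions for illustration; the key detail is that with save_best_only=True the checkpoint monitors val_loss by default and rewrites the file only when it improves.

```python
import numpy as np
from tensorflow import keras

# Toy training and validation splits in place of the lecture's data.
X_train = np.random.rand(200, 4).astype("float32")
y_train = X_train.sum(axis=1)
X_valid = np.random.rand(50, 4).astype("float32")
y_valid = X_valid.sum(axis=1)

model = keras.Sequential([
    keras.Input(shape=(4,)),
    keras.layers.Dense(8, activation="relu"),
    keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")

# save_best_only=True: the file is written only when val_loss improves,
# so best_model.h5 always holds the best epoch seen so far.
checkpoint_cb = keras.callbacks.ModelCheckpoint("best_model.h5",
                                                save_best_only=True)

model.fit(X_train, y_train, epochs=5,
          validation_data=(X_valid, y_valid),
          callbacks=[checkpoint_cb], verbose=0)

# Load the single best model back for prediction or evaluation.
best_model = keras.models.load_model("best_model.h5")
```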
159 00:12:54,270 --> 00:12:56,550 to evaluate its performance on the test set. 160 00:13:00,360 --> 00:13:05,950 Now, one of the best methods to train your model is to use early stopping. 161 00:13:07,470 --> 00:13:14,280 What I'm referring to with early stopping is that you would run your model for a very large number of epoch 162 00:13:14,280 --> 00:13:14,790 values. 163 00:13:15,240 --> 00:13:17,910 Suppose here I am using two hundred epochs. 164 00:13:19,190 --> 00:13:23,900 And then I will keep an eye on my validation set score. 165 00:13:25,060 --> 00:13:33,820 So suppose, if after 60 epochs my validation set accuracy is not improving, then I will stop the 166 00:13:33,820 --> 00:13:40,760 training at that point, and I will be using the model with the best validation score so far. 167 00:13:41,620 --> 00:13:43,330 So this is an ideal situation. 168 00:13:43,570 --> 00:13:46,840 We don't have to worry about how many epochs we want to run. 169 00:13:47,500 --> 00:13:49,750 We just set a very high value of epochs. 170 00:13:50,320 --> 00:13:56,990 We monitor the validation set score and then select the best model that we have got so far. 171 00:13:59,360 --> 00:14:06,980 So to achieve that, we will be needing another callback, which we generally call the early stopping 172 00:14:06,980 --> 00:14:07,440 callback. 173 00:14:08,390 --> 00:14:16,190 And we will use this callback along with our normal callback of the save-best-only model. 174 00:14:18,170 --> 00:14:22,310 So in the early stopping callback, we have to provide the patience value. 175 00:14:23,240 --> 00:14:31,730 The patience value is the number of epochs after which our training will be interrupted if there is no progress 176 00:14:31,760 --> 00:14:34,160 on the validation set score. 177 00:14:36,410 --> 00:14:44,210 So suppose, after 53 epochs, we are not getting any improvement on our validation set score. 178 00:14:44,870 --> 00:14:49,510 Then Keras will stop the training at epoch 63.
179 00:14:51,020 --> 00:14:54,420 So the difference between 63 and 53 is ten. 180 00:14:54,930 --> 00:14:55,950 This is the patience, 181 00:14:56,450 --> 00:15:02,380 that is, the number of epochs Keras will wait for any improvement in the validation score. 182 00:15:04,520 --> 00:15:08,150 So earlier you saw, while creating our functional model, 183 00:15:08,180 --> 00:15:14,990 we had to look at our validation score graph to check whether our model is converging or 184 00:15:14,990 --> 00:15:15,380 not. 185 00:15:16,100 --> 00:15:18,070 Now, we don't have to worry about that. 186 00:15:18,500 --> 00:15:27,110 We can use early stopping along with other callbacks, which will automatically save the best model 187 00:15:27,140 --> 00:15:28,110 that is available. 188 00:15:28,970 --> 00:15:36,950 And it will also train our model for the optimal amount of time needed to bring that model to its best 189 00:15:36,950 --> 00:15:37,880 accuracy score. 190 00:15:39,740 --> 00:15:42,680 So let's just run this. 191 00:15:42,860 --> 00:15:49,510 We are first creating the structure. Now, along with the early stopping callback, 192 00:15:49,550 --> 00:15:52,940 we have to use our normal callbacks also to save the model. 193 00:15:54,380 --> 00:15:59,960 And here we are also using the save_best_only parameter as well. 194 00:16:01,760 --> 00:16:09,020 So at the end of all this training, we would be getting only a single model file, for which we would 195 00:16:09,020 --> 00:16:11,850 be getting the best validation set score. 196 00:16:13,550 --> 00:16:16,960 Now, our second callback is the early stopping callback. 197 00:16:17,480 --> 00:16:23,740 Here you have to provide the patience parameter for the number of epochs. 198 00:16:25,780 --> 00:16:26,360 Run this. 199 00:16:27,710 --> 00:16:28,610 Now we have to train 200 00:16:28,640 --> 00:16:33,020 our model. We will be passing the training dataset, epoch values,
201 00:16:34,040 --> 00:16:36,710 Remember to increase the epoch value. 202 00:16:38,630 --> 00:16:42,150 Then the validation data, and then the callbacks. 203 00:16:43,070 --> 00:16:46,100 Again, we are passing callbacks in the form of a list. 204 00:16:46,280 --> 00:16:52,250 We are first passing our checkpoint callback, and then we are passing our early stopping callback. 205 00:16:52,880 --> 00:16:53,900 Just run this. 206 00:17:21,250 --> 00:17:25,860 So the training has stopped after 87 epochs. 207 00:17:27,690 --> 00:17:33,150 This means that there is no need to train it to 200 epoch values. 208 00:17:34,950 --> 00:17:40,890 Our model has already converged, and there is no way to further decrease the validation loss. 209 00:17:41,970 --> 00:17:51,270 You can see the best value we are getting is at the 77th epoch, and it waited for ten more epochs to get 210 00:17:51,700 --> 00:17:53,840 a better value than 0.32. 211 00:17:54,990 --> 00:17:57,250 And you can see, up to the 87th epoch, 212 00:17:58,590 --> 00:18:02,550 the value is still hovering around 0.33. 213 00:18:02,820 --> 00:18:06,290 That's why it stopped at the 87th epoch value. 214 00:18:08,150 --> 00:18:10,360 Let's just look at our directory. 215 00:18:11,330 --> 00:18:17,580 We should have a model with the file name early_stop_model here. 216 00:18:18,500 --> 00:18:26,990 You can see Keras has already saved the best model here, with the name early_stop_model. 217 00:18:28,240 --> 00:18:29,860 So now that the training has stopped, 218 00:18:30,080 --> 00:18:35,900 we can use this early_stop_model to predict and evaluate our future values. 219 00:18:37,550 --> 00:18:45,710 You can see the loss on the test set we are getting is 0.231, as compared to our previous 220 00:18:45,710 --> 00:18:49,820 loss values of 0.4378 and so on.
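The full run just described — a checkpoint with save_best_only plus early stopping — can be sketched as below. The toy data, layer sizes, patience of 10, and the file name early_stop_model.h5 mirror the lecture's setup but are illustrative assumptions; the exact epoch at which training stops will vary with the data.

```python
import numpy as np
from tensorflow import keras

# Toy training and validation splits in place of the lecture's data.
X_train = np.random.rand(300, 4).astype("float32")
y_train = X_train.sum(axis=1)
X_valid = np.random.rand(60, 4).astype("float32")
y_valid = X_valid.sum(axis=1)

model = keras.Sequential([
    keras.Input(shape=(4,)),
    keras.layers.Dense(16, activation="relu"),
    keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")

# Two callbacks together: the checkpoint keeps only the best model on
# disk, and early stopping interrupts training once val_loss has not
# improved for `patience` consecutive epochs.
checkpoint_cb = keras.callbacks.ModelCheckpoint("early_stop_model.h5",
                                                save_best_only=True)
early_stop_cb = keras.callbacks.EarlyStopping(patience=10)

history = model.fit(X_train, y_train, epochs=200,
                    validation_data=(X_valid, y_valid),
                    callbacks=[checkpoint_cb, early_stop_cb], verbose=0)

# Number of epochs actually run; with early stopping this is usually
# well below the 200 we asked for.
print(len(history.history["loss"]))
```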
221 00:18:51,830 --> 00:18:58,820 So using the early stopping callback is a very good way to optimize the performance of your model. 222 00:18:59,150 --> 00:19:02,570 You don't have to worry about the number of epochs. 223 00:19:05,330 --> 00:19:09,200 So let's have a quick recap of what we learned in this lecture. 224 00:19:09,680 --> 00:19:19,890 First, we learned how to save and load our model. Then we used callbacks to save our model at each 225 00:19:19,890 --> 00:19:20,600 epoch value. 226 00:19:21,890 --> 00:19:28,970 Then we learned about a parameter, that is save_best_only, which can save the model for which we are 227 00:19:28,970 --> 00:19:32,870 getting the best accuracy score or lowest validation loss. 228 00:19:34,010 --> 00:19:43,550 And after that, we saw how to use the early stopping callback to terminate the training if no progress on 229 00:19:43,640 --> 00:19:46,070 the validation loss is occurring. 230 00:19:46,940 --> 00:19:48,440 So that's all for this lecture. 231 00:19:48,860 --> 00:19:49,310 Thank you.