1 00:00:00,600 --> 00:00:04,600 So this is the dataset that we are going to use to build that classification. 2 00:00:06,050 --> 00:00:11,550 The data set is exactly similar to the one that we used for building out regression tree. 3 00:00:12,510 --> 00:00:13,830 These are the same variables. 4 00:00:14,370 --> 00:00:17,400 Only thing is that in the end. 5 00:00:19,020 --> 00:00:25,800 Last thing we were predicting the value of collection to, we had the information about all these variables 6 00:00:25,890 --> 00:00:30,250 and we were trying to predict the value of collection in this dataset. 7 00:00:31,020 --> 00:00:32,220 We already know that. 8 00:00:32,220 --> 00:00:32,760 How much? 9 00:00:34,080 --> 00:00:37,020 A movie has collected at the box office. 10 00:00:38,350 --> 00:00:45,070 Now we want to predict whether that particular movie is going to win a star like Oscar or not. 11 00:00:46,020 --> 00:00:53,310 So this particular award, will any of these movies win or Nortman besides the data? 12 00:00:53,350 --> 00:00:54,640 And this lady was. 13 00:00:55,890 --> 00:01:01,200 Again, this data is for, I wondered, six movies for these 506 movies. 14 00:01:01,320 --> 00:01:09,510 We have the data whether they've won or whether they did not win the static Oscar using this data. 15 00:01:10,470 --> 00:01:12,090 We will train our model. 16 00:01:14,100 --> 00:01:15,800 We will spread the data into two parts. 17 00:01:16,080 --> 00:01:17,690 One part will be used to pay non-moral. 18 00:01:18,090 --> 00:01:21,030 The other part will be used to check its performance. 19 00:01:23,280 --> 00:01:30,120 And once we have created this model, we can use that model to predict whether any particular movie 20 00:01:30,210 --> 00:01:37,530 is going to win a static Oscar or not by putting the value of all these other variables into our model.