1 00:00:01,250 --> 00:00:07,390 Now, to handle the second observation of our analysis that is missing values in and horse bids. 2 00:00:08,780 --> 00:00:13,700 We are going to replace the missing values with the mean of the other values available with us. 3 00:00:14,930 --> 00:00:18,290 So now we need to know two things to be able to do that. 4 00:00:18,860 --> 00:00:22,400 One is how to get mean when column has any values. 5 00:00:23,300 --> 00:00:31,580 This is an issue because if I just write mean and within record, I write B.F. Dolar and Husband. 6 00:00:33,150 --> 00:00:39,230 So if I'm trying to find out mean of this variable and if I run it, I get any. 7 00:00:39,500 --> 00:00:41,230 Because this variable contains any. 8 00:00:42,160 --> 00:00:43,640 So I need to tweak this formula. 9 00:00:44,240 --> 00:00:46,070 I will write you mean. 10 00:00:46,760 --> 00:00:47,400 And within bracket. 11 00:00:47,450 --> 00:00:48,500 I will I be of color. 12 00:00:48,530 --> 00:00:53,270 Any D.F. dollar and husband does the same. 13 00:00:54,080 --> 00:00:59,750 I'll give another parameter which is comma and Nadaf item is equal to. 14 00:01:01,400 --> 00:01:03,660 And a dog at home is equal to two. 15 00:01:06,250 --> 00:01:13,990 What this means is, while calculating the mean remove and is from it to no effect on this, I mean 16 00:01:13,990 --> 00:01:19,450 value to the minus seven point eight nine for this variable. 17 00:01:20,950 --> 00:01:25,630 Now we have the mean value know we need to identify the black values in this variable. 18 00:01:27,020 --> 00:01:30,830 This can be done by writing, which will write which. 19 00:01:32,550 --> 00:01:34,820 And within Blackard, we will write is any. 20 00:01:37,880 --> 00:01:44,060 And again, within record, we are the only variable, which is D.F., daughter and husband. 21 00:01:48,090 --> 00:01:56,640 Head dysfunction is any will identify which all the cells are any and which will list them down as a 22 00:01:56,640 --> 00:01:57,390 result for us. 23 00:01:58,080 --> 00:01:58,920 So when I run this. 24 00:02:01,190 --> 00:02:05,910 This is telling me you have to phosphorylation one hour today, integration, 16th position. 25 00:02:06,500 --> 00:02:09,800 All these values are having and in them. 26 00:02:10,880 --> 00:02:11,160 Nonsense. 27 00:02:11,270 --> 00:02:13,130 We do not need to list Tony values. 28 00:02:13,160 --> 00:02:14,720 We just need to identify them. 29 00:02:14,900 --> 00:02:19,040 Will not be using which but only the last part of this. 30 00:02:19,820 --> 00:02:20,950 So let's replace values. 31 00:02:20,960 --> 00:02:24,970 No, by rating the dollar and hospitals. 32 00:02:29,320 --> 00:02:35,430 And husbands and within square bracket will specify is not any. 33 00:02:37,280 --> 00:02:38,340 We'll take this part. 34 00:02:38,940 --> 00:02:39,890 Copy pasted here. 35 00:02:43,570 --> 00:02:48,020 So this is identifying which all values are in it. 36 00:02:48,760 --> 00:02:52,150 And we are replacing these values with the mean. 37 00:02:52,240 --> 00:02:56,080 So this gets to mean that we got locatable. 38 00:03:01,220 --> 00:03:02,280 So let's run this. 39 00:03:05,370 --> 00:03:06,000 This is done. 40 00:03:07,880 --> 00:03:09,710 Missing values are replaced by mean. 41 00:03:09,920 --> 00:03:14,960 And now if I run the somebody of this particular variable. 42 00:03:23,170 --> 00:03:25,450 It is not giving me any any value. 43 00:03:25,930 --> 00:03:33,340 We can also run this, which come on again, which listed on all the cells which have any value. 44 00:03:35,040 --> 00:03:38,290 If I run this again, I'm getting in danger of zero. 45 00:03:38,440 --> 00:03:41,470 That is, there is no position which has any value. 46 00:03:41,800 --> 00:03:45,490 So we have successfully imputed missing values in this variable. 47 00:03:45,940 --> 00:03:46,660 This is all missing. 48 00:03:46,660 --> 00:03:48,400 Value imputation is done in our.