1 00:00:01,260 --> 00:00:07,370 Now, to handle the second observation of our analysis that is missing values in and horse bids. 2 00:00:08,760 --> 00:00:13,710 We are going to replace the missing values with the mean of the other values available with us. 3 00:00:14,940 --> 00:00:18,270 So now we need to know two things to be able to do that. 4 00:00:18,870 --> 00:00:22,410 One is, how do good mean when column has any values? 5 00:00:23,310 --> 00:00:31,590 This is an issue because if I just write mean and within record, I write B.F. Towler and Husband. 6 00:00:33,240 --> 00:00:39,240 So if I'm trying to find out mean of this variable and if I run it, I get any. 7 00:00:39,510 --> 00:00:41,250 Because this very well contains any. 8 00:00:42,090 --> 00:00:43,630 So I need to tweak this formula. 9 00:00:44,250 --> 00:00:44,820 I will write. 10 00:00:45,000 --> 00:00:47,400 You mean and within bracket. 11 00:00:47,430 --> 00:00:48,510 I will be of color. 12 00:00:48,550 --> 00:00:53,280 Any beef dollar and husband does the same. 13 00:00:54,090 --> 00:00:59,990 I'll give another parameter which is comma and need art item is equal to blue. 14 00:01:01,400 --> 00:01:03,660 In a dark autumn is equal to two. 15 00:01:06,250 --> 00:01:10,870 What this means is, while calculating the mean remove and is from it. 16 00:01:11,730 --> 00:01:19,430 Now if I run this, I'll get a mean value to the Menas seven point eight nine for this variable. 17 00:01:20,950 --> 00:01:22,210 Now we have the mean value. 18 00:01:22,360 --> 00:01:25,630 Now we need to identify the black values using this variable. 19 00:01:27,030 --> 00:01:30,800 This can be done by writing, which will write which. 20 00:01:32,620 --> 00:01:34,860 And within Blackard, we will write is any. 21 00:01:37,880 --> 00:01:44,090 And again, within record, we're the only variable, which is D.F. Toler and husband. 22 00:01:49,250 --> 00:01:56,960 Dysfunction is any will identify which all the cells are any and which will list them down as a result 23 00:01:56,960 --> 00:01:57,410 for us. 24 00:01:58,100 --> 00:01:58,940 So when I this. 25 00:02:01,180 --> 00:02:06,170 This is telling me you have to ask permission one after the into the 16th position. 26 00:02:06,490 --> 00:02:09,810 All these values are having and name them. 27 00:02:10,860 --> 00:02:16,210 Oh, since we do not need to list Tony values, we just need to identify them, will not be using which 28 00:02:17,120 --> 00:02:19,070 but only the last part of this. 29 00:02:19,820 --> 00:02:20,960 So let's replace values. 30 00:02:20,960 --> 00:02:21,740 No, by rating. 31 00:02:22,770 --> 00:02:24,960 Beef, dollar and horsemeat. 32 00:02:29,340 --> 00:02:35,430 And husbands and within square bracket will specify is not any. 33 00:02:37,290 --> 00:02:38,340 We'll take this part. 34 00:02:38,940 --> 00:02:39,860 Copy pasted here. 35 00:02:43,550 --> 00:02:48,130 So this is identifying which all values are any. 36 00:02:48,770 --> 00:02:52,130 And we are replacing these values with the mean. 37 00:02:52,220 --> 00:02:56,120 So this gets to mean that we got locatable. 38 00:03:01,510 --> 00:03:02,290 Let's run this. 39 00:03:05,400 --> 00:03:06,030 This is done. 40 00:03:07,880 --> 00:03:14,960 Missing values are replaced by mean and no, if I ran the somebody of this particular variable. 41 00:03:23,190 --> 00:03:25,440 It is not giving me any any value. 42 00:03:25,950 --> 00:03:33,330 We can also run this, which come on again, which listed on all the cells which have any value. 43 00:03:34,900 --> 00:03:38,300 But on this, again, I'm getting a deja vu of zero. 44 00:03:38,460 --> 00:03:41,490 That is, there is no position which has any value. 45 00:03:41,820 --> 00:03:45,510 So we have successfully imputed missing values in this variable. 46 00:03:45,960 --> 00:03:46,680 This is all missing. 47 00:03:46,680 --> 00:03:48,430 Value imputation is done in our.