1 00:00:01,780 --> 00:00:08,930 To handle the outliers in any hard rooms and rainfall, we will use the capping and flooring technique 2 00:00:09,170 --> 00:00:10,130 discussed in Lichter's. 3 00:00:11,410 --> 00:00:18,670 We will cap the upper value of and hot rooms to a value of three into 99 percentile, and we will cap 4 00:00:18,670 --> 00:00:22,650 the lower value of rent for two point three into first person date. 5 00:00:24,840 --> 00:00:30,010 For this, we need to know how to find out the value of a variable at a particular percentile level. 6 00:00:31,360 --> 00:00:38,760 The method for this is to use quintile to relate quintile within bracket. 7 00:00:40,010 --> 00:00:40,340 Relate. 8 00:00:40,570 --> 00:00:42,700 D.F. dollar and hard rooms. 9 00:00:47,210 --> 00:00:47,620 Coma. 10 00:00:48,920 --> 00:00:51,730 We mentioned deep percentiles where useful data point mining. 11 00:00:52,580 --> 00:00:53,330 If you've done this. 12 00:00:55,460 --> 00:01:06,700 In the desert, you can see fifteen point three nine nine is the 99 percentile value in hot rooms to 13 00:01:06,700 --> 00:01:10,280 be won three times of this value as the upper limit. 14 00:01:10,460 --> 00:01:16,010 So we will assign you V a value of three times this. 15 00:01:16,250 --> 00:01:17,810 So I'll copy paste. 16 00:01:20,530 --> 00:01:21,610 This a lane. 17 00:01:25,400 --> 00:01:33,290 So about well, you get a number, which is forty six point two, which is three times fifteen point 18 00:01:33,290 --> 00:01:33,800 three nine. 19 00:01:35,200 --> 00:01:40,610 Now, for all we're losing in hot rooms, we will compare whether it is greater than be. 20 00:01:41,380 --> 00:01:44,240 And if it is, we will change that value to you. 21 00:01:44,250 --> 00:01:46,030 We to do that. 22 00:01:46,630 --> 00:01:46,870 We will. 23 00:01:46,870 --> 00:01:47,240 Right. 24 00:01:48,540 --> 00:01:50,640 B.F. Dollar and Hart rooms 25 00:01:54,000 --> 00:01:54,990 within square brackets. 26 00:01:55,020 --> 00:01:57,630 We relate behav dollar and hadrons. 27 00:01:58,970 --> 00:02:00,230 Greater than EUI. 28 00:02:07,250 --> 00:02:14,370 So all values of this variable greater than you, we should get the value you we. 29 00:02:17,460 --> 00:02:25,560 Now, if you on this, all the values which were beyond the movie range now have value, U.B.. 30 00:02:26,980 --> 00:02:34,360 Now, if we run UDD on this particular variable, we will somebody and within bracket will they D.F. 31 00:02:34,360 --> 00:02:36,460 Dollar and hotrods. 32 00:02:39,190 --> 00:02:39,630 Run this. 33 00:02:41,150 --> 00:02:46,030 You can see the median and mean are now much closer to. 34 00:02:47,750 --> 00:02:52,490 And the maximum value is change four to six, which was earlier hundred and one. 35 00:02:54,390 --> 00:02:56,560 So let's let us do the same thing for rainfall. 36 00:02:56,870 --> 00:03:04,060 Rainfall has outlaid on the lower side, so we assign Elvie the value of point three times the first 37 00:03:04,060 --> 00:03:13,390 quartile value to relate Elway's equal to zero point three times gone by. 38 00:03:20,060 --> 00:03:22,730 And the variable is D.F. dollar rainfall. 39 00:03:24,200 --> 00:03:25,670 And we want DeForest Cottontail. 40 00:03:26,000 --> 00:03:27,290 So at this point due to one. 41 00:03:29,690 --> 00:03:32,400 So we have the law well, you know, we've done this. 42 00:03:33,130 --> 00:03:37,960 So there is another variable, Elvie, here, which has a value of six. 43 00:03:40,010 --> 00:03:41,800 No, let us replace this value. 44 00:03:42,460 --> 00:03:47,970 We will ride the F dollar rainfall. 45 00:03:48,690 --> 00:03:51,550 And within square brackets, we will light the F dollar. 46 00:03:51,570 --> 00:03:55,840 Rainfall is less than this first court quantized value. 47 00:03:57,460 --> 00:04:01,960 These values will get Elvie input on this. 48 00:04:03,320 --> 00:04:04,660 The values have been changed. 49 00:04:04,870 --> 00:04:11,380 If we done EGD again on rainfall by somebody, B.F. dolar rainfall. 50 00:04:17,040 --> 00:04:23,250 We again see that mean and median values are quite close now, and the minimum value, which was earlier, 51 00:04:23,640 --> 00:04:25,940 probably three, is now six. 52 00:04:26,550 --> 00:04:32,930 If you want to have a value of minimum value closer to the first quintile, you we could have used a 53 00:04:32,940 --> 00:04:35,760 different multiplier than point three. 54 00:04:36,690 --> 00:04:38,490 Probably that gave us a very low value. 55 00:04:39,360 --> 00:04:43,290 But still, the median median are quite close and we are happy with this. 56 00:04:43,690 --> 00:04:48,750 So we have done the outlier treatment for the two variables that we wanted to treat.