1 00:00:01,780 --> 00:00:07,900 In this video, we will discuss descriptive numerical measures which describe the center of the data. 2 00:00:09,740 --> 00:00:15,620 There are four values that they know decentered mean median Moore and mid-range. 3 00:00:17,330 --> 00:00:22,220 If you are aware about this, you can skip the lecture else, let us discuss them one by one. 4 00:00:24,140 --> 00:00:31,100 First is the also called average, it is obtained by adding all the values and then dividing by the 5 00:00:31,100 --> 00:00:32,000 number of values. 6 00:00:33,390 --> 00:00:37,740 The definition is pretty simple, but not these two delusions. 7 00:00:38,850 --> 00:00:41,730 One is population me and the other is Tamplin. 8 00:00:42,870 --> 00:00:49,530 If you want to find out the average height of, say, a hundred thousand people and you collect height 9 00:00:49,530 --> 00:00:53,430 of a hundred thousand people and find average, that is population mean. 10 00:00:55,220 --> 00:01:01,490 But if you think that this would be a difficult task and you collect it of only a hundred people assemble 11 00:01:01,910 --> 00:01:05,360 and find the average that averages sample mean. 12 00:01:07,060 --> 00:01:15,970 So population mean is denoted by Mbewe, the symbolism example, meaning is denoted by X Bar. 13 00:01:17,500 --> 00:01:18,820 Let's look at an example. 14 00:01:20,510 --> 00:01:22,880 Here are five students and their height. 15 00:01:23,510 --> 00:01:28,520 What is the meaning of their right to be at the height and divided by five? 16 00:01:30,150 --> 00:01:35,730 We get a twenty four divided by five, which comes out to one sixty four point eight. 17 00:01:36,920 --> 00:01:43,400 So if the glass contained only five students, this will be the population mean, but if the glass contained 18 00:01:43,400 --> 00:01:49,460 50 students and we were using the word of five students to calculate the mean, then this will be the 19 00:01:49,460 --> 00:01:57,380 sample mean because this is a sample of five students out of the 50 student, depending on what mean 20 00:01:57,380 --> 00:01:59,680 you are representing, you'll use the symbol here. 21 00:01:59,840 --> 00:02:03,400 If it is population mean, then you'll write music to 160 420. 22 00:02:03,620 --> 00:02:08,180 If it is sample when you write X bar is equal to one sixty four point eight. 23 00:02:08,930 --> 00:02:10,010 Next comes median. 24 00:02:10,580 --> 00:02:16,190 Median is the middle term when you order the data in increasing or decreasing order. 25 00:02:17,660 --> 00:02:22,010 So first step is to order the data and then we will find the middle. 26 00:02:23,810 --> 00:02:27,980 If the number of observations are ordered, then there will be a single random. 27 00:02:29,660 --> 00:02:35,360 And that would be odd million, but if the number of observations are even, then there will be two 28 00:02:35,360 --> 00:02:36,220 values in the middle. 29 00:02:36,650 --> 00:02:39,650 In that case, we will take average of the two middle values. 30 00:02:40,870 --> 00:02:45,080 Let's see an example again for the state of you'll find the median age. 31 00:02:45,490 --> 00:02:48,850 So first we order these in ascending order. 32 00:02:50,280 --> 00:02:55,830 And next, we find the middle value, which is the third value here, which comes out to 168. 33 00:02:56,920 --> 00:02:58,600 Suppose we have another student. 34 00:03:00,220 --> 00:03:03,580 Who has a height of 170 centimeters? 35 00:03:04,680 --> 00:03:11,490 That value will come a default position, in that case, there will be two values in the middle, 160 36 00:03:11,790 --> 00:03:12,780 and 170. 37 00:03:14,190 --> 00:03:20,010 There will have to average these two values to give the median, so the median in that case will be 38 00:03:20,010 --> 00:03:24,180 160 plus 170 by two, which will be 169 centimeters. 39 00:03:25,930 --> 00:03:26,770 Next is more. 40 00:03:28,530 --> 00:03:33,270 More value is the value which is occurring maximum number of times in the data. 41 00:03:34,610 --> 00:03:42,590 So if you consider probability, more value has the maximum probability of occurrence, not depending 42 00:03:42,590 --> 00:03:46,940 on the number of moles that the data has, it is given different names. 43 00:03:47,210 --> 00:03:49,550 If data has to modes, it is bimodal. 44 00:03:49,880 --> 00:03:57,170 If more than two modes, then it is multimodal or if no value is repeated and frequent is one for each 45 00:03:57,170 --> 00:03:57,620 value. 46 00:03:58,040 --> 00:03:59,350 Data has no more. 47 00:04:01,320 --> 00:04:02,340 Let's see an example. 48 00:04:05,260 --> 00:04:07,240 For this data, what is the mood hide? 49 00:04:08,460 --> 00:04:15,570 So since 168 comes to times and every other value is coming only once. 50 00:04:16,680 --> 00:04:26,610 168 is the here, if you go to the other data that we had when we calculated median, there is no repetition 51 00:04:26,610 --> 00:04:27,610 of any height. 52 00:04:28,320 --> 00:04:33,870 So this data has no more nexus, mid-range marriages. 53 00:04:34,290 --> 00:04:34,950 Very simple. 54 00:04:35,130 --> 00:04:44,610 You just find the average of largest and smallest values of your data to, for example, have the tallest 55 00:04:44,610 --> 00:04:49,080 one is 182 and shortages 155. 56 00:04:49,800 --> 00:04:56,640 Therefore, mid-range will be 155 plus 182 divided by two, which comes out to one sixty eight point 57 00:04:56,640 --> 00:04:56,910 five. 58 00:05:00,150 --> 00:05:03,700 So now we know the definitions of these measures of senator. 59 00:05:05,080 --> 00:05:14,110 Now, let us compare them, if you remember, we are using the image of a histogram and we have symmetric 60 00:05:14,110 --> 00:05:19,660 data like this, usually the mean median and mode will be identical. 61 00:05:21,980 --> 00:05:29,560 But if the data is skewed, like in the two images after this one, the three values will be different. 62 00:05:30,780 --> 00:05:33,990 Men will be highly influenced by the outliers here. 63 00:05:36,060 --> 00:05:37,680 Million will still be at the center. 64 00:05:39,090 --> 00:05:42,060 More will know the value with highest frequency. 65 00:05:44,520 --> 00:05:49,770 So which one stays relevant for us depends on the data in case we have outliers. 66 00:05:50,800 --> 00:05:58,580 We should either remove the outliers and then find something else, mediant would be our preferred measure 67 00:05:58,580 --> 00:05:59,860 of center over me. 68 00:06:01,950 --> 00:06:07,050 One advantage of Maude is that it can be concluded for qualitative, but also. 69 00:06:08,440 --> 00:06:15,850 So, for example, if we have categories like north, east, west, south, and we want to find out 70 00:06:15,850 --> 00:06:19,030 which category has the highest number of customers. 71 00:06:21,830 --> 00:06:27,860 We just find the mode categories, so we assign the number of customers to each category and then the 72 00:06:27,860 --> 00:06:30,920 more will be the category which has the maximum number of customers. 73 00:06:31,970 --> 00:06:36,820 This is a qualitative data that it has categories and numbers assigned to it. 74 00:06:38,530 --> 00:06:45,040 More can be found out for such data mean and median cannot be found out for such data, that is, you 75 00:06:45,040 --> 00:06:47,740 cannot have a mean of North-Western told. 76 00:06:49,500 --> 00:06:56,370 Also, if you are interested to know the probability of occurrence, Morde is still denoting the value 77 00:06:56,370 --> 00:06:58,230 which has the highest probability of occurrence. 78 00:07:00,110 --> 00:07:05,240 So these are the measures of center in the next video, we will discuss the measures of dispersion.