1 00:00:00,210 --> 00:00:06,840 Long before going deep dive into the session, let's have a quick recap of what we have done in our 2 00:00:06,840 --> 00:00:07,880 previous session. 3 00:00:08,310 --> 00:00:14,280 So no previous session, we have this distribution, which is nothing but a distribution of price of 4 00:00:14,460 --> 00:00:19,140 each and every room type tonight and per person. 5 00:00:19,140 --> 00:00:24,010 So this is exactly the distribution that we have come of it in a similar way. 6 00:00:24,060 --> 00:00:32,070 We have this analysis, which is exactly a room price per night over the air with respect to my resort 7 00:00:32,070 --> 00:00:35,880 hotel as well as Cost City Hotel as so in the session. 8 00:00:35,880 --> 00:00:44,910 We have this assignment in which I have to find what is our distribution of nights spent at hotels by 9 00:00:44,910 --> 00:00:49,160 different different market segment and hotel type Lexcen. 10 00:00:49,170 --> 00:00:53,770 Let's see if I'm going to call our pad on my date of now. 11 00:00:53,790 --> 00:01:00,770 You will see over here, this is the data fee that you have to consider for your analysis purposes. 12 00:01:01,410 --> 00:01:08,790 So here you have a column, Naem, which is exactly something known as market segment, which is exactly 13 00:01:08,790 --> 00:01:11,070 this one, which is exactly this one. 14 00:01:11,070 --> 00:01:16,010 And this is exactly your second column, which is my stays in weekend. 15 00:01:16,020 --> 00:01:24,780 And it means it means you have to find a distribution of each and every market segment or you can say 16 00:01:24,780 --> 00:01:32,770 you have to find distribution of it stays in weekend night in each and every market segment. 17 00:01:33,030 --> 00:01:35,460 So for this, we have a very handy plot. 18 00:01:35,460 --> 00:01:39,090 So I'm going to say asanas dot box plot. 19 00:01:39,090 --> 00:01:43,850 And in this box plot, I have to play with different different parameters. 20 00:01:44,280 --> 00:01:51,390 So here I'm going to say on X axis, I have this I have this feature. 21 00:01:51,390 --> 00:02:00,510 So here I'm just going to do copy paste and basically on y axis or y axis, I have this column, so 22 00:02:00,510 --> 00:02:01,830 I have to just copy. 23 00:02:02,110 --> 00:02:04,130 I have to just paste over there. 24 00:02:04,140 --> 00:02:10,560 Let me remove the special character and after it, what I have to do, I have to say what exactly is 25 00:02:10,560 --> 00:02:13,740 my little frame, which is my data and simple. 26 00:02:13,890 --> 00:02:22,080 I had to split this box plot on the basis of this hotel, on this basis, all on the basis of this hotel 27 00:02:22,650 --> 00:02:23,520 and after it. 28 00:02:23,520 --> 00:02:28,200 What I have to do, let's say I have to set my own window site for this. 29 00:02:28,200 --> 00:02:30,320 I'm going to say Paltalk figure. 30 00:02:30,330 --> 00:02:36,630 And here I have something which is exactly Fixit and I have to assign my physical side of that Cifas 31 00:02:36,660 --> 00:02:37,580 Kincumber 10. 32 00:02:38,070 --> 00:02:41,610 So after what I have to do, I have to just execute it. 33 00:02:41,610 --> 00:02:49,260 And this is exactly your box plot with respect to each and every different different market segment 34 00:02:49,260 --> 00:02:52,920 you will see with respect to this stock market segment. 35 00:02:53,220 --> 00:02:59,220 You have this distribution, whereas with respect to all these different, different kind of market 36 00:02:59,220 --> 00:03:02,100 segments, you have this kind of situation. 37 00:03:02,520 --> 00:03:09,540 So from this, if I have to conclude that, I can say almost most of the groups are normally distributed. 38 00:03:09,550 --> 00:03:11,670 Yeah, some of have some skewness. 39 00:03:11,670 --> 00:03:15,210 You will see this this data point, which are exactly outliers. 40 00:03:15,210 --> 00:03:20,590 It means some of have some positive skewness in my data. 41 00:03:20,610 --> 00:03:28,020 So if looking at this box plot or I can see looking at the distribution, I can see most people do not 42 00:03:28,020 --> 00:03:32,240 seem to prefer to stay at the hotel more than one weeks. 43 00:03:32,280 --> 00:03:37,430 And you will observe over here in case of resolved people are going to stay more. 44 00:03:37,440 --> 00:03:43,260 It means it is always that whenever people go to resort hotels, they prefer to stay more. 45 00:03:43,290 --> 00:03:47,120 So that's a basic conclusion, how you can fetch from your data. 46 00:03:47,340 --> 00:03:54,360 So let's go ahead with our second problem statement, which is I have to analyze preference of guest 47 00:03:54,510 --> 00:03:56,760 what they basically prefer. 48 00:03:56,940 --> 00:04:01,710 So I have to analyze their behavior, what basically the PAFA. 49 00:04:02,100 --> 00:04:06,200 So here I'm going to say let's say, well, let me show you a thing. 50 00:04:06,210 --> 00:04:10,590 So very first, I have to excuse my middle column. 51 00:04:10,590 --> 00:04:17,820 And on this, I have to say I could just call this the loop, because you will see over here this this 52 00:04:17,820 --> 00:04:19,980 Beeby has highest count. 53 00:04:19,980 --> 00:04:24,450 It means most of the guests are basically going to prep for this meeting. 54 00:04:25,110 --> 00:04:26,600 Let me show this thing. 55 00:04:26,610 --> 00:04:27,930 Let me show this thing. 56 00:04:28,230 --> 00:04:29,690 Why am I pie chart? 57 00:04:29,700 --> 00:04:34,190 Or you can use some handy personal picture like me, use Pockley. 58 00:04:34,560 --> 00:04:42,760 So I'm going to say B X dot by and in this, if you will, plus tab, you will get all your stuff. 59 00:04:43,290 --> 00:04:50,160 So here what I'm going to do very first, I have to pass my Netafim, which is exactly my data. 60 00:04:51,180 --> 00:04:58,860 After what I have to do here in my names, I have to just copy this and I have to just paste over there 61 00:04:59,340 --> 00:04:59,910 and. 62 00:05:00,000 --> 00:05:09,360 In this index, I have to say DOT index of this one, and after what I have to do, are likely brashest 63 00:05:09,360 --> 00:05:11,310 plus tapu check all the parameters. 64 00:05:11,310 --> 00:05:12,410 It has names. 65 00:05:12,420 --> 00:05:15,480 So let me just do some modifications. 66 00:05:15,480 --> 00:05:19,400 And this is exactly my names. 67 00:05:19,440 --> 00:05:20,700 It is not index. 68 00:05:21,000 --> 00:05:26,010 So here I'm going to say it is nothing, but it is just my values. 69 00:05:26,010 --> 00:05:31,980 And if you will shift crosstab, you will see here you have name, here you have values. 70 00:05:32,270 --> 00:05:37,130 And let's say I have to use something known as donor chart. 71 00:05:37,140 --> 00:05:40,800 So for this I have to just blow this whole parameter. 72 00:05:40,800 --> 00:05:43,680 And in this whole parameter I have to set some value. 73 00:05:44,040 --> 00:05:47,220 Let's say I have to set a value of five five. 74 00:05:47,220 --> 00:05:55,350 And if you will, execute the self executed and this is your beautiful donnacha and if you will hold 75 00:05:55,350 --> 00:05:58,950 your mouth, you will see all these have this discount. 76 00:05:59,250 --> 00:06:04,140 So you will see this this breakfast has that much higher ground. 77 00:06:04,140 --> 00:06:09,450 And if you have to conclude from this visual, you will see this is exactly on a chart. 78 00:06:09,570 --> 00:06:16,170 And almost 90 percent of the bookings are definitely reserved for breakfast. 79 00:06:16,170 --> 00:06:21,240 So that's the type of analysis that's type of conclusion you can fetch from data. 80 00:06:21,270 --> 00:06:24,510 You can also consider your pie chart instead of this chart. 81 00:06:24,510 --> 00:06:31,260 You can also consider Barcia, but you done it and by will play a very handy role whenever you have 82 00:06:31,260 --> 00:06:33,570 some left number of Category four. 83 00:06:33,570 --> 00:06:40,950 The idea behind when to use a lot means that that's all up to what scenario text all up to what use 84 00:06:41,190 --> 00:06:41,760 we have. 85 00:06:41,760 --> 00:06:43,620 So I hope you love the session very much. 86 00:06:43,780 --> 00:06:44,570 Thank you. 87 00:06:44,610 --> 00:06:45,560 Have a nice day. 88 00:06:45,780 --> 00:06:46,620 Keep learning. 89 00:06:46,740 --> 00:06:47,580 Keep growing. 90 00:06:47,850 --> 00:06:48,720 Keep practicing.