1 00:00:00,070 --> 00:00:06,840 Hello, before going at it in the last session of this project, let me have a quick overview of what 2 00:00:06,840 --> 00:00:14,280 we have done in our project and not so from importing the data that we have done lots of costing. 3 00:00:14,280 --> 00:00:17,700 Then we have packed some meaningful insight from this budget. 4 00:00:17,970 --> 00:00:22,460 Then we have some surplus than we have in France from it as well. 5 00:00:22,830 --> 00:00:27,120 Then after it, we have performed some kind of this point cost analysis. 6 00:00:27,120 --> 00:00:30,150 After it, we have performed this heat map. 7 00:00:30,150 --> 00:00:36,570 Then we have automate all this stuff after we have performed some kind of a speech analysis on our data. 8 00:00:36,780 --> 00:00:41,400 And this is exactly that, a spatial analysis after we have automated all these things. 9 00:00:41,700 --> 00:00:47,610 And in the very last session, what we have done, we have done what exactly that has ongoing in our 10 00:00:47,610 --> 00:00:51,460 New York City with respect today, with respect to our all these things. 11 00:00:51,840 --> 00:00:56,940 So in this session, what I have to do, I have to read this data, this one, and I have to perform 12 00:00:56,950 --> 00:01:00,780 analysis on this one, which is exactly Ortberg, Jan five. 13 00:01:00,840 --> 00:01:05,970 While so very first I have to read this to let me copy this, but we are exactly right. 14 00:01:05,970 --> 00:01:07,160 It is available over here. 15 00:01:07,170 --> 00:01:11,610 So here I'm going to say B'Day Dot read Six Feet. 16 00:01:11,610 --> 00:01:17,400 And here very first we have to mention that particular part just passed and this is exactly the data 17 00:01:17,400 --> 00:01:18,600 set that you have to read. 18 00:01:18,930 --> 00:01:24,650 So let me append roasting over here so that to get rid of all your errors so it fit exactly it. 19 00:01:24,660 --> 00:01:26,430 And we need a frame object. 20 00:01:26,430 --> 00:01:32,490 So I'm just going to store it in, let's say over underscore while so it's all up to you, whatever 21 00:01:32,490 --> 00:01:33,720 name you want to assign. 22 00:01:33,990 --> 00:01:40,500 And if I'm going to call this head over there to get how exactly my data frame looks like. 23 00:01:40,920 --> 00:01:42,390 So it just executed. 24 00:01:42,390 --> 00:01:48,300 And this is exactly your new little frame on which you have to do certain kind of analysis. 25 00:01:48,570 --> 00:01:54,810 And if you are going to like lexical shape over there to check the number of rows and call it, have 26 00:01:55,020 --> 00:01:59,310 to have almost three hundred fifty rows in your data. 27 00:01:59,460 --> 00:02:05,220 So here you will see over here, this is exactly your assignment very forcefully statement. 28 00:02:05,730 --> 00:02:14,240 You have to analyze which base number, which based on what exactly this one has most active cases. 29 00:02:14,340 --> 00:02:18,780 You have to analyze this for this what I am going to do here. 30 00:02:18,900 --> 00:02:25,890 So let me show you how many unique piece number we have so that it shows I have to exit this over and 31 00:02:25,890 --> 00:02:26,730 a square file. 32 00:02:27,090 --> 00:02:31,260 So in this I have to exit the column, which is exactly this one number. 33 00:02:31,500 --> 00:02:39,060 And on this, if I'm going to call this unique and if I'm going to execute it now, here you will see 34 00:02:39,060 --> 00:02:45,050 you have that look this this you need piece number available in your data. 35 00:02:45,420 --> 00:02:49,050 So now what you need, your problem statement is nothing. 36 00:02:49,050 --> 00:02:54,960 But you have to analyze which base number has most active basis. 37 00:02:54,960 --> 00:03:02,910 It means you just need a distribution of activities with respect to each and every base number. 38 00:03:03,570 --> 00:03:12,690 So far days, what we guys can do, we guys can simply use our box plot or because here you have a multiple 39 00:03:12,690 --> 00:03:18,340 with no and whenever you have multiple was always try to go ahead with box part. 40 00:03:18,390 --> 00:03:19,200 That's a trick. 41 00:03:19,440 --> 00:03:26,130 So here I would say Asness dot box, but you can definitely use your block box score as well. 42 00:03:26,130 --> 00:03:27,150 It's already OK. 43 00:03:27,390 --> 00:03:35,100 So in this X-axis, I have to just copy this, despatching this number from here and just paste over 44 00:03:35,100 --> 00:03:43,580 here and on this Y-axis, I definitely need my activators, which is exactly this feature. 45 00:03:43,950 --> 00:03:50,910 So here I am going to say I have to say here and after it I have to say what exactly is I'm going to 46 00:03:50,910 --> 00:03:57,270 say data is equal to nothing but over underscore file, which is exactly this one. 47 00:03:57,690 --> 00:04:05,700 So if I'm going to execute this cell, you will see with respect to each and every piece number, you 48 00:04:05,700 --> 00:04:08,430 have some kind of distributions over here. 49 00:04:08,640 --> 00:04:12,600 And if you want to customize it, you can zoom, zoom or whatever you want. 50 00:04:12,600 --> 00:04:13,670 It's all up to you. 51 00:04:14,310 --> 00:04:17,460 So from this, you can definitely come up with conclusion. 52 00:04:17,460 --> 00:04:17,880 Yeah. 53 00:04:18,150 --> 00:04:26,730 This base number, which is exactly the zero two seven six four, has a maximum number of activists. 54 00:04:27,330 --> 00:04:31,380 So that's the type of interest that you can fetch from your data. 55 00:04:31,650 --> 00:04:39,540 So let's go ahead with our next policy statement in which we have to analyze which base number has maximum 56 00:04:39,540 --> 00:04:41,190 troops to four days. 57 00:04:41,190 --> 00:04:43,080 Again, you need your box plot. 58 00:04:43,230 --> 00:04:47,280 So here I'm just going to say you here, you have to do some modification. 59 00:04:47,280 --> 00:04:47,810 That's it. 60 00:04:48,270 --> 00:04:52,470 So on y axis, you just need your grips here. 61 00:04:52,470 --> 00:04:55,460 I'm going to say on y axis, I just need Kraatz. 62 00:04:55,800 --> 00:04:57,180 So just executed. 63 00:04:57,180 --> 00:04:59,550 And now you can definitely see over here. 64 00:05:00,110 --> 00:05:04,820 Again, this these two, seven, six, four is still fine. 65 00:05:05,660 --> 00:05:13,610 It means it has a maximum number of activities as well as it has a maximum number of trips as well. 66 00:05:13,910 --> 00:05:19,220 So that's the type of conclusion you can get from this board, these vessels. 67 00:05:19,520 --> 00:05:21,840 So let's go ahead with our next analysis. 68 00:05:21,860 --> 00:05:29,990 Let's go ahead with the next statement in which you have to analyze how average trips per vehicle increases 69 00:05:29,990 --> 00:05:34,460 or decreases with respect to date with each of the base number. 70 00:05:34,460 --> 00:05:40,470 But here in this data frame, you will observe you don't have any future as trips per vehicle. 71 00:05:40,700 --> 00:05:44,810 It means you have to create these features, like we create that feature very full. 72 00:05:45,430 --> 00:05:53,240 I am going to say over an escort while and let's say my future name is nothing but trips per week. 73 00:05:53,810 --> 00:06:00,590 So here first you have defined this feature and this feature is nothing but which is exactly your divisor 74 00:06:00,620 --> 00:06:03,570 of your trips for activities. 75 00:06:03,590 --> 00:06:08,990 So here I am going to say very first, you have to access your very first, which is exactly what trips 76 00:06:08,990 --> 00:06:10,990 after you have to use this operator. 77 00:06:11,000 --> 00:06:16,800 And I'm just going to use this one and here I have to access this activity. 78 00:06:16,820 --> 00:06:17,750 So that's it. 79 00:06:17,930 --> 00:06:21,040 And if, again, I'm going to call had over there. 80 00:06:21,050 --> 00:06:25,940 So I'm just going to say you were on this call for I'll head over there now. 81 00:06:25,940 --> 00:06:30,200 You can easily visualize with respect to each and have the best number. 82 00:06:30,410 --> 00:06:33,860 You'll have that much number of trips per week. 83 00:06:34,160 --> 00:06:41,900 Now, what you need over here, according to a problem statement here you have how your average cruise 84 00:06:41,960 --> 00:06:45,740 per week increases or decreases with respect to date. 85 00:06:45,740 --> 00:06:50,840 It means you have to create this date, this date as your index. 86 00:06:51,110 --> 00:06:58,760 So what I'm going to do, I'm just going to say over underscore one dot set on the gold index and I 87 00:06:58,760 --> 00:07:02,090 have to set my date as index. 88 00:07:02,090 --> 00:07:06,140 And if I'm going to execute, it all just gets executed over here. 89 00:07:06,170 --> 00:07:09,440 So once you have this is just what you have to do. 90 00:07:09,590 --> 00:07:13,880 Very first, you have to group your data, because now you have this data frame. 91 00:07:14,090 --> 00:07:18,890 Now you have to group your data on the basis of your base number. 92 00:07:18,930 --> 00:07:26,090 So here I'm going to say dot group by and I have to basically group it on the basis of this despatching 93 00:07:26,090 --> 00:07:26,810 based number. 94 00:07:27,050 --> 00:07:30,470 Once I have this group, I have to access this. 95 00:07:30,770 --> 00:07:37,150 I have to access this vehicle, so I have to access this vehicle. 96 00:07:37,160 --> 00:07:47,150 And on this I have to simply call this plot that sit and let me customize my own window side or phagocytes 97 00:07:47,200 --> 00:07:47,340 here. 98 00:07:47,390 --> 00:07:50,290 I'm going to say VLT, Dot Fagre. 99 00:07:50,300 --> 00:07:54,860 And here I have a parameter which is exactly Fixit and it triggers. 100 00:07:54,860 --> 00:08:01,700 I just like to say to all six and if you want to assign some X levels unvaluable, you can assign it 101 00:08:01,700 --> 00:08:02,090 as well. 102 00:08:02,450 --> 00:08:12,260 So let's say my Y label is nothing but X in my right label is that the average crib's for Weyco and 103 00:08:12,260 --> 00:08:16,540 after assigning available, I have to assign my title. 104 00:08:16,550 --> 00:08:20,780 So here I'm going to say BLT dot type does. 105 00:08:20,780 --> 00:08:27,080 So my title is nothing but let's say on demand versus supply chart. 106 00:08:27,380 --> 00:08:33,290 So demand was a supply chart, longsuffering, all this stuff just executed. 107 00:08:33,290 --> 00:08:38,140 And now you will see what this is with respect to each of your base number. 108 00:08:38,360 --> 00:08:40,710 So let me add on Leegin over here. 109 00:08:40,730 --> 00:08:45,260 So here I'm going to say BLT dot e again and again. 110 00:08:45,260 --> 00:08:52,070 If I'm going to execute it now, you will see with respect to each other based number, you have your 111 00:08:52,190 --> 00:08:55,510 own plot for each of the base number. 112 00:08:55,820 --> 00:09:03,200 So that's the type of analysis how you can perform on your data and you can definitely see or hear this 113 00:09:03,200 --> 00:09:10,670 orange plot, which is exactly this one, and this purple, which is exactly this be zero two seven 114 00:09:10,670 --> 00:09:11,380 six four. 115 00:09:11,450 --> 00:09:12,410 They definitely are. 116 00:09:12,410 --> 00:09:16,550 Turns your attention and you can definitely conclude as yeah. 117 00:09:16,580 --> 00:09:20,980 With respect to these d did you have that much demand and supply? 118 00:09:20,990 --> 00:09:27,440 It means both these base number, which is the zero two seven six four, and this orange which is exactly 119 00:09:27,440 --> 00:09:31,040 zero two five nine eight, they definitely performed better. 120 00:09:31,280 --> 00:09:37,010 Whereas in case of blue chart, this blue plot, which is with respect to the zero two, five and two, 121 00:09:37,160 --> 00:09:42,230 it doesn't have that much good attention comparing to all other space number. 122 00:09:42,560 --> 00:09:47,570 So that's the type of inference your conclusion you have patch on the data. 123 00:09:47,750 --> 00:09:49,700 So that's all about this project. 124 00:09:49,710 --> 00:09:51,470 Hope you love this project very much. 125 00:09:51,770 --> 00:09:52,430 Thank you. 126 00:09:52,430 --> 00:09:59,270 Have a nice day and try to explore this data as much as you can because it's still you have managed. 127 00:09:59,500 --> 00:10:06,490 Does a available over here to try to create his data and try to explore as much as you can, and that's 128 00:10:06,540 --> 00:10:12,960 it said just focus on your resume and secure your job as whatever rules you want. 129 00:10:13,200 --> 00:10:16,230 So I hope you will love this session and this project very much. 130 00:10:16,500 --> 00:10:17,110 Thank you. 131 00:10:17,130 --> 00:10:19,230 How nice to keep learning. 132 00:10:19,230 --> 00:10:21,420 Keep growing, keep motivating.