1 00:00:05,860 --> 00:00:06,220 Here. 2 00:00:06,260 --> 00:00:11,830 Even so in this video we are going to find some relations. 3 00:00:13,240 --> 00:00:19,810 And here if you notice on the caution is there a relation between number of projects offers and number 4 00:00:19,810 --> 00:00:21,320 of donations made by donor. 5 00:00:21,640 --> 00:00:27,190 So we have to find a relationship between projects offered and the donations made by donor. 6 00:00:27,310 --> 00:00:33,160 Then with state performing battering case and how many of responding projects request below average 7 00:00:33,160 --> 00:00:38,210 and states are performing best and best in terms of donations per project. 8 00:00:38,860 --> 00:00:44,340 So after that we need to find that the state is performing that best and which is performing lowest. 9 00:00:44,440 --> 00:00:46,960 And then I have a hint here for you. 10 00:00:47,080 --> 00:00:51,360 In order to answer this question we just need to get the number of projects per state. 11 00:00:51,460 --> 00:00:53,130 That is we have already done before. 12 00:00:53,860 --> 00:00:56,820 And then the number of donations made per state. 13 00:00:56,920 --> 00:01:03,690 That is also we have done in the last video then we should most these two and proto skateboard to solicit. 14 00:01:03,690 --> 00:01:09,330 So first of all we're required that the number of projects posted if you remember we have done that 15 00:01:09,330 --> 00:01:14,830 one in the case we have schools in the previous one. 16 00:01:15,150 --> 00:01:22,590 And we have used schools state that is maybe the first one that we have done here. 17 00:01:22,600 --> 00:01:28,520 School straight goes this one and is this one. 18 00:01:28,950 --> 00:01:34,500 And then we will count the values that how many of that type out of eleven. 19 00:01:35,820 --> 00:01:38,090 So here just count. 20 00:01:38,250 --> 00:01:46,080 And then the second one is the one that we have done in the last video that is we have data for and 21 00:01:46,080 --> 00:01:55,740 we first group all the data by donor state so that we have these states according to the donors and 22 00:01:55,740 --> 00:02:02,400 then we will count the number of items available there which will be equal to the number of donations 23 00:02:02,400 --> 00:02:05,790 because I decided presenting the donations. 24 00:02:06,260 --> 00:02:09,920 So instead of counting them we can just count the ibis. 25 00:02:09,990 --> 00:02:19,370 So hey we have these two datasets and discern is s now build most of them by using couldn't coordination. 26 00:02:19,500 --> 00:02:29,340 Here I have RDF and I will do PD dot com get they began to discern and provide the values that need 27 00:02:29,340 --> 00:02:40,980 to be concatenated like I have s full and S5 after that positive barometers like X is I need to 1 instead 28 00:02:40,980 --> 00:02:47,700 of zeros then I have keys and keys are the projects and donations 29 00:02:49,940 --> 00:02:51,480 maybe Edison 30 00:02:54,220 --> 00:02:57,730 and David go the data. 31 00:02:57,980 --> 00:03:07,210 Now if you print death first that we've been ahead Dave we have this one we have states then we have 32 00:03:07,210 --> 00:03:13,330 projects and we have donation we have conquered needed this one only that these two keys projects and 33 00:03:13,330 --> 00:03:19,630 donations and we have the does this for an S5 that is first one the school states and we have counted 34 00:03:19,630 --> 00:03:25,720 all the values and then we have beautiful group donor state and counted the donations I despairs and 35 00:03:25,740 --> 00:03:35,280 then so get the decision and if you print this one simply you will find we have all these states the 36 00:03:35,300 --> 00:03:42,430 deep project and donations made but in the last we have a debt that is an empty value men but if it 37 00:03:42,430 --> 00:03:50,590 has donations so do you not have any projects but it has donation maybe something Blechman but we need 38 00:03:50,590 --> 00:03:52,510 to drop that well. 39 00:03:52,780 --> 00:04:02,230 So here we will just do deserve equal to be f don't drop any so we will drop that null value now if 40 00:04:02,230 --> 00:04:08,680 you print this one you will find that you do not have that particular value now and that is where they 41 00:04:08,680 --> 00:04:13,680 drop a name though I have not shown that bonanza before. 42 00:04:13,690 --> 00:04:21,460 Now here we had the head they legal the Denver now we will visualize this one by just simply D after 43 00:04:21,550 --> 00:04:30,100 a plot and we need a scatter probe there if you remember biased get to produce lays it. 44 00:04:30,300 --> 00:04:38,160 So here we have this one scatter There we go and then we have X titan 45 00:04:41,700 --> 00:04:49,350 that is the projects this time invite title is the donations. 46 00:04:49,800 --> 00:05:03,210 So here we have donations then we have the title of this float that is simply projects was is donations 47 00:05:05,010 --> 00:05:20,830 to go after that the wide symbol equal to x and color scale equal to be it shifted on David go the debt. 48 00:05:21,300 --> 00:05:28,560 So here we have this one but this one is form of Inform of line we need to add one more Bama to them 49 00:05:28,950 --> 00:05:35,650 so find a space and then decode and just write 50 00:05:38,320 --> 00:05:43,850 Ma ko so it is more equal to Marcus. 51 00:05:44,380 --> 00:05:53,150 So here we had the son Marcus will shift Britain and then we had this when I started plotting form of 52 00:05:53,540 --> 00:05:55,440 this cross team bus. 53 00:05:55,520 --> 00:06:02,530 So now we have plotted despite now we need to combine the data to visualize that. 54 00:06:02,830 --> 00:06:08,510 So in order to enter a discussion we must get the number of projects posted and the number of tuition. 55 00:06:08,590 --> 00:06:10,420 Then we should must this data. 56 00:06:10,420 --> 00:06:11,200 This too. 57 00:06:11,320 --> 00:06:13,030 And plotters get to visualize it. 58 00:06:13,570 --> 00:06:14,990 So here we have done this one. 59 00:06:15,040 --> 00:06:19,200 But now we also require to much upload with that one. 60 00:06:19,270 --> 00:06:23,530 Here we have now done is the relationship between number of projects offered and the donations made 61 00:06:23,530 --> 00:06:24,810 by donor. 62 00:06:24,850 --> 00:06:31,030 This is what we have done here and now fit a linear model which would basically indicate a relationship 63 00:06:31,030 --> 00:06:32,950 between projects and donations. 64 00:06:32,950 --> 00:06:41,400 So we need to create some particular relation in between the projects and donations so that there will 65 00:06:41,400 --> 00:06:44,920 be a linear linear go is something a straight line we can see. 66 00:06:45,420 --> 00:06:49,830 If I have discovered in a straight line going down in which X and Y values are equal. 67 00:06:50,880 --> 00:06:55,140 So for that we define two way above the slope and intercept 68 00:06:58,040 --> 00:07:02,350 and they will be defined by using a method that is known as poly fit. 69 00:07:02,540 --> 00:07:09,320 This is a method that simply return a polynomial of a particular degree it requires three parameters 70 00:07:10,160 --> 00:07:17,000 generated by mode but the basic one of three that is X flight and then degree of what degree we require 71 00:07:17,030 --> 00:07:17,960 the polynomial. 72 00:07:18,350 --> 00:07:21,380 So in X we just two days after the projects 73 00:07:26,100 --> 00:07:30,430 in case of value we just simply do D F for donations. 74 00:07:30,460 --> 00:07:34,510 So here we have donations and we require a degree. 75 00:07:34,530 --> 00:07:40,040 1 This is something you can do search for qualified a home book for you. 76 00:07:40,120 --> 00:07:49,510 After that we have X and Y No we required to define the excesses so X is something and B don't every 77 00:07:52,020 --> 00:08:01,020 and in that one the bus B after projects and we will go for the minimum and maximum values. 78 00:08:01,950 --> 00:08:10,620 So first minimum and after that one will pass the F not projects 79 00:08:13,200 --> 00:08:21,870 and then the maximum because we need to figure it out here on the cushion visited performing better 80 00:08:21,870 --> 00:08:22,440 in this case. 81 00:08:22,440 --> 00:08:25,830 How many of responding projects request below average. 82 00:08:25,830 --> 00:08:28,950 And which states are performing best in terms of donations per project. 83 00:08:29,250 --> 00:08:35,570 So we simply need to find a minimum and maximum values so that we can visualize that one that real are 84 00:08:35,580 --> 00:08:40,650 the maximum values and we're at the minimum values that which states are denoting the maximum one and 85 00:08:40,650 --> 00:08:42,100 which are denoting the minimum. 86 00:08:42,360 --> 00:08:50,210 So for that when we require this x variable in which we have minimum and maximum values of project so 87 00:08:50,210 --> 00:08:57,560 there is word here X is I hope you get the idea of what the X is more you will understand this one while 88 00:08:57,560 --> 00:09:01,420 we complete the other remaining problems here. 89 00:09:01,760 --> 00:09:13,520 Then we have the right that will be equal to that slope and multiply by x plus intercept. 90 00:09:13,520 --> 00:09:22,980 Now if you plot this one you will get a straight line X comma white shift written day we have a straight 91 00:09:22,980 --> 00:09:24,140 line. 92 00:09:24,580 --> 00:09:26,560 I hope you go that what I have done here. 93 00:09:26,590 --> 00:09:31,410 If your mental math is mathematics student then this one is very easy for you just. 94 00:09:31,420 --> 00:09:38,910 I have created a slope and intercept that is of these two which is a polynomial of degree. 95 00:09:39,490 --> 00:09:43,990 It provides me slope of that particular location and the intercept of that one. 96 00:09:44,140 --> 00:09:52,360 And if you remember the general equation for any go is Vi's equal to emacs plus C the M is the slope 97 00:09:52,830 --> 00:09:57,800 X is the variable C is any constant C the intercept of these two lines. 98 00:09:58,330 --> 00:09:59,530 And then we have divide. 99 00:10:00,460 --> 00:10:06,640 So I have defined a variable X that is denoting an area of the minimum and maximum values of the project. 100 00:10:06,760 --> 00:10:13,060 And then I have defined a variable VI which is the general equation slope plus intercept then I have 101 00:10:13,070 --> 00:10:17,970 P BLT dot plot and simply I have plotted to discuss. 102 00:10:18,270 --> 00:10:23,430 I hope you got this one now and if you get any problem then please ask me the cushions. 103 00:10:23,460 --> 00:10:28,320 After that we need to combine these plots so combining the plots is very simple. 104 00:10:28,710 --> 00:10:29,740 We just have first. 105 00:10:29,730 --> 00:10:32,860 Do you have good plot not shatter. 106 00:10:32,880 --> 00:10:38,630 The first one this time not by using the plot because we required to combine these two. 107 00:10:38,790 --> 00:10:46,530 I have X is equal to projects the same this one but in form of a simple plot. 108 00:10:46,530 --> 00:10:55,130 So here I have projects and then I have Y equal to donations. 109 00:10:55,160 --> 00:10:56,040 Then we had this one. 110 00:10:57,060 --> 00:10:59,580 If you plot that one you will have this simple curve. 111 00:10:59,760 --> 00:11:02,760 So a simple plot of scatter plots. 112 00:11:03,000 --> 00:11:08,910 Then after this we will add this whole currency. 113 00:11:09,480 --> 00:11:13,200 Here in this set come on 3. 114 00:11:13,260 --> 00:11:15,460 So here we had this slope and intercept. 115 00:11:15,460 --> 00:11:20,900 We have defined and qualified poly fit with projects and donations with the group 1. 116 00:11:21,000 --> 00:11:23,020 Then I have the maximum and minimum value. 117 00:11:23,040 --> 00:11:28,680 Edit That is the x and then I have VI that is the slope into x plus intercept. 118 00:11:28,830 --> 00:11:30,400 Here I have plotted these two. 119 00:11:30,680 --> 00:11:31,280 No. 120 00:11:31,350 --> 00:11:40,680 Just simply provide BLT or type underscored layout for a proper view and also provide a margin there 121 00:11:40,690 --> 00:11:42,390 that is BLT. 122 00:11:42,780 --> 00:11:43,890 Note margin 123 00:11:47,590 --> 00:11:51,320 and this one zero point zero five shift return day. 124 00:11:51,340 --> 00:11:58,990 We have a combined plot here if you notice this go the values that are about this one showing the states 125 00:11:58,990 --> 00:12:04,750 in which we have better donations and the values below this one representing that is not performing 126 00:12:04,870 --> 00:12:05,210 good. 127 00:12:05,860 --> 00:12:14,810 So this is the visualization by design of this particular problem that which states which projects request 128 00:12:14,810 --> 00:12:19,420 below average and which states are performing best in terms of donations. 129 00:12:19,460 --> 00:12:26,420 So this goes on about this straight line representing districts which are better donations according 130 00:12:26,420 --> 00:12:29,320 to the number of projects they can cannot just simply conclude. 131 00:12:29,330 --> 00:12:34,670 This one like this one has the maximum number of donations so this is best if you notice this one is 132 00:12:34,670 --> 00:12:40,430 also the maximum number of projects and the best means that according to the number of projects we have 133 00:12:40,430 --> 00:12:42,040 a better number of donations. 134 00:12:42,080 --> 00:12:49,510 So this call there is this simple line dividing the plane into two halves is just simply representing 135 00:12:49,730 --> 00:12:54,540 we can see a boundary line below which the plots are not performing well. 136 00:12:54,800 --> 00:13:00,150 They have met more number of projects but let's number of donations about this one. 137 00:13:00,230 --> 00:13:03,870 We have the plots which are generally presenting lower number of projects. 138 00:13:03,900 --> 00:13:09,350 So we can see the number of projects fulfilled by the number of donations. 139 00:13:09,380 --> 00:13:15,230 So it means these states like when we have this one two thousand and the about this line this dude is 140 00:13:15,230 --> 00:13:16,350 performing well. 141 00:13:16,670 --> 00:13:19,380 So I hope you get this and lies that. 142 00:13:19,520 --> 00:13:24,260 What do I need to show you here that this girl is just a special light. 143 00:13:24,350 --> 00:13:31,550 I need to plot as straight go so that we can have the idea of maximum and minimum values about this. 144 00:13:31,550 --> 00:13:33,830 We have the better one below this. 145 00:13:33,920 --> 00:13:36,110 We have the one that under performing well. 146 00:13:36,530 --> 00:13:39,110 So and then analyzes them. 147 00:13:39,170 --> 00:13:40,100 Now we will move that. 148 00:13:40,100 --> 00:13:43,130 How many different projects type sexist. 149 00:13:43,160 --> 00:13:43,910 So try. 150 00:13:43,910 --> 00:13:47,540 If you are feeling comfortable with this one and see in the next video.