1 00:00:06,070 --> 00:00:11,210 Everyone so from this video they are going to do some exciting stuff here. 2 00:00:11,210 --> 00:00:12,860 Selfless dusk. 3 00:00:12,860 --> 00:00:16,210 Here we have is create new data by using the data sets. 4 00:00:16,250 --> 00:00:20,090 We are going to create some new data by using these datasets that we have. 5 00:00:20,750 --> 00:00:22,790 So let's begin with that one. 6 00:00:23,000 --> 00:00:24,350 What we are going to do here. 7 00:00:24,380 --> 00:00:31,170 We are going to define a data set that contains all these files in only one file or we can say in one 8 00:00:31,170 --> 00:00:31,710 data. 9 00:00:32,280 --> 00:00:40,290 So we will march all these available data frames here one by one and begin with donations and see which 10 00:00:40,470 --> 00:00:44,220 one not out of the other five we can merge with this one. 11 00:00:44,310 --> 00:00:50,220 So here if you notice first of all we have project data and find the one with Project data. 12 00:00:50,360 --> 00:00:50,910 Go. 13 00:00:51,510 --> 00:00:56,750 So we will take this one as we can see key and merge these files. 14 00:00:56,760 --> 00:01:02,960 So first of all here we have our data and demanded multiple use Bebe dot much 15 00:01:06,330 --> 00:01:11,070 and then provides the file first one is one we have donations. 16 00:01:11,070 --> 00:01:17,310 Second one is one projects and how we will most them enough. 17 00:01:17,970 --> 00:01:24,650 If you remember this one how in the outer left right then we have on the vote. 18 00:01:24,660 --> 00:01:34,000 We can say key according to which we are going to most this one that is project Heidi B's capital and 19 00:01:34,090 --> 00:01:35,920 I N D also capital them. 20 00:01:36,190 --> 00:01:41,170 So make sure you are also using the capital votes if there is any capital vote otherwise you will get 21 00:01:41,170 --> 00:01:49,900 any DNA or find note found later shifted on that one and it will take a little time to most these vice 22 00:01:51,810 --> 00:02:01,010 and here that we write and then data head and possibly leave to do them so that we can have the idea 23 00:02:01,010 --> 00:02:03,680 of the columns that are going to be in one 24 00:02:06,560 --> 00:02:08,700 this one is taking long time. 25 00:02:09,150 --> 00:02:14,830 So let me just check out the columns by Eva said Lim. 26 00:02:14,930 --> 00:02:16,090 Now we have the data. 27 00:02:16,340 --> 00:02:20,350 And the second idea is donation and then we have Donna. 28 00:02:20,810 --> 00:02:22,790 So check for this one has Donna writing. 29 00:02:22,820 --> 00:02:30,380 So will most the data the donors the data that we are going to get here is the one that have already 30 00:02:30,380 --> 00:02:40,980 these columns so data to be really defined something like BD don't much this one is don't know if I 31 00:02:40,980 --> 00:02:51,370 did on this one I have projected a donation I don't write writing so I will march this data with donors 32 00:02:53,160 --> 00:02:55,010 and then again in the 33 00:02:58,940 --> 00:03:02,430 and the key this time is going to be the donor right. 34 00:03:02,750 --> 00:03:05,390 So here the capital owner. 35 00:03:05,570 --> 00:03:11,750 And then I'd be again these are capitalism shifted on. 36 00:03:11,850 --> 00:03:21,050 David go with that one not the one that is going with this one is going to have the I.D. of donations. 37 00:03:21,100 --> 00:03:25,340 We have this one donations Donna and projects. 38 00:03:25,420 --> 00:03:30,030 Now let me move to another one that is resources schools and DHL. 39 00:03:30,310 --> 00:03:34,530 So if you notice Project have also school lately I have noticed this one here. 40 00:03:35,110 --> 00:03:43,600 And also a teacher writing so we can mostly the data the schools and the teachers by using the teacher 41 00:03:43,600 --> 00:03:44,760 I.D. and school I.D.. 42 00:03:45,820 --> 00:03:59,430 So we really do hear data three dot much then we will pass data to help and then what we are going to 43 00:03:59,430 --> 00:04:00,430 miss this one. 44 00:04:00,440 --> 00:04:00,720 Wait. 45 00:04:00,720 --> 00:04:06,540 First let me go with the schools that he had discerned how in 46 00:04:09,390 --> 00:04:14,630 then we have the key on equal to and the one we are going to use here. 47 00:04:14,640 --> 00:04:23,550 These school lady so escapee done school and then I did this one is not most to know. 48 00:04:23,810 --> 00:04:26,300 So we are not going to run this one. 49 00:04:26,600 --> 00:04:34,000 We will run this one after this one completely matches and then we have data for that will be PD don't 50 00:04:34,030 --> 00:04:44,150 much and then in that run we will provide data three this one is taking time because these fires are 51 00:04:44,150 --> 00:04:46,810 very large so did burn. 52 00:04:46,820 --> 00:04:56,780 Now if you print data or head right now you will have project I.D. donation I.D. and on writing. 53 00:04:56,830 --> 00:05:01,640 Now we have marked this one with the school Lady by using that one with schools. 54 00:05:01,870 --> 00:05:07,920 So shifted on that one and did it get done let me complete our syntax here. 55 00:05:08,670 --> 00:05:11,160 What we are going to merge here is the teachers now 56 00:05:14,190 --> 00:05:16,680 and how it will do enough. 57 00:05:17,010 --> 00:05:26,720 Then we have a equal to what they're going to use our teacher writing each US based I.D. then we are 58 00:05:26,720 --> 00:05:34,860 only left with assume resources don't CSC that. 59 00:05:35,510 --> 00:05:37,530 Here we have this one resources. 60 00:05:37,580 --> 00:05:39,460 So we are not going to miss this one. 61 00:05:40,590 --> 00:05:45,510 Because we are not going to require that when you know a complete dataset even this one has a project 62 00:05:45,510 --> 00:05:45,940 item. 63 00:05:45,960 --> 00:05:51,830 You can also use this one to most these files but we are not going to do that here. 64 00:05:52,640 --> 00:05:59,720 So this one is still processing and let it proved learn here. 65 00:06:01,160 --> 00:06:09,890 So let me pose the video and I will resume after completing these two processing so here. 66 00:06:09,900 --> 00:06:12,180 These two has done processing. 67 00:06:12,180 --> 00:06:21,890 And if you've been the head of the data for now you will get this one a complete most file of all these 68 00:06:21,890 --> 00:06:29,180 data frames and this one create contains all the data of all the five CSP files and just one that is 69 00:06:29,180 --> 00:06:32,160 resources towards CSC not included in this one. 70 00:06:32,390 --> 00:06:36,860 After that this one has a number of columns from each of these. 71 00:06:36,860 --> 00:06:44,810 So let me just print the columns here sort of in the columns who will take a variable equal to data 72 00:06:44,810 --> 00:06:46,700 for Dot. 73 00:06:46,700 --> 00:06:52,340 Then we will provide columns and we need their values. 74 00:06:52,340 --> 00:06:59,510 So values Hale and in which form in a least form shifted and if we have that one. 75 00:06:59,990 --> 00:07:07,230 And now to print that one just add a here shifted and we have all the columns present in Denver that 76 00:07:07,230 --> 00:07:12,560 is beginning from project idea donation idea as you can see and who's your daddy donation I lead on 77 00:07:12,590 --> 00:07:22,640 variety and like this one we have here approximately maybe 30 columns so this is about creating new 78 00:07:22,640 --> 00:07:26,000 data by merging the available data. 79 00:07:26,000 --> 00:07:27,140 So thanks for watching. 80 00:07:27,140 --> 00:07:29,840 We will continue in the next video the analyses. 81 00:07:29,900 --> 00:07:33,700 Now the questions are going to be something logical. 82 00:07:33,780 --> 00:07:36,800 Now they are only on data. 83 00:07:36,800 --> 00:07:37,960 So thanks for watching. 84 00:07:38,040 --> 00:07:39,020 Soon the next video.