1 00:00:00,420 --> 00:00:05,460 So it looks like my file is completely uploaded because this number hasn't gone up in a little while. 2 00:00:05,460 --> 00:00:11,520 And now you may notice a little discrepancy between what the file appears to be in size here on my local 3 00:00:11,520 --> 00:00:15,170 machine seven twenty four point five and six ninety one. 4 00:00:15,530 --> 00:00:16,090 Mm hmm. 5 00:00:16,680 --> 00:00:22,710 But this has seven twenty four four hundred ninety five thousand nine hundred twenty six bytes which 6 00:00:22,710 --> 00:00:23,760 equates to that. 7 00:00:24,270 --> 00:00:31,080 So then maybe a little bit of a discrepancy in terms of how Google Drive Stores files compared to locally 8 00:00:31,080 --> 00:00:37,430 and how it represents size but it looks like all of the files of uploaded which is what we're after. 9 00:00:37,440 --> 00:00:44,900 So now we've got our zip file and our dog vision folder on our Google Drive. 10 00:00:45,070 --> 00:00:50,200 The next step is to unzip it much like we could locally. 11 00:00:50,200 --> 00:00:55,710 So if I just double click here it's going to expand dog breed ID dot zip. 12 00:00:55,720 --> 00:00:57,190 So we'll have a look at what's inside 13 00:01:07,660 --> 00:01:09,470 Beautiful. 14 00:01:09,540 --> 00:01:17,500 So this is what we're off to try and test sample submission labels dot CSB and this is what Kaggle says 15 00:01:17,500 --> 00:01:23,590 it's gonna give us if we were to download the zip labels sample submission test and train and in labels 16 00:01:23,590 --> 00:01:26,710 we have ten thousand two hundred twenty two unique values. 17 00:01:26,710 --> 00:01:27,150 OK. 18 00:01:27,220 --> 00:01:28,090 Wonderful. 19 00:01:28,090 --> 00:01:33,460 So these are the kind of things that we can test to make sure that even if we see a little discrepancy 20 00:01:33,460 --> 00:01:37,330 here we can test once we've got our data in our notebook. 21 00:01:37,330 --> 00:01:39,410 Hey does this line out with Kaggle. 22 00:01:39,410 --> 00:01:41,680 So that's what we'll have to look at now. 23 00:01:41,840 --> 00:01:49,310 So how do we unzip this in Google Drive and maybe a way we can do it from here. 24 00:01:49,560 --> 00:01:51,260 Well we're going to see how to do it with code. 25 00:01:51,520 --> 00:01:55,460 So let's go unzip and now remember these cells here. 26 00:01:55,470 --> 00:02:00,840 Just like in Jupiter if we want to run something that's like on the command line we can use the bank 27 00:02:00,840 --> 00:02:02,900 symbol here or the exclamation mark. 28 00:02:02,910 --> 00:02:06,090 So this is going to go unzip and then we give it a file. 29 00:02:06,180 --> 00:02:07,500 We need the file path here. 30 00:02:07,560 --> 00:02:12,310 I believe he might be had to go coffeepot oh wait you can. 31 00:02:12,310 --> 00:02:14,720 That's going to save us some time. 32 00:02:14,830 --> 00:02:16,000 Beautiful. 33 00:02:16,000 --> 00:02:19,640 We could do that or we could just type it out for practice. 34 00:02:19,870 --> 00:02:24,920 So you actually don't need content little tidbit there so we can go here. 35 00:02:24,960 --> 00:02:26,310 Let's see how we top it out. 36 00:02:26,340 --> 00:02:29,090 Drive because we're gonna drive here. 37 00:02:29,190 --> 00:02:30,420 Drive my drive. 38 00:02:30,420 --> 00:02:32,910 Don't forget the space my drive. 39 00:02:32,910 --> 00:02:37,440 Yours might be named a little bit different to mine but this is just what mine looks like. 40 00:02:37,440 --> 00:02:38,630 Our target file is there. 41 00:02:38,660 --> 00:02:41,600 So your target file be the same as mine. 42 00:02:41,760 --> 00:02:42,690 My drive. 43 00:02:42,690 --> 00:02:46,080 And then the project folder is dog vision. 44 00:02:46,110 --> 00:02:56,430 So we need to go dog vision dog space vision and then dog breed identification dog zip. 45 00:02:56,690 --> 00:02:58,430 Is that smell right. 46 00:02:58,430 --> 00:02:59,240 I believe so. 47 00:02:59,690 --> 00:03:05,570 And then we're going to posit the D tag the destination and then we want to put it in drive just the 48 00:03:05,570 --> 00:03:15,230 exact same forward I drive my drive dog vision we just want to unzip it where it leaves I believe that 49 00:03:15,230 --> 00:03:15,890 should be enough 50 00:03:18,720 --> 00:03:19,970 wonderful. 51 00:03:20,250 --> 00:03:25,130 Let's run this so we can shift it into what we could do that little play button. 52 00:03:25,130 --> 00:03:32,750 This little play button here and now we're gonna get a big output of messages here because it has to 53 00:03:32,750 --> 00:03:38,530 go through and unzip 10000 files into dog vision. 54 00:03:38,530 --> 00:03:44,680 Now it may take a little while for this folder to update over here but we can be sure that it's running 55 00:03:44,680 --> 00:03:46,410 by having a look at all this. 56 00:03:46,570 --> 00:03:49,810 So say these are all test images. 57 00:03:49,880 --> 00:03:52,130 Now again this might take a little while. 58 00:03:52,190 --> 00:03:57,900 So while this code is running I'm going to stop this video here and we'll come back in the next one. 59 00:03:57,980 --> 00:04:03,710 Once we've unzipped all of our data in the dog vision file and then we'll have our data ready to be 60 00:04:03,710 --> 00:04:05,820 used for our project. 61 00:04:05,900 --> 00:04:06,620 There we go. 62 00:04:06,630 --> 00:04:08,420 We've got some updates. 63 00:04:08,420 --> 00:04:10,140 So this might take a few minutes to run. 64 00:04:10,190 --> 00:04:15,770 Make sure it's fully run before we go into the next segment because otherwise if it's not fully run 65 00:04:15,830 --> 00:04:23,170 you'll come into the same mistake I made when trying to work with your data before it had been unzipped. 66 00:04:23,430 --> 00:04:30,600 And so you end up working with less files than what Kaggle says there is and that causes whole bunches 67 00:04:30,600 --> 00:04:31,960 of errors down the track. 68 00:04:32,070 --> 00:04:37,110 So we'll let this run it may take a few minutes but I'll see in the next video once this is completed 69 00:04:37,410 --> 00:04:40,140 and we'll go through what it's unzipped.