1 00:00:00,170 --> 00:00:07,850 If you were tasked to gather information from a company website, then the file type search operator 2 00:00:07,850 --> 00:00:16,790 is going to be vitally important, because this search operator will allow you to search for file types 3 00:00:16,820 --> 00:00:23,990 across the internet, which means that you can tell Google to show you search results that are only 4 00:00:23,990 --> 00:00:31,910 PDF files, Excel files, word files, PowerPoint files, etc. and we can combine it with the site search 5 00:00:31,940 --> 00:00:37,430 operator to find a specific file type on a certain website. 6 00:00:37,910 --> 00:00:46,790 For example, here we have specified the site as the Security.org and we are searching for all the indexed 7 00:00:46,790 --> 00:00:52,190 PDF files on Google from the domain name security.org. 8 00:00:53,000 --> 00:00:55,520 So let's see an example. 9 00:00:55,820 --> 00:01:02,120 I'm going to search for all the PDF files that are indexed by Google from the domain name z security.org. 10 00:01:02,120 --> 00:01:03,380 I'm going to hit enter. 11 00:01:04,160 --> 00:01:08,150 And then we can see that we have 82 search results. 12 00:01:08,150 --> 00:01:10,970 And all of these search results are PDF files. 13 00:01:11,840 --> 00:01:13,730 So let me open the first file. 14 00:01:14,480 --> 00:01:21,650 And this is basically a walkthrough of what we can do is download this file on our machine. 15 00:01:24,080 --> 00:01:31,160 And then go to a website that will allow us to read metadata from this file. 16 00:01:31,940 --> 00:01:39,200 And metadata are information that are embedded with a PDF file, or with an image that will allow you 17 00:01:39,200 --> 00:01:44,180 to see the creation date, the creator, producer, author, etc.. 18 00:01:44,180 --> 00:01:49,160 So these are some information that can be only found if you read the metadata. 19 00:01:49,160 --> 00:01:55,610 Sometimes you'll be able to find some geolocation information in the metadata, which is something that 20 00:01:55,610 --> 00:01:58,400 I'm going to cover in the next lectures. 21 00:01:58,850 --> 00:02:02,760 So now let me upload the file that we have downloaded. 22 00:02:02,760 --> 00:02:05,760 And then I'm going to say read PDF metadata. 23 00:02:06,660 --> 00:02:10,800 And you can see that this is the author, which is something that we cannot find. 24 00:02:10,830 --> 00:02:18,480 If you open the file, we can also see the exact time when this file has been created and with which 25 00:02:18,510 --> 00:02:21,360 software this file has been created. 26 00:02:21,390 --> 00:02:29,730 Sometimes you will see old software versions that has many vulnerabilities that a hacker could exploit. 27 00:02:30,420 --> 00:02:36,870 Here is another example to find PDF files that are related to the name Rishi Kabra. 28 00:02:36,990 --> 00:02:45,030 I'm going to hit enter and I can scroll down and see that there is a document on this website. 29 00:02:45,840 --> 00:02:47,940 So I'm going to open this document. 30 00:02:49,110 --> 00:02:56,610 And if I zoomed in, I can see the names of the people who contributed in this document. 31 00:02:56,790 --> 00:02:59,280 So we can see the name Rishi Kabra. 32 00:03:00,370 --> 00:03:06,490 And we can also see an email of this person which is right here. 33 00:03:06,490 --> 00:03:09,460 Rishi Kabra 132 at gmail.com. 34 00:03:09,880 --> 00:03:14,770 And this is a username that we have previously saved in our notepad. 35 00:03:16,210 --> 00:03:22,600 So as you can see here, he used his GitHub username as his email address. 36 00:03:23,800 --> 00:03:27,490 Coming back to the search results, we can scroll down a little bit. 37 00:03:27,490 --> 00:03:35,650 And then we can see a search result of a PDF file that is published on the university of where Rishi 38 00:03:35,650 --> 00:03:36,640 Kabra was. 39 00:03:37,060 --> 00:03:39,430 So this is called SRM something. 40 00:03:39,430 --> 00:03:47,440 If we went to Rishi Kabra LinkedIn account, we can also see that he was in the SRM University. 41 00:03:47,920 --> 00:03:56,440 And if we open this file we can search for the name Rishi Kabra, as you can see here. 42 00:03:56,980 --> 00:04:00,790 So his name were also mentioned in this document.