1 00:00:05,130 --> 00:00:07,050 Hi there, welcome back. 2 00:00:07,170 --> 00:00:17,610 In this video, you will learn how to install an El Python module to use it with Python Banda's module 3 00:00:17,610 --> 00:00:22,550 to scrap HTML5 from a specific webpage. 4 00:00:24,000 --> 00:00:34,440 So we will open ACMD and install the XML file module as the following web install l x m l. 5 00:00:34,800 --> 00:00:45,060 Then we will return to our project at the lab and import pendas module as the following on board Banda's 6 00:00:45,060 --> 00:00:46,180 as PDE. 7 00:00:50,650 --> 00:01:01,880 Then after run the cell, we will assign a variable called data to the data that Banda's has read from 8 00:01:01,880 --> 00:01:15,570 a webpage that we specified inside the Reed Underscore XHTML method and the following data equal b'day 9 00:01:15,580 --> 00:01:24,000 dot Reed underscore Actimel between two brackets and two parentheses at our website, then run the cell. 10 00:01:25,420 --> 00:01:33,820 Then we will print the data, which contains all the data from the HTML file of the web page as you 11 00:01:33,820 --> 00:01:35,500 see in the form of list. 12 00:01:35,500 --> 00:01:43,090 And then we will print the first item in the list and assign this first item of our list in that a frame 13 00:01:43,090 --> 00:01:45,610 called the F, then print the data frame. 14 00:01:45,760 --> 00:01:48,570 It contains the table from our Web page. 15 00:01:48,580 --> 00:01:56,890 So if we want to get the first column, we will type the following code using our data framed F as the 16 00:01:56,890 --> 00:02:01,590 following the F between two square brackets, the column name. 17 00:02:01,900 --> 00:02:05,770 So the F between two square brackets, 18 00:02:08,410 --> 00:02:17,590 popularity, unique visitors per month at will print this column and return all values inside this column, 19 00:02:18,010 --> 00:02:18,730 as you see. 20 00:02:20,470 --> 00:02:31,570 Also, we will do that for the websites column and front end side column also as a follow. 21 00:02:40,650 --> 00:02:51,030 But in case of website, this is a column name consists of one world, so we can use that column name 22 00:02:51,240 --> 00:02:51,830 method. 23 00:02:54,260 --> 00:03:02,450 The F between two brackets, back end server side, it will print that column. 24 00:03:04,520 --> 00:03:16,490 Now we want to do statistics to the column of popularity, unique visitor Bill Months as a. 25 00:03:18,740 --> 00:03:23,450 We will use the following method, the first method. 26 00:03:23,450 --> 00:03:30,500 We will use the F between two square brackets dot method that will return. 27 00:03:30,650 --> 00:03:40,280 The mean of that, the values of the popularity unique visitor Burmans dot mean medium for million dollar 28 00:03:40,280 --> 00:03:42,560 standard deviation for standard deviation. 29 00:03:42,560 --> 00:03:52,820 The variance for variance and describe for all of that count mean standard deviation minimum and twenty 30 00:03:52,820 --> 00:03:55,310 five which is the fairest quartile. 31 00:03:55,940 --> 00:04:00,280 Fifty percent is a second Alstyle and is the median in the same time. 32 00:04:00,410 --> 00:04:06,620 Seventy five percent which is third quarter and the maximum value. 33 00:04:07,280 --> 00:04:12,170 That's all for the statistics for that column. 34 00:04:12,740 --> 00:04:25,310 So if we want to do all statistics for this specific column that contains numbers, we use those described 35 00:04:25,310 --> 00:04:25,900 methods. 36 00:04:28,820 --> 00:04:31,550 At this point, we reached the end of this lecture. 37 00:04:31,580 --> 00:04:36,320 I hope you enjoyed this lecture and to get all of that, thank you for being here. 38 00:04:37,280 --> 00:04:40,160 That's all for this video. 39 00:04:40,490 --> 00:04:41,120 Thank you. 40 00:04:57,310 --> 00:04:58,480 Thanks for watching. 41 00:04:58,630 --> 00:05:00,420 See you next with you.