1 00:00:05,740 --> 00:00:06,750 Hi, welcome back. 2 00:00:06,760 --> 00:00:12,130 And this lecture, we will learn how to use vendors to do analysis and statistics to our cities, we 3 00:00:12,130 --> 00:00:18,280 find that as we learn in the last lecture, we first bought Banda's as the following bought vendors, 4 00:00:18,280 --> 00:00:20,010 as did the runners. 5 00:00:20,530 --> 00:00:30,670 So then we will use Banda's to read our CSP file data that read underscores the ASV between two Brackett's. 6 00:00:32,120 --> 00:00:38,630 Our uploaded CSP file between two branches inside the two brackets. 7 00:00:40,000 --> 00:00:49,840 Dulcy ASV and run the cell, if we want to know data type of columns in our CHP file, we will write 8 00:00:49,840 --> 00:00:56,660 the following the F dot types method and run the cell after we run the cell. 9 00:00:57,160 --> 00:01:05,320 We find that passenger name and partial city and rival city columns are of object data type, which 10 00:01:05,320 --> 00:01:16,930 in data frame that R of string data type day and month of and sixty four, which means that these columns 11 00:01:16,930 --> 00:01:19,390 are of actual data type. 12 00:01:19,870 --> 00:01:25,270 The last column is SAYES, which is of float sixty four. 13 00:01:26,320 --> 00:01:34,210 That means that float data ties and we will use another vendor's methods which will describe the statistics 14 00:01:34,210 --> 00:01:39,750 of our data sheet of the columns that have numbers values. 15 00:01:39,760 --> 00:01:46,870 The DOT described methods to witness account mean standard deviation, the minimum, the maximum values 16 00:01:47,140 --> 00:01:51,700 of the three columns that contain numbers values. 17 00:01:51,940 --> 00:02:00,610 It also provides the first time, which is twenty five percent and the second quartile, which is seventy 18 00:02:00,610 --> 00:02:06,080 five percent and median for the median, which is fifty percent. 19 00:02:06,370 --> 00:02:13,690 We will focus on sales column as this column contains the data that we can analyze and get the statistics 20 00:02:13,900 --> 00:02:19,180 from the sales of tickets and our database airlines. 21 00:02:19,450 --> 00:02:27,040 What is that e plus any number that the value is multiplied by ten numbers of time. 22 00:02:27,050 --> 00:02:38,470 So it plus six is the means that value multiplied by ten six times the F dot sales dot described method 23 00:02:38,770 --> 00:02:39,970 and run this. 24 00:02:40,120 --> 00:02:47,860 So entertaining the account means standard deviation, the minimum and the maximum values of sales of 25 00:02:47,860 --> 00:02:48,310 tickets. 26 00:02:48,460 --> 00:02:56,320 Also it provides the first quartile, which is twenty five percent and the second quartile, which is 27 00:02:56,320 --> 00:03:01,120 seventy five percent and the median which is fifty percent. 28 00:03:01,420 --> 00:03:03,460 Also there are two methods. 29 00:03:03,460 --> 00:03:11,020 One for ZAMEEN as the following between two square brackets, sales inside the two square brackets between 30 00:03:11,020 --> 00:03:21,040 two parentheses not me method and runs SL to return the main value of the sales value inside our C as 31 00:03:21,040 --> 00:03:22,600 we datasheet. 32 00:03:22,750 --> 00:03:26,140 I forget to add the two brackets after that mean. 33 00:03:28,190 --> 00:03:37,520 As a full two hour meeting, value of the sales is one hundred and six thousand and one hundred eighty 34 00:03:37,520 --> 00:03:45,680 four point forty between two square records sales inside the two square brackets between two parentheses, 35 00:03:46,130 --> 00:03:49,610 then dot median two brackets running the sell. 36 00:03:49,610 --> 00:03:56,810 We get the median value of sales column, which is seventy five thousand two hundred. 37 00:03:59,240 --> 00:04:00,440 Thanks for watching. 38 00:04:00,590 --> 00:04:02,130 See you next week. 39 00:04:02,230 --> 00:04:02,380 You.