1 00:00:01,140 --> 00:00:08,340 So in all of a previous session, we have done various data processing stage, we have done so the visas 2 00:00:08,340 --> 00:00:09,160 for our data. 3 00:00:09,510 --> 00:00:15,810 So in this session, I'm going to consider this assignment in which the very first statement in which 4 00:00:15,810 --> 00:00:23,580 I have to check what the coalition data and the second one is basically I have to analyze education 5 00:00:23,700 --> 00:00:29,160 status of the customers that is right now applying for a loan in a bank. 6 00:00:29,610 --> 00:00:37,740 So what I'm going to do, guys, very first, I have to just call some functions for the coalition purpose. 7 00:00:38,100 --> 00:00:43,610 So very first, I have to just call a function, which is my card. 8 00:00:44,010 --> 00:00:50,670 And if you are going to execute it, you will see coordination in terms of nomadic life, say, to make 9 00:00:50,670 --> 00:01:00,390 it more interactive, I have to call a heat map on these metrics that you will see or hear just executed 10 00:01:00,390 --> 00:01:04,170 and you will see these beautiful heat map. 11 00:01:04,530 --> 00:01:08,070 So let's say I'm going to customize this heat map. 12 00:01:08,310 --> 00:01:12,770 So for this, I have to just call, set or figure. 13 00:01:12,780 --> 00:01:16,460 And in this I'm going to set my own side. 14 00:01:16,680 --> 00:01:26,880 So for this, I have to see then comma six Vendel and just executed and you will see your heat map gets 15 00:01:27,090 --> 00:01:28,680 zoom in a little bit. 16 00:01:29,070 --> 00:01:37,100 So to show you all the coalition values, basically you have to pass your energy parameter as cool. 17 00:01:37,680 --> 00:01:45,370 So this is exactly the correlation between all the features to form this coalition table. 18 00:01:45,600 --> 00:01:53,450 You will see each and experience which are basically this one, are very highly correlated. 19 00:01:53,460 --> 00:02:03,750 It means it is fine for us to go with age and drop experience to avoid multipolarity because both of 20 00:02:03,750 --> 00:02:06,690 the features are doing the same task. 21 00:02:06,870 --> 00:02:11,850 So it will basically provide me more quality of the issue. 22 00:02:12,150 --> 00:02:16,610 So we have to basically avoid these multi the issue. 23 00:02:17,100 --> 00:02:21,690 So for these guys, I have to just drop any of the feature. 24 00:02:21,760 --> 00:02:27,700 Either we can drop each or either I can drop experience as well for four days. 25 00:02:27,720 --> 00:02:32,130 I'm going to show call a drop on my face. 26 00:02:32,310 --> 00:02:37,830 And here you have to say I'm going to drop my experience column. 27 00:02:38,130 --> 00:02:42,480 And for this you have to set your access parameters. 28 00:02:42,810 --> 00:02:47,910 Let's say after updating, I'm going to store all this stuff in data as well. 29 00:02:48,390 --> 00:02:53,640 So it is showing not not not because I didn't add E here. 30 00:02:53,670 --> 00:03:01,290 And I have just as you get it now, if I'm going to check ahead so you will see you don't have any of 31 00:03:01,290 --> 00:03:04,390 that Putin's column in your data. 32 00:03:04,860 --> 00:03:11,850 So now it's time for control of the statement in which I have to analyze education, the status of the 33 00:03:11,850 --> 00:03:15,780 customers that are going to apply for the loan in the bank. 34 00:03:16,440 --> 00:03:20,050 So let's say for this purpose, I'm going to check. 35 00:03:20,400 --> 00:03:20,900 Yeah. 36 00:03:21,120 --> 00:03:24,580 What are the unique values in my education column? 37 00:03:24,990 --> 00:03:27,620 So you will see the first two entry. 38 00:03:27,900 --> 00:03:35,070 So first of all, basically that goes to undergraduate there two will basically be efforts to graduate 39 00:03:35,640 --> 00:03:40,330 and three will basically refers to advanced or against the professional. 40 00:03:40,890 --> 00:03:45,740 So let's say I'm going to assign all the three values to one, two, three. 41 00:03:46,080 --> 00:03:49,470 So for this, I'm just going to create a function. 42 00:03:49,500 --> 00:03:54,090 Or you can also do the same task using the concept of the study as well. 43 00:03:54,540 --> 00:03:57,660 So let's say I want to define my function in the function. 44 00:03:57,660 --> 00:03:58,460 Name is Mark. 45 00:03:58,950 --> 00:04:06,480 So here, whatever acts it will receive and then I'm going to add up if s over here, so I'm going to 46 00:04:06,480 --> 00:04:15,120 see if that's equal to one, then basically I have to return undergraduate, so I'm going to say I'm 47 00:04:15,130 --> 00:04:16,470 the graduate. 48 00:04:16,770 --> 00:04:20,850 Then I have to add another condition as I live. 49 00:04:21,030 --> 00:04:26,370 And here my second condition is whenever I have to do so. 50 00:04:26,370 --> 00:04:30,240 In such case, basically I have to return graduate. 51 00:04:30,330 --> 00:04:32,580 So I'm going to see graduate. 52 00:04:32,790 --> 00:04:43,320 Then I have the last condition as an s blog and here I'm going to see I have to treat in such case, 53 00:04:43,650 --> 00:04:50,350 basically I have to return advanced or I can say professional, whatever you want. 54 00:04:50,370 --> 00:04:52,770 It's already so professional. 55 00:04:53,070 --> 00:04:59,370 So just execute this function and now you have to apply this function on your education. 56 00:05:00,430 --> 00:05:07,430 So education don't apply and you have to need the muscle function. 57 00:05:07,780 --> 00:05:11,940 So once I will apply this function, it will return me value. 58 00:05:12,190 --> 00:05:18,250 So whatever value it will return me, I want to store to underscore the mark. 59 00:05:18,790 --> 00:05:22,630 So just sign this and just execute it. 60 00:05:22,630 --> 00:05:29,500 And if you are going to call ahead, you will see this column has been added in your data form, which 61 00:05:29,500 --> 00:05:32,500 have different, different models as well. 62 00:05:32,860 --> 00:05:39,190 So now you have to analyze what are the categories of all the persons that are going to apply for the 63 00:05:39,190 --> 00:05:42,810 loan purpose in your bank for this purpose? 64 00:05:42,820 --> 00:05:49,990 I going use Blakley because I want to provide you already, in fact, a very beautiful resource, very 65 00:05:49,990 --> 00:05:51,660 particular with this one as well. 66 00:05:52,000 --> 00:05:58,030 So for this, I'm going to use pie chart because here I have only three categories, or you can use 67 00:05:58,180 --> 00:06:05,530 Barcia as well, but as having less number of categories, pie chart plays a vital role over there. 68 00:06:05,920 --> 00:06:07,790 So I'm going to use pie chart here. 69 00:06:08,590 --> 00:06:14,830 So here very first, if you are going to press shift crosstab, so you will see the very first one is 70 00:06:14,830 --> 00:06:19,180 my data frame and then I have to parse it in the form of values. 71 00:06:19,450 --> 00:06:20,290 So values. 72 00:06:20,650 --> 00:06:24,130 And then here the second one is my labels. 73 00:06:24,410 --> 00:06:26,680 So in values and in labels. 74 00:06:26,680 --> 00:06:30,190 And here you will see I have names. 75 00:06:30,400 --> 00:06:36,280 So these names and these values plays a major role in case of pie chart. 76 00:06:36,610 --> 00:06:40,860 So here and we do see values equals two. 77 00:06:41,290 --> 00:06:45,880 So let's say I'm going to define a new variable over here and here. 78 00:06:46,090 --> 00:06:48,820 I'm going to see you on this score this. 79 00:06:49,120 --> 00:06:54,610 And here I'm basically going to call off group on my data frame. 80 00:06:54,940 --> 00:06:58,770 So I have to group on the basis of this edu or the score. 81 00:06:59,740 --> 00:07:00,850 So you're doing a school model? 82 00:07:01,210 --> 00:07:10,990 Once I get a group by then, basically on the basis of each, I have to perform a function. 83 00:07:11,200 --> 00:07:12,550 So just execute it. 84 00:07:12,850 --> 00:07:23,610 Now is I'm going to print these edu, edu, edu press tab over here. 85 00:07:23,950 --> 00:07:30,550 And this is exactly my idea on a scale this so you will see it will contain all the values advanced 86 00:07:30,550 --> 00:07:31,570 have Dismas. 87 00:07:32,640 --> 00:07:41,400 Frequency graduate has this much frequency, an undergraduate has this much frequency now in value. 88 00:07:41,440 --> 00:07:50,010 You have to assign this edu underscore this now in names, you have to assign the names or I can see 89 00:07:50,010 --> 00:07:53,450 the index of this edu underscore this. 90 00:07:53,730 --> 00:07:57,690 So Edu on this score makes DOT index. 91 00:07:59,040 --> 00:08:06,270 And let's see, in case of assigning some titles, you can assign some titles using title parameter. 92 00:08:06,690 --> 00:08:14,670 So here I'm going to say title equals by chance and let's say this is exactly my figure and if you have 93 00:08:14,670 --> 00:08:22,530 to show, you have to just call our show on your FAQ, just execute it and it will give you a beautiful 94 00:08:22,920 --> 00:08:23,670 pie chart. 95 00:08:23,680 --> 00:08:29,610 So if you are going to hover your mouse, you will see forty one percent, which basically refers to 96 00:08:29,970 --> 00:08:38,040 this undergraduate, is 30 percent basically opposed to advanced or proficient on whatever you have 97 00:08:38,040 --> 00:08:40,770 assigned in here in this function. 98 00:08:41,280 --> 00:08:47,670 And this twenty one, twenty eight percent will basically refers to this graduate degree. 99 00:08:47,880 --> 00:08:55,020 So that's a type of analysis by using fighter or by using Bottcher, it's all up to you. 100 00:08:55,300 --> 00:09:01,970 What kind of ViSalus you are trying to provoke depending upon what statement you have. 101 00:09:01,980 --> 00:09:10,320 So if you have two entries from this visual, you can see we can see that we can have more number of 102 00:09:10,320 --> 00:09:19,020 undergraduate mythically forty one point nine percent then graduates and advanced professionals as 30 103 00:09:19,020 --> 00:09:19,580 percent. 104 00:09:20,100 --> 00:09:21,980 So hopefully you will love this. 105 00:09:22,440 --> 00:09:28,980 So in all our upcoming session, we are going to perform some beautiful analysis as well as I am trying 106 00:09:28,980 --> 00:09:32,550 to do all these stuff in an automated way. 107 00:09:32,940 --> 00:09:34,510 So I hope you will love it. 108 00:09:34,740 --> 00:09:35,370 Thank you. 109 00:09:35,460 --> 00:09:36,330 Have a nice day. 110 00:09:36,540 --> 00:09:37,320 Keep learning. 111 00:09:37,530 --> 00:09:38,310 Keep going. 112 00:09:38,880 --> 00:09:39,750 Keep practicing.