1 00:00:01,060 --> 00:00:02,500 Welcome back. 2 00:00:02,500 --> 00:00:09,580 Up until now I've been blabbering on about what machine learning is, and I hope you have a bit of an 3 00:00:09,580 --> 00:00:14,050 idea of why we have it and why it's useful. 4 00:00:14,170 --> 00:00:21,040 But at the end of the day, the only reason that we care about machine learning is that we're able to 5 00:00:21,040 --> 00:00:26,290 use machines to predict results based on incoming data. 6 00:00:26,290 --> 00:00:27,310 That's it. 7 00:00:27,340 --> 00:00:29,440 Now, this idea of machine learning... 8 00:00:29,440 --> 00:00:30,240 And don't worry. 9 00:00:30,280 --> 00:00:35,740 I know we keep talking about theory, but I promise we're gonna get to some coding exercises soon. 10 00:00:35,740 --> 00:00:43,330 But I do want to talk quickly about some of the machine learning categories that you often see, and keep 11 00:00:43,330 --> 00:00:51,420 in mind what I said: machine learning is simply about predicting results based on incoming data. 12 00:00:51,640 --> 00:00:55,820 And all these subcategories simply do that. 13 00:00:55,840 --> 00:01:03,820 For example, we have the idea of supervised learning, which is a subset of machine learning. In 14 00:01:03,880 --> 00:01:05,270 supervised learning, 15 00:01:05,500 --> 00:01:09,400 the data that we receive already has categories. 16 00:01:09,400 --> 00:01:13,740 Think of it as a CSV file with rows and columns labeled. 17 00:01:13,840 --> 00:01:22,410 We have labeled data and test data that is labeled, so we know if our function is right or wrong. 18 00:01:22,480 --> 00:01:30,310 So in a supervised learning scenario we can do things like classification to decide: is this an apple 19 00:01:30,310 --> 00:01:38,020 or is this a pear? A machine learning model simply draws a line to decide, hey, this is an apple and this 20 00:01:38,110 --> 00:01:46,360 is a pear. Or we might do something called regression instead of classification, based on inputs. 
21 00:01:46,550 --> 00:01:54,680 For example, predicting stock prices. Another way that we might use supervised learning is, for example, 22 00:01:54,680 --> 00:02:03,740 to hire engineers based on inputs: based on years of experience, based on maybe age, maybe where they live, 23 00:02:04,040 --> 00:02:11,810 what type of computers they have. All these sorts of inputs that are labeled can be used in a supervised 24 00:02:11,870 --> 00:02:17,570 learning system to decide: should I hire this engineer or should I not? 25 00:02:17,600 --> 00:02:25,370 Now, sometimes we have data that doesn't have labels, and this is called unsupervised learning. 26 00:02:25,370 --> 00:02:32,690 Again, think of it as a CSV file without perhaps the column names labeled. Sometimes, with things like 27 00:02:32,690 --> 00:02:33,820 clustering, 28 00:02:33,980 --> 00:02:39,170 we need to create these groups, or at least have the machine create these groups. 29 00:02:39,200 --> 00:02:45,490 For example, we give it a bunch of data points and then the machine decides: oh, this is a group, 30 00:02:45,530 --> 00:02:52,550 this is a group, and this is a group. Or we can have something like association rule learning, where we 31 00:02:52,550 --> 00:03:00,890 associate different things to predict what a customer perhaps might buy in the future. When groups don't 32 00:03:00,890 --> 00:03:01,760 exist, 33 00:03:01,760 --> 00:03:04,200 we call it unsupervised learning. 34 00:03:04,310 --> 00:03:10,910 We can't tell the machine that it is right or wrong like we can when we do apples versus pears, since 35 00:03:11,000 --> 00:03:17,050 there are no true categories, but we let the machine just create these categories for us. 
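To make the difference between supervised and unsupervised learning a bit more concrete, here is a minimal sketch in plain Python. Everything in it is an assumption made up for illustration: the fruit measurements (width and height in centimetres), the function names, and the choice of a nearest-centroid classifier and a tiny k-means loop as stand-ins for "a model that draws a line" and "a machine that creates groups". It is not the method used later in the course, just one simple way to show the two ideas side by side.

```python
# Hypothetical toy data: (width_cm, height_cm) for a few pieces of fruit.
LABELED = [
    ((7.0, 6.5), "apple"),
    ((7.5, 7.0), "apple"),
    ((6.0, 9.0), "pear"),
    ((6.5, 10.0), "pear"),
]


def centroid(points):
    """Average position of a list of equal-length tuples."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


# --- Supervised: labels are given, so we can learn one centroid per label ---
def fit_nearest_centroid(examples):
    by_label = {}
    for features, label in examples:
        by_label.setdefault(label, []).append(features)
    return {label: centroid(pts) for label, pts in by_label.items()}


def predict(model, features):
    """Return the label whose centroid is closest to the new point."""
    return min(model, key=lambda label: sq_dist(model[label], features))


# --- Unsupervised: no labels, the algorithm has to form the groups itself ---
def kmeans(points, centers, iterations=10):
    centers = list(centers)
    assignments = []
    for _ in range(iterations):
        # Assign every point to its closest current center.
        assignments = [
            min(range(len(centers)), key=lambda c: sq_dist(centers[c], p))
            for p in points
        ]
        # Move each center to the mean of the points assigned to it.
        for c in range(len(centers)):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # keep the old center if a group ends up empty
                centers[c] = centroid(members)
    return assignments, centers


if __name__ == "__main__":
    model = fit_nearest_centroid(LABELED)
    print(predict(model, (7.2, 6.8)))

    unlabeled = [features for features, _ in LABELED]  # throw the labels away
    groups, _ = kmeans(unlabeled, centers=[unlabeled[0], unlabeled[-1]])
    print(groups)
```

In the supervised half we can check each prediction against a known label, so we know whether the model is right or wrong. In the unsupervised half the algorithm only returns group numbers; nothing says which group is "apple" or "pear", and the starting centers here were picked by hand to keep the example deterministic.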
36 00:03:17,060 --> 00:03:24,010 Finally, we have this idea of reinforcement learning, and reinforcement learning is really interesting 37 00:03:24,040 --> 00:03:31,130 because it's all about teaching machines through trial and error, through rewards and punishment. So the 38 00:03:31,130 --> 00:03:38,930 program simply learns a game by playing it millions of times until, well, it gets the highest score. It 39 00:03:38,930 --> 00:03:45,650 doesn't know what it's doing at first, but then it tries to maximize the score and eventually figures 40 00:03:45,650 --> 00:03:51,270 out that, hey, maybe I should try and move where the ball is coming. 41 00:03:51,860 --> 00:03:58,250 So this is used for skill acquisition or real-time learning, and you see it a lot in machine learning 42 00:03:58,250 --> 00:04:00,960 programs that play, for example, video games. 43 00:04:01,010 --> 00:04:08,210 But the idea here is that machine learning has different categories and different ways to accomplish 44 00:04:08,240 --> 00:04:17,830 its goal, and topics like neural networks, decision trees, support vector machines, and k-nearest neighbors are 45 00:04:17,830 --> 00:04:27,580 simply algorithms that are used within these subfields in order to come to these predictions. But remember 46 00:04:27,820 --> 00:04:35,650 the key thing: what all of these are doing is trying to learn from the data they receive and 47 00:04:35,650 --> 00:04:40,570 predict something. I'll see you in the next one. Bye.
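The trial-and-error idea behind reinforcement learning can also be sketched in a few lines of Python. This is a deliberately tiny, made-up game, not a real video game and not a full reinforcement learning algorithm: each round the ball is to the left, right, or directly above the paddle, the program picks a move, and it gets +1 for moving toward the ball and -1 otherwise. The state names, the reward values, and the epsilon-greedy value update are all assumptions chosen for illustration. A real game would also need to account for rewards that arrive many steps later.

```python
import random

ACTIONS = ["left", "stay", "right"]
STATES = ["ball_left", "ball_here", "ball_right"]

# Known only to the environment (the "game"); the learner never reads this.
BEST = {"ball_left": "left", "ball_here": "stay", "ball_right": "right"}


def reward(state, action):
    """+1 if the paddle moves toward the ball, -1 otherwise."""
    return 1.0 if BEST[state] == action else -1.0


def train(episodes=3000, epsilon=0.1, alpha=0.1, seed=0):
    rng = random.Random(seed)
    # The learner starts with no idea which action is good in which state.
    q = {s: {a: 0.0 for a in ACTIONS} for s in STATES}
    for _ in range(episodes):
        state = rng.choice(STATES)
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)  # explore: try something random
        else:
            action = max(ACTIONS, key=lambda a: q[state][a])  # exploit
        r = reward(state, action)
        # Nudge the estimated value of this action toward the reward received.
        q[state][action] += alpha * (r - q[state][action])
    return q


def policy(q):
    """For each state, the action with the highest learned value."""
    return {s: max(ACTIONS, key=lambda a: q[s][a]) for s in STATES}


if __name__ == "__main__":
    print(policy(train()))
```

Nobody tells the program which move is correct. It only sees the score it gets after each move, and over many rounds the moves that earn rewards end up with the highest estimated values.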