
What is Machine Learning (ML)?

Machine learning (ML) is an important subfield of artificial intelligence. It is commonly divided into two main categories, supervised and unsupervised learning, with reinforcement learning often treated as a third.

Supervised Learning

In supervised machine learning, the algorithm is provided with training data that contains both the input information and the corresponding target information.
The model is trained on this labelled data set to recognize relationships between the input data and the target variable.
An illustrative example of supervised machine learning is the classification of emails as spam or non-spam. A large number of emails with corresponding labels ("spam" or "non-spam") are fed to the algorithm as training data. The algorithm learns to recognize patterns in this data and can then automatically classify new, unlabelled emails as spam or non-spam.
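
The spam example can be sketched in a few lines of Python. This is only a minimal illustration, assuming a tiny made-up training set and a simple word-count (naive Bayes) classifier; the article does not prescribe a particular algorithm, and the names `TRAIN`, `train` and `classify` are hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy training data: (email text, label)
TRAIN = [
    ("win money now", "spam"),
    ("cheap money offer", "spam"),
    ("win a free prize now", "spam"),
    ("meeting agenda for monday", "non-spam"),
    ("project report attached", "non-spam"),
    ("lunch on monday", "non-spam"),
]

def train(examples):
    """Count words per label and how often each label occurs."""
    word_counts = {}
    label_counts = Counter()
    for text, label in examples:
        label_counts[label] += 1
        word_counts.setdefault(label, Counter()).update(text.split())
    return word_counts, label_counts

def classify(text, word_counts, label_counts):
    """Return the label with the highest log-probability (Laplace smoothing)."""
    vocab = {word for counts in word_counts.values() for word in counts}
    total = sum(label_counts.values())
    scores = {}
    for label, counts in word_counts.items():
        n_words = sum(counts.values())
        score = math.log(label_counts[label] / total)
        for word in text.split():
            # +1 smoothing so unseen words do not produce log(0)
            score += math.log((counts[word] + 1) / (n_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)

word_counts, label_counts = train(TRAIN)
print(classify("free money now", word_counts, label_counts))          # prints spam
print(classify("monday project meeting", word_counts, label_counts))  # prints non-spam
```

The labelled examples play the role of the training data; the two calls at the end classify new, unlabelled emails.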

Figure: Example data set with the classification target (label) "salary greater than or equal to 60,000 euros"

To calibrate the classification algorithm, the available data is divided into training, validation and test sets. The training set is used to train the model, which learns patterns between the input data and the target values. The validation set is used to tune the model's settings (hyperparameters) and to monitor performance during training so that overfitting can be detected early.
The test set is used exclusively to evaluate the final performance of the trained model on independent data and to check that its predictions generalize.
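
The three-way split can be sketched as follows. The 70/15/15 proportions, the fixed seed and the function name `split_dataset` are assumptions chosen for illustration; the article does not state specific ratios.

```python
import random

def split_dataset(rows, train_frac=0.7, val_frac=0.15, seed=42):
    """Shuffle the rows and cut them into training, validation and test sets."""
    rows = list(rows)                   # copy so the caller's data is untouched
    random.Random(seed).shuffle(rows)   # fixed seed makes the split reproducible
    n_train = round(len(rows) * train_frac)
    n_val = round(len(rows) * val_frac)
    train = rows[:n_train]
    val = rows[n_train:n_train + n_val]
    test = rows[n_train + n_val:]       # whatever remains is the test set
    return train, val, test

data = list(range(100))                 # stand-in for 100 labelled examples
train, val, test = split_dataset(data)
print(len(train), len(val), len(test))  # prints 70 15 15
```

Shuffling before cutting matters when the data is ordered, for example by date or by label, because otherwise the three sets would not be comparable.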


The following figure shows a confusion matrix, a tool for evaluating a classification model: for each class, it shows how many examples were classified correctly and how many incorrectly.

Figure: Confusion matrix
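
As a rough sketch, a confusion matrix can be built by counting pairs of actual and predicted labels. The label lists below are made up for illustration, and `confusion_matrix` is a hypothetical helper, not a library function.

```python
from collections import Counter

def confusion_matrix(actual, predicted):
    """Count how often each (actual label, predicted label) pair occurs."""
    return Counter(zip(actual, predicted))

# Hypothetical labels for eight emails
actual    = ["spam", "spam", "spam", "non-spam", "non-spam", "non-spam", "non-spam", "spam"]
predicted = ["spam", "non-spam", "spam", "non-spam", "spam", "non-spam", "non-spam", "spam"]

matrix = confusion_matrix(actual, predicted)
labels = ["spam", "non-spam"]

# Rows are actual classes, columns are predicted classes
print("actual \\ predicted".ljust(20) + "".join(label.ljust(10) for label in labels))
for a in labels:
    print(a.ljust(20) + "".join(str(matrix[(a, p)]).ljust(10) for p in labels))
```

The diagonal cells hold the correctly classified emails; the off-diagonal cells hold the false positives and false negatives.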

The confusion matrix can be used to calculate two key metrics of a classification: recall and precision.
Precision measures the proportion of examples predicted as positive that are actually positive, while recall (also called sensitivity) measures the proportion of actual positive cases that the model correctly identifies. The share of correctly classified examples among all examples is a different metric, known as accuracy.
Using the example of spam email classification mentioned above:

Figure: Calculation of recall and precision for the spam example
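
The calculation can be sketched with hypothetical counts (the numbers from the original figure are not available here). Treating "spam" as the positive class, precision is TP / (TP + FP) and recall is TP / (TP + FN).

```python
def precision(tp, fp):
    """Share of emails flagged as spam that really are spam."""
    return tp / (tp + fp)

def recall(tp, fn):
    """Share of actual spam emails that were flagged as spam."""
    return tp / (tp + fn)

# Hypothetical counts, not taken from the article
tp = 40  # spam correctly flagged as spam (true positives)
fp = 10  # non-spam wrongly flagged as spam (false positives)
fn = 20  # spam that slipped through as non-spam (false negatives)

print(f"precision={precision(tp, fp):.2f} recall={recall(tp, fn):.2f}")
# prints precision=0.80 recall=0.67
```

In this made-up case the filter is right about 80% of the emails it flags, but it catches only about two thirds of all spam.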

Unsupervised Learning

In unsupervised machine learning, the algorithms are provided with training data that does not contain any target information (unlabelled data). The model recognizes patterns and structures in the data on its own, without a target variable being known or specified. This method is often used for data segmentation and for discovering hidden correlations. Examples of unsupervised learning include segmentation analysis (clustering) and anomaly detection, which identifies patterns that deviate from normal behavior, such as fraudulent transactions in banking data.
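
A very simple form of anomaly detection can be sketched without any labels. The example below assumes a robust score based on the median and the median absolute deviation (MAD), with the commonly used constant 0.6745 and a threshold of 3.5; the transaction amounts and the function name `find_anomalies` are hypothetical, and real fraud detection uses far richer models.

```python
from statistics import median

def find_anomalies(values, threshold=3.5):
    """Return values whose robust (median/MAD based) score exceeds the threshold."""
    med = median(values)
    mad = median([abs(v - med) for v in values])  # median absolute deviation
    if mad == 0:
        return []  # all values (nearly) identical: nothing stands out
    return [v for v in values if 0.6745 * abs(v - med) / mad > threshold]

# Hypothetical transaction amounts in euros
amounts = [20, 25, 22, 19, 24, 21, 23, 500]
print(find_anomalies(amounts))  # prints [500]
```

The median is used instead of the mean because a single extreme value would otherwise distort the very statistics that are supposed to expose it.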

Reinforcement Learning

In reinforcement learning, the algorithm learns by interacting with an environment that rewards or penalizes its actions. The model continuously adapts its strategy to maximize long-term reward. This method is used in various fields, including robotics, games and decision making.
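
The idea of learning from rewards can be sketched with a multi-armed bandit, one of the simplest reinforcement learning settings. The epsilon-greedy strategy, the reward probabilities and the function name `run_bandit` are assumptions chosen for illustration and are not taken from the article.

```python
import random

def run_bandit(reward_probs, steps=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy agent: mostly pick the best-looking action, sometimes explore."""
    rng = random.Random(seed)
    counts = [0] * len(reward_probs)    # how often each action was tried
    values = [0.0] * len(reward_probs)  # running average reward per action
    total_reward = 0
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(len(reward_probs))                 # explore
        else:
            action = max(range(len(values)), key=values.__getitem__)  # exploit
        # The environment answers with reward 1 or 0
        reward = 1 if rng.random() < reward_probs[action] else 0
        counts[action] += 1
        # Incremental update of the average reward for this action
        values[action] += (reward - values[action]) / counts[action]
        total_reward += reward
    return values, total_reward

# Hypothetical environment: three actions with different chances of a reward
values, total_reward = run_bandit([0.2, 0.5, 0.8])
best_action = max(range(len(values)), key=values.__getitem__)
print("estimated values:", [round(v, 2) for v in values])
print("best action:", best_action)
```

The agent is never told which action is best; it only observes rewards and gradually shifts its choices towards the action with the highest estimated value.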

Further information on the difference between artificial intelligence and machine learning can be found in this wiki article.


Published by:

Francis Heep

Professional Analytics consultant

