Introduction
Today, Machine Learning (ML) is everywhere, from chatbots and self-driving cars to Spotify’s music suggestions. But what does “machine learning” actually mean?
In simple terms, Machine Learning is how computers learn from data instead of being programmed with fixed rules. They observe, record, and gradually discover the hidden patterns behind what they see.
The Four Main Types of Machine Learning:
- Supervised learning – learning with guidance
- Unsupervised learning – learning without guidance
- Recommender systems – suggesting what you might like (Netflix, Shopee)
- Reinforcement learning – learning through reward and feedback
In this essay, we’ll focus on the first two, the foundation of most modern ML systems.
Supervised Learning — Learning with a Teacher
Imagine teaching a child to tell the difference between a cat and a dog.
Each time you show a picture, you say, “This is a cat,” or “This is a dog.”
After seeing hundreds of examples, the child figures out the rule:
“Oh, the ones with sharp ears and round eyes are cats.”
That is supervised learning is the machine learns from labeled examples.
The Core Idea
At its core, Machine Learning is about finding a function that maps inputs x to outputs y.
Here, y is the target we want to predict, and x represents the information or features we already know.
Humans define a model – a mathematical form of this function f(x) – such as a line, a curve, or a more complex structure like a neural network.
Through the training process, the algorithm repeatedly adjusts the model’s parameters to make its predicted outputs f(x) match the real values y in the sample data as closely as possible.
Once this function f(x) has been tuned well on the training data, we expect it to generalize – meaning it should make reasonably accurate predictions for new, unseen data.
Regression vs Classification
In supervised learning, the prediction task can take two major forms depending on what kind of target we are trying to predict.
- If the target is a number that can vary continuously, we call it Regression.
- If the target is a category or label, we call it Classification.
Regression problems aim to predict how much or how many, while classification problems aim to predict which one.
Despite their differences, both types learn from labeled data meaning there are known correct answers provided during training so that the model can adjust itself accordingly.
Examples:
- Forecasting website traffic for marketing planning (regression)
- Predicting house prices (regression)
- Detecting spam emails (classification)
Unsupervised Learning — Learning Without a Teacher
This time, no one tells the child which pictures are cats or dogs.
You simply give them a big pile of photos and say:
“Try to find what looks similar.”
The child sorts the pictures into two groups without knowing what those groups mean.
That’s unsupervised learning – learning without labels or supervision.
The Core Idea
- Only the input data x is given; there is no label y.
- The algorithm tries to find patterns, clusters, or hidden structures in the data.
Examples:
- Grouping customers by shopping behavior
- Clustering news articles by topic
- Grouping DNA sequences by similarity
Researchers then interpret what each group represents. For example:
“This group contains young tech enthusiasts, while that one represents middle aged high income professionals.”
Common Techniques
- Clustering – grouping similar data points (K-means, Hierarchical)
- Anomaly Detection – spotting unusual behavior (e.g., fraud detection)
- Dimensionality Reduction – simplifying data for better visualization or modeling (e.g., PCA)
Conclusion
Machine Learning is not magic. It’s essentially statistics automated and repeated at scale with massive amounts of data.
At first, the more data a model sees, the better it can learn from patterns become clearer and predictions more reliable. But beyond a certain point, simply adding more data doesn’t make the model smarter. If the data is repetitive, biased, or noisy, the model just learns the same things again and again. That’s why today’s progress in AI doesn’t rely only on more data, but also on better data, better architectures, and more efficient learning methods.
Like a child who grows not just by seeing more examples, but by learning to understand and generalize, modern AI advances by learning how to learn.