Understanding Training data, Testing data, Actual & Predicted Values

Understanding Training, Testing, Actual & Predicted Values

Apple Example

Think of a Machine Learning Model as a Child

A machine learning model is like a child who does not know anything initially.

Just like a child:

  • The model must be taught
  • The model must then be tested

Training Phase – Teaching the Child

You start teaching the child using fruits.

You show a fruit and ask:

“What fruit is this?”

The child answers something:

  • Maybe “Apple”
  • Maybe “Banana”

This answer is called the predicted value
Because it is the child’s guess.

You immediately tell the child:

“The correct answer is APPLE 🍎”

This correct answer is called the actual value.

So during training:

  • Child gives a predicted value
  • Human provides the actual value
  • Child learns by comparing both

This process repeats many times.

What Exactly Is Training Data?

The apples you show during teaching are called training data.

 Training data means:

Data used to teach the model, along with correct answers (actual values)

Error – How Learning Happens

If:

  • Predicted value = Apple
  • Actual value = Apple
    → No error

If:

  • Predicted value = Banana
  • Actual value = Apple
    → Error happened

 Error = Difference between predicted and actual value

During training:

  • Error is used to improve learning

Testing Phase – Exam Time (New Apple 🍎)

Now the exam starts.

You show a new apple:

  • The child has never seen this exact apple
  • But it is still an apple

 This new apple is called test data.

You ask:

“What fruit is this?”

The child answers:

“Apple”

 This answer is again a predicted value.(but during testing)

Step 1: What Happens During the Exam?

Even in the exam:

  • Humans already know the correct answer
  • The fruit is actually an apple

 That correct answer is still called the actual value.

But now:

  • We do not tell the child the answer
  • We do not help the child learn
  • We only observe what the child answers

Step 2: Observing One Answer Is Not Enough

Suppose:

  • One apple → child says “Apple” ✔

Can we conclude:

“The child has learned perfectly”?

❌ No.

Because:

  • Maybe it was luck
  • Maybe it remembered only one apple

Step 3: Many Exam Questions (Many Test Data)

So in an exam:

  • We ask many questions
  • Not just one apple

Example:

  • Apple 1 → correct
  • Apple 2 → wrong
  • Apple 3 → correct
  • Apple 4 → wrong

 Now the big question:

“Overall, how well did the child perform?”


 How Humans Measure Exam Performance

In school:

  • We don’t check question by question
  • We calculate marks, percentage, grade

 That single score tells:

  • How good the performance is
  • Whether the child passed or failed

 Machines Need the Same Thing

In machine learning:

  • The model predicts on many test data points
  • For each one:
    • We know the actual value
    • We get a predicted value

 We need:

One or a few numbers to summarize performance


 This Summary Is Called a Performance Metric

What Is a Performance Metric? (Slow Introduction)

A performance metric is:

  • A numerical measure
  • That summarizes how accurate the model is
  • Based on errors on test data

 It answers:

  • How close are predictions to actual values?
  • Is the model reliable?
  • Is this model better than another one?

 Apple Example → Performance Metric

Actual (Human Truth)Predicted (Child)
AppleApple
AppleBanana
AppleApple
AppleApple

Instead of checking one by one, we say:

“Child answered 3 out of 4 correctly.”

That score is like a performance metric.


From Apple to Regression

In regression:

  • Instead of correct / wrong
  • We measure how far the prediction is

Example:

  • Actual price = ₹45 lakh
  • Predicted = ₹50 lakh
  • Error = ₹5 lakh

Performance metrics combine many such errors.

Training vs Testing — Final Clear Explanation

(Learning Phase vs Exam Phase)


 Training Phase = Learning Phase

In the training phase, the model is learning, just like a child in class.

What happens?

  • The model predicts an answer
  • Humans already know the actual value
  • The model compares:
    • Predicted value
    • Actual value
  • The difference (error) is used to improve learning

 Actual values are used to TEACH the model

Just like:

A teacher corrects the child after each question in class


Testing Phase = Exam Phase

In the testing phase, the model is examined, not taught.

What happens?

  • The model predicts on new data
  • Humans still know the actual value
  • BUT:
    • The model is not corrected
    • No learning happens
  • We only observe predictions

 Actual values are used only to CHECK performance

Just like:

Teachers already know the answers in exams, but they only calculate marks


 Why Performance Metrics Are Calculated on Test Data

Now make this conclusion very explicit:

Performance metrics are calculated on test data, not training data.

Why?

  • Training data is for learning
  • Test data is for evaluation
  • Test data shows real ability

 Same logic as:

  • Classwork → learning
  • Exam → performance

Final Comparison Table 

AspectTraining PhaseTesting Phase
PurposeLearningEvaluation
Data usedTraining dataTest data
Predicted valuesYesYes
Actual values usedYes (for correction)Yes (for checking only)
Learning happensYesNo
Performance metricsNoYes

Actual values are used during training to improve the model’s learning, while during testing they are used only to evaluate performance using performance metrics.

In exams,

teachers calculate:

  • Marks
  • Percentage
  • Grade

That single number tells:

  • How well the student performed overall

Same Question for Machine Learning Models

After testing the model on many test data points, we ask:

  • Is the model good or bad?
  • How big are its mistakes?
  • Is Model A better than Model B?
  • Are mistakes small or dangerous?

 We cannot judge by looking at individual predictions.

This Is Where Regression Metrics Come In

Regression performance metrics are numerical scores that summarize how well a regression model predicts real-world numerical values.

Just like:

  • Marks summarize exam performance

Regression metrics summarize:

  • Errors made by the model on test data

Why ONLY Test Data? (Clear & Final)

Make this point very clear to avoid confusion:

✔ Training data is used for learning
✔ Test data is used for evaluation
✔ Performance metrics are calculated on test data

Same logic as:

  • Classwork → learning
  • Exam → marks

House Price – Regression

  • Actual house price = ₹45 lakh (human knows)
  • Predicted house price = ₹50 lakh (model guesses)
  • Error = ₹5 lakh

Now imagine:

  • 100 houses
  • 100 such predictions
  • 100 errors

How do we summarize all these errors?

Using regression metrics like:

  • MAE
  • RMSE
  • MAPE