Lets understand why the test set bounds we already got is no longer valid if is the training set.

  • The test set bound was derived under the assumption that the coin-flips (errors ) are i.i.d.
  • If was an independent test set then this is true provided the examples are i.i.d.
  • If the training sample, then the classifier becomes a random variable as it depends on , hence all depend on the same - they are no longer i.i.d!
  • Therefore, the same bound would not hold true when is the training set!