本文介绍: 机器学习评估模型

,b)=mtest1i=1mtest[ytest(i)log(f(xtest(i)))+(1ytest(i))log(1f(xtest(i))]

2 cross-validation set

  1. split the training set into training set, cross-validation set and test set
  2. the cross-validation set is used to automatically choose the better model, and the test set is used to evaluate the model that chosed

3 bias and variance

  1. high bias:

    J

    t

    r

    a

    i

    n

    J_{train}

    Jtrain and

    J

    c

    v

    J_{cv}

    Jcv is both high

  2. high variance:

    J

    t

    r

    a

    i

    n

    J_{train}

    Jtrain is low, but

    J

    c

    v

    J_{cv}

    Jcv is high

在这里插入图片描述

  1. if high bias: get more training set is helpless
  2. if high variance: get more training set is helpful

4 regularization

  1. if

    λ

    lambda

    λ is too small, it will lead to overfitting(high variance)

  2. if

    λ

    lambda

    λ is too large, it will lead to underfitting(high bias)

在这里插入图片描述

5 method

  1. fix high variance:
    • get more training set
    • try smaller set of features
    • reduce some of the higher-order terms
    • increase

      λ

      lambda

      λ

  2. fix high bias:
    • get more addtional features
    • add polynomial features
    • decrease

      λ

      lambda

      λ

6 neural network and bias variance

  1. a bigger network means a more complex model, so it will solve the high bias
  2. more data is helpful to solve high variance

在这里插入图片描述

  1. it turns out that a bigger(may be overfitting) and well regularized neural network is better than a small neural network

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注