# F1 Score
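- Sometimes precision matters more, as in stock prediction
- Sometimes recall matters more, as in diagnosing patients
- To balance the two, use the F1 Score: the harmonic mean of precision and recall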
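
For reference, the harmonic mean mentioned above can be written out as follows. The symbols $P$ (precision) and $R$ (recall) are shorthand introduced here, not notation from the notebook:

$$
\frac{1}{F_1} = \frac{1}{2}\left(\frac{1}{P} + \frac{1}{R}\right)
\quad\Longrightarrow\quad
F_1 = \frac{2 \cdot P \cdot R}{P + R}
$$

Because the reciprocals are averaged, a very small value of either $P$ or $R$ dominates the result and pulls $F_1$ toward zero.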

In [1]:
import numpy as np

In [2]:
def f1_score(precision, recall):
    # Harmonic mean of precision and recall; defined as 0 when both are 0.
    try:
        return 2. * precision * recall / (precision + recall)
    except ZeroDivisionError:
        return 0.

In [3]:
f1_score(precision=0.5, recall=0.5)

0.5

In [4]:
f1_score(precision=0.1, recall=0.9)

0.18000000000000002

In [5]:
f1_score(precision=0., recall=1.)

0.0
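
The three calls above suggest why a harmonic mean is used rather than a plain average. A minimal sketch of that comparison follows; the helper names `arithmetic_mean` and `harmonic_mean` are illustrative and do not appear in the notebook:

```python
def arithmetic_mean(precision, recall):
    # Plain average: a high value can mask a very low one.
    return (precision + recall) / 2.0


def harmonic_mean(precision, recall):
    # Same formula as the hand-written f1_score above; 0 when both inputs are 0.
    if precision + recall == 0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)


# The same (precision, recall) pairs used in the cells above.
for p, r in [(0.5, 0.5), (0.1, 0.9), (0.0, 1.0)]:
    print(f"precision={p}, recall={r}: "
          f"arithmetic={arithmetic_mean(p, r):.2f}, "
          f"harmonic={harmonic_mean(p, r):.2f}")
```

For the unbalanced pairs the arithmetic mean stays at 0.5, while the harmonic mean drops to 0.18 and 0, so a model cannot score well by being strong on only one of the two metrics.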

In [6]:
from sklearn import datasets

digits = datasets.load_digits()
X = digits.data
y = digits.target.copy()

y[digits.target == 9] = 1
y[digits.target != 9] = 0

In [7]:
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=666)

In [8]:
from sklearn.linear_model import LogisticRegression

log_reg = LogisticRegression()
log_reg.fit(X_train, y_train)
log_reg.score(X_test, y_test)

0.9755555555555555

In [9]:
y_predict = log_reg.predict(X_test)

In [10]:
from sklearn.metrics import confusion_matrix

confusion_matrix(y_test, y_predict)

array([[403,   2],
       [  9,  36]])

In [11]:
from sklearn.metrics import precision_score

precision_score(y_test, y_predict)

0.9473684210526315

In [12]:
from sklearn.metrics import recall_score

recall_score(y_test, y_predict)

0.8

In [13]:
# Note: this import shadows the hand-written f1_score defined in cell [2].
from sklearn.metrics import f1_score

f1_score(y_test, y_predict)

0.8674698795180723
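
As a cross-check, the scikit-learn results above can be reproduced by hand from the confusion matrix printed in cell [10]. This is a sketch that hard-codes those four counts, assuming scikit-learn's layout of rows as true labels and columns as predicted labels; the variable names are illustrative:

```python
# Counts copied from the confusion matrix output: [[403, 2], [9, 36]]
tn, fp = 403, 2   # true class 0: predicted 0, predicted 1
fn, tp = 9, 36    # true class 1: predicted 0, predicted 1

precision = tp / (tp + fp)   # 36 / 38
recall = tp / (tp + fn)      # 36 / 45
f1 = 2 * precision * recall / (precision + recall)

# Equivalent form written directly in terms of the counts.
f1_from_counts = 2 * tp / (2 * tp + fp + fn)   # 72 / 83

print(precision, recall, f1, f1_from_counts)
```

The values agree with the `precision_score`, `recall_score`, and `f1_score` outputs shown above (about 0.947, 0.8, and 0.867).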