data_mining:xgboost

$$
\hat{y}_i = \sum^K_{k=1} f_k(x_i), \quad f_k \in F
$$

===== Gradient boosting =====
  
$F$ is the space of functions containing all regression trees.
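As a minimal sketch of what this means in code (not XGBoost's actual implementation; the list of fitted trees and the sklearn-style ''predict'' method are assumptions for illustration), the ensemble prediction is simply the sum of each tree's output:

<code python>
import numpy as np

def ensemble_predict(trees, X):
    # trees: list of K fitted regression trees (the f_k); each is assumed
    # to expose an sklearn-style predict(X) returning one value per row of X.
    # y_hat_i = sum_k f_k(x_i)
    return np.sum([tree.predict(X) for tree in trees], axis=0)
</code>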
  * Logistic loss $l(y_i,\hat{y}_i)=y_i \ln(1+e^{-\hat{y}_i})+(1-y_i)\ln(1+e^{\hat{y}_i})$ (LogitBoost); its derivatives are written out below
  
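For the logistic loss above, the first and second derivatives with respect to the prediction have a simple closed form (a standard result, written with the sigmoid $\sigma(z) = 1/(1+e^{-z})$; these are exactly the quantities the second-order approximation below needs):

$$
\frac{\partial l(y_i,\hat{y}_i)}{\partial \hat{y}_i} = \sigma(\hat{y}_i) - y_i, \qquad
\frac{\partial^2 l(y_i,\hat{y}_i)}{\partial \hat{y}_i^2} = \sigma(\hat{y}_i)\,(1-\sigma(\hat{y}_i))
$$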
Stochastic gradient descent cannot be applied, since the model is built from trees: there are no continuous parameters on which to take gradient steps.
  
The solution is **additive training**: start with a constant prediction and add one new function (tree) in each round.
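Written out (a sketch of the standard formulation, with the constant start taken as $0$; the superscript $(t)$ denotes the boosting round and $\Omega$ the usual regularization term on the new tree, not shown in the excerpt above):

$$
\hat{y}_i^{(0)} = 0, \qquad \hat{y}_i^{(t)} = \hat{y}_i^{(t-1)} + f_t(x_i) = \sum_{k=1}^{t} f_k(x_i)
$$

$$
\text{obj}^{(t)} = \sum_{i=1}^{n} l\left(y_i,\; \hat{y}_i^{(t-1)} + f_t(x_i)\right) + \Omega(f_t)
$$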
  
  
==== Taylor expansion ====
  
Use a Taylor expansion to approximate a function by a power series (polynomial).
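In general, the second-order expansion around a point is $f(x+\Delta x) \approx f(x) + f'(x)\,\Delta x + \frac{1}{2} f''(x)\,\Delta x^2$. Applying this to the round-$t$ objective with $\Delta x = f_t(x_i)$ gives the usual XGBoost form (sketched here; $g_i$ and $h_i$ abbreviate the first and second derivatives of the loss at the previous round's prediction):

$$
\text{obj}^{(t)} \approx \sum_{i=1}^{n} \left[ l\left(y_i, \hat{y}_i^{(t-1)}\right) + g_i f_t(x_i) + \frac{1}{2} h_i f_t(x_i)^2 \right] + \Omega(f_t)
$$

$$
g_i = \partial_{\hat{y}_i^{(t-1)}} l\left(y_i, \hat{y}_i^{(t-1)}\right), \qquad
h_i = \partial^2_{\hat{y}_i^{(t-1)}} l\left(y_i, \hat{y}_i^{(t-1)}\right)
$$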