Random Forest Contextual Bandit

RandomForestを使って、文脈付きマルチアームバンディットを実装

Concept

RandomForest is using average of each tree's CVR. so it can also get standard deviation(SD) of CVR at the same time.
If having predicted CVR and CVR SD, you can use Thompson Sampling or UCB1 for arm's expectation CVR.

コンセプト

RandomForestは、各DecisionTreeのCVRを平均化しているから、その過程で分散も出せる
予測CVRと分散がわかれば、トンプソンサンプリングや、UCB1が利用できる
トンプソンサンプリングやUCB1により、サンプル数が薄い箇所の探索が可能になる

部分的アップデート

RandomForestの木を作り直すコストは重たいが、各木の末端の値を変えるのは比較的容易
木の作り直しではなく、末端の値の変更により、予測CVRの精度を改善する

基本方針

RandomForestContextualBandit

multi arm banditの腕の数だけ、RandomForestClassifierで予測器を生成
それぞれのArmのRandomForestの各木でCVRを計算
各木のCVRの平均値と分散から、トンプソンサンプリング、もしくはUCB1を行い、各アームの期待CVRを算出
期待CVRからアームを選択

RandomForestContextualBanditWithArmVector

Armの特徴量を活用するために、全体で一つのRandomForestClassifierを作る
FeatureVector = ContextVector + ArmVector
- ArmVectorが無い場合は、ArmのIDを1hot encoding
RandomForestの各木でCVRを計算
各木のCVRの平均値と分散から、トンプソンサンプリング、もしくはUCB1を行い、各アームの期待CVRを算出
期待CVRからアームを選択

Author

https://twitter.com/tokoroten

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.idea		.idea
README.md		README.md
random_forest_contexual_bandit.py		random_forest_contexual_bandit.py
result.csv		result.csv
score.txt		score.txt
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Random Forest Contextual Bandit

Concept

コンセプト

部分的アップデート

基本方針

RandomForestContextualBandit

RandomForestContextualBanditWithArmVector

Author

About

Releases

Packages

Languages

tokoroten/random_forest_contextual_bandit

Folders and files

Latest commit

History

Repository files navigation

Random Forest Contextual Bandit

Concept

コンセプト

部分的アップデート

基本方針

RandomForestContextualBandit

RandomForestContextualBanditWithArmVector

Author

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages