slope-experiments

Continuous contextual bandit experiments for the SLOPE estimator

The SLOPE estimator is developed in the paper "Adaptive Estimator Selection for Off-Policy Evaluation". This repository contains the code used for the experiments presented in Section 4 of that paper.

The repository contains a simulator for continuous contextual bandits (CB), several estimators for off-policy evaluation in the continuous CB setting, and scripts for running the experiments.
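
To make the setting concrete, here is a minimal, self-contained sketch of off-policy evaluation with a continuous action. It is not taken from this repository: the toy simulator, the boxcar-kernel smoothed IPS estimator, and the names `simulate` and `smoothed_ips` are all assumptions chosen for illustration.

```python
import random


def simulate(n, seed=0):
    """Log n rounds of a toy continuous CB problem (hypothetical setup).

    Contexts are uniform on [0.25, 0.75], logged actions are uniform on
    [0, 1] (so the logging density is 1), and the reward is 1 - |a - x|.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = rng.uniform(0.25, 0.75)
        a = rng.uniform(0.0, 1.0)
        density = 1.0  # density of Uniform[0, 1] at the logged action
        r = 1.0 - abs(a - x)
        data.append((x, a, density, r))
    return data


def smoothed_ips(data, policy, h):
    """Kernel-smoothed IPS estimate using a boxcar kernel of bandwidth h."""
    total = 0.0
    for x, a, density, r in data:
        # Boxcar kernel: weight 1 / (2h) when the logged action lies
        # within h of the action the target policy would have taken.
        k = 1.0 / (2.0 * h) if abs(a - policy(x)) <= h else 0.0
        total += k / density * r
    return total / len(data)


if __name__ == "__main__":
    logged = simulate(20000)
    for h in (0.02, 0.1, 0.25):
        # Target policy (assumed for illustration): play a = x.
        print(h, smoothed_ips(logged, lambda x: x, h))
```

In this toy setup the target policy's true value is 1, while the smoothed estimator's expectation is 1 - h/2 for h up to 0.25. A small bandwidth therefore gives low bias but high variance, and a large bandwidth gives the opposite. Choosing among such candidate estimators from data is the kind of selection problem the paper addresses.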

If you build on this repository, please cite: Yi Su, Pavithra Srinath, Akshay Krishnamurthy. "Adaptive Estimator Selection for Off-Policy Evaluation." arXiv, 2020.
