Reinforcement learning system. Currently trying to understand how to implement contextual thompson sampling and its details after doing non contextual thompson sampling. My YouTube history is a lot of logistic regression related videos at the moment.