Simplex explained

The concept of the simplex appears again and again in science and engineering: in optimization, information theory, communication systems, linear algebra, and more. Let’s have a quick look at it in different contexts. In general, it can be thought of as a mathematical expression of . It can be a list of numbers…

Epsilon-Greedy Algorithm in Bandit Problems

Bandit problems are the simplest possible reinforcement learning scenario. Here the bandit machine has k arms, and pulling an arm yields a reward. One of the arms gives higher rewards than the others in the long run, and this pattern may itself change over time. Think of the scenario of advertisements…
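As a rough illustration of the k-armed setup described above, here is a minimal sketch of an epsilon-greedy agent. Everything in it is an assumption made for the example rather than something taken from the post: the function name `epsilon_greedy`, the Gaussian rewards with unit variance, the arm means, and the default `epsilon=0.1`. It also assumes the reward pattern stays fixed over time.

```python
import random


def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on a stationary k-armed bandit (illustrative only).

    true_means: hypothetical mean reward of each arm, unknown to the agent.
    Rewards are assumed Gaussian with standard deviation 1.
    """
    rng = random.Random(seed)
    k = len(true_means)
    counts = [0] * k        # how many times each arm was pulled
    estimates = [0.0] * k   # running average reward observed per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(k)
        else:
            # Exploit: pick the arm with the highest current estimate.
            arm = max(range(k), key=lambda a: estimates[a])

        reward = rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
        # Incremental update of the sample average for the pulled arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    estimates, counts, total = epsilon_greedy([0.2, 0.5, 1.5])
    print("estimated means:", [round(e, 2) for e in estimates])
    print("pull counts:", counts)
    print("average reward:", round(total / sum(counts), 2))
```

If the reward pattern drifts over time, one common variation is to replace the `1 / counts[arm]` step with a small constant step size, so that recent rewards weigh more than old ones.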