Muhammad-Almujtaba Yaseen

Results 4 issues of Muhammad-Almujtaba Yaseen

Hello, Thank you so much for these useful resources! Maybe it's nice to add this paper to the continual learning for RL section: Towards Continual Reinforcement Learning: A Review and...

In the exercise you asked to: Implement the Monte Carlo Prediction to estimate **state-action** values... But your solution returns **state** values instead... Thank you for your hard work!

In the third line of the "strategy" function: def strategy(observation): score, dealer_score, usable_ace = observation **# Stick (action 0) if the score is > 20, hit (action 1) otherwise** return...

Under Summary section: In the sixth point ("Action Value function: q(s, a) estimates how "good" it is for an agent to be in **states** and take action a..."): - I...