000 00277 a2200121 4500
000 nam a22 7a 4500
005 20260106103443.0
020 _a9789385889509
100 _aPhil Winder
245 _aReinforcement Learning
260 _bShroff/O'Reilly Media
942 _cBK
999 _c121188 _d121188