题目2单选题
在强化学习值函数近似中,蒙特卡洛方法对梯度计算是( )A. <img src="https://tihai-oss-cloud.itihey.com/img/c64c76f174de69d7b1330a638b030ddc.jpg">B. <img src="https://tihai-oss-cloud.itihey.com/img/19233f93055b2f45980b959d23a149df.jpg">C. <img src="https://tihai-oss-cloud.itihey.com/img/c51e86d2b2e90bf3dbb005801b358c43.jpg">D. <img src="https://tihai-oss-cloud.itihey.com/img/d4019a2887a154a93aa69a666dd1eeeb.jpg">