reinforcement learning - Should the importance sampling ratio be updated at the end of the for loop in the off-policy Monte Carlo control algorithm? - Artificial Intelligence Stack Exchange
PDF] Off-policy learning based on weighted importance sampling with linear computational complexity | Semantic Scholar
PPT - Weighted Importance Sampling Techniques for Monte Carlo Radiosity PowerPoint Presentation - ID:3681016
Multiple Importance Sampling Characterization by Weighted Mean Invariance | DCGI