  • To delve into the mathematics more formally, policy gradients are a special case of the more general score function gradient estimator. (check the source)

# Alternative words

  • Considering using "lots of"? Try "a plethora of".
  • Thinking about using "limits"? Try "throttles", maybe.
  • Describing things that seem to make sense but lack scientific or statistical support? Use "anecdotal".