- state: bbox
$b=[x,y,h,w]$ 里的Image patch, 具体操作为crop然后resize$s=\phi(b,F)$ - action:
$a=[\Delta x,\Delta y,\Delta s]$ 为相对的x,y偏移和尺度变化 - Actor:
$a=\mu(s|\theta^\mu)$ - reward: $$r(s,a)=\left{\begin{array}l 1 \quad if IoU(b',G)>0.7\ -1 \quad else \end{array}\right.$$
- 训练提升策略