24th JULY 2018

$\arg\max_z l(\theta, z) - \gamma \|z - z^t\|_p^2$
where $z^t$ is a training sample chosen randomly during training. On a side note, I also recommend reading the reviews of this paper.
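To make the objective concrete, here is a minimal sketch of how the inner maximization could be approximated with plain gradient ascent on $z$. Everything specific in it is an assumption for illustration and not taken from the paper: the toy squared loss for a linear model, the fixed label `y`, the choice $p = 2$, and the hypothetical names `loss`, `loss_grad_z` and `perturb`, as well as the values of `gamma`, `step` and `n_steps`.

```python
import numpy as np

def loss(theta, z, y):
    # Toy loss (assumption): squared error of a linear model on input z.
    return 0.5 * (theta @ z - y) ** 2

def loss_grad_z(theta, z, y):
    # Gradient of the toy loss with respect to the input z.
    return (theta @ z - y) * theta

def perturb(theta, z_t, y, gamma=5.0, step=0.05, n_steps=200):
    # Gradient ascent on  l(theta, z) - gamma * ||z - z_t||_2^2  (p = 2 assumed),
    # starting from the training sample z_t.
    z = z_t.copy()
    for _ in range(n_steps):
        grad = loss_grad_z(theta, z, y) - 2.0 * gamma * (z - z_t)
        z = z + step * grad
    return z

theta = np.array([1.0, -2.0])   # model parameters (made up)
z_t = np.array([0.5, 0.5])      # training sample (made up)
y = 1.0                         # its label (made up)

z_adv = perturb(theta, z_t, y)
print("loss at z_t:  ", loss(theta, z_t, y))
print("loss at z_adv:", loss(theta, z_adv, y))
```

In this toy setting the penalty weight is large enough that the objective is concave in $z$, so gradient ascent converges. A smaller $\gamma$ lets $z$ move further from $z^t$, and for a real network the inner problem is generally only solved approximately.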