First, for each time a uniquepositional encoding vectoris generated (SinusoidalPostEmb()), which maps the integer value of the timestep for a given image into a representative vector that we can use for timestep conditioning. For a recap on positional encodings, see the dropdownhere. Next, ...