Manipulating the historical latents of large diffusion models for targeted generation (VoiceScroll Project)