Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation ...

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • بيانات النشر:
      arXiv
    • الموضوع:
      2025
    • Collection:
      DataCite Metadata Store (German National Library of Science and Technology)
    • نبذة مختصرة :
      Existing large-scale video generation models are computationally intensive, preventing adoption in real-time and interactive applications. In this work, we propose autoregressive adversarial post-training (AAPT) to transform a pre-trained latent video diffusion model into a real-time, interactive video generator. Our model autoregressively generates a latent frame at a time using a single neural function evaluation (1NFE). The model can stream the result to the user in real time and receive interactive responses as controls to generate the next latent frame. Unlike existing approaches, our method explores adversarial training as an effective paradigm for autoregressive generation. This not only allows us to design an architecture that is more efficient for one-step generation while fully utilizing the KV cache, but also enables training the model in a student-forcing manner that proves to be effective in reducing error accumulation during long video generation. Our experiments demonstrate that our 8B model ...
    • الرقم المعرف:
      10.48550/arxiv.2506.09350
    • الدخول الالكتروني :
      https://dx.doi.org/10.48550/arxiv.2506.09350
      https://arxiv.org/abs/2506.09350
    • Rights:
      Creative Commons Attribution 4.0 International ; https://creativecommons.org/licenses/by/4.0/legalcode ; cc-by-4.0
    • الرقم المعرف:
      edsbas.5751E020