This repository provides a brief introduction to and case study of the paper "Training-Free Semantic Video Composition via Pre-trained Diffusion Model" (ICME 2024, Oral).
The video composition task aims to integrate specified foregrounds and backgrounds from different videos into a harmonious composite. Current approaches, predominantly trained on videos with adjusted foreground color and lighting, struggle with semantic disparities that go beyond such superficial adjustments, e.g., domain gaps. We therefore propose a training-free pipeline that employs a pre-trained diffusion model imbued with semantic prior knowledge, enabling it to process composite videos with broader semantic disparities.
Using the pre-trained Stable Diffusion v2-1 as our backbone, we leverage its robust semantic understanding to build a training-free video compositing pipeline.
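The README does not spell out the pipeline's implementation, but the core idea, harmonizing a pasted-together composite with a frozen pre-trained diffusion model, can be sketched with an SDEdit-style image-to-image pass in Hugging Face `diffusers`. This is a minimal illustration, not the paper's actual method: the `harmonize_frame` helper, frame paths, prompt, and `strength` value are assumptions, and the sketch omits the temporal handling a real video pipeline needs. Only the `stabilityai/stable-diffusion-2-1` checkpoint id is the publicly known one.

```python
# Minimal sketch (not the paper's full pipeline): harmonize composite video
# frames with the frozen Stable Diffusion v2-1 backbone via an SDEdit-style
# image-to-image pass, relying on the model's semantic prior to blend
# foreground and background. Paths, prompt, and strength are illustrative.
import os

import torch
from PIL import Image
from diffusers import StableDiffusionImg2ImgPipeline

pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1",  # public SD v2-1 checkpoint
    torch_dtype=torch.float16,
).to("cuda")


def harmonize_frame(frame: Image.Image, prompt: str) -> Image.Image:
    """Partially noise the composite frame, then denoise it, so the frozen
    model's semantic prior reconciles foreground and background."""
    frame = frame.resize((768, 768))  # SD v2-1's native resolution
    return pipe(
        prompt=prompt,
        image=frame,
        strength=0.5,        # fraction of the diffusion trajectory to re-run
        guidance_scale=7.5,
    ).images[0]


# Hypothetical per-frame loop over a pre-assembled composite clip; the
# actual pipeline also enforces temporal consistency, omitted here.
os.makedirs("harmonized", exist_ok=True)
frames = [Image.open(f"composite/{i:04d}.png").convert("RGB") for i in range(16)]
for i, frame in enumerate(frames):
    harmonize_frame(frame, "a photo of a dog on a beach").save(
        f"harmonized/{i:04d}.png"
    )
```

Frame-by-frame img2img like this would flicker in practice; it is only meant to show why a pre-trained diffusion prior can harmonize composites without any task-specific training.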
| Input | TF-ICON | Ours |
|---|---|---|
| ![]() | ![]() | ![]() |
| ![]() | ![]() | ![]() |
| ![]() | ![]() | ![]() |
| ![]() | ![]() | ![]() |