WAN2.1 FLF2V: Incorrect MASK Creation????

Hello! I think this may be a bug. (If it isn't, please explain it to me!)

In the **WanImageToVideoPipeline** class in `pipeline_wan_i2v.py`:
<img width="868" height="243" alt="Image" src="https://github.com/user-attachments/assets/8108a9e9-8632-44a1-93b8-abd9ae6a22cd" />
(This code is part of the `prepare_latents` function.)

**For I2V**, the mask looks like this:
```
[[1, 0, 0, ... , 0]
[1, 0, 0, ... , 0]
[1, 0, 0, ... , 0]
[1, 0, 0, ... , 0]]
```
As I understand it, where the mask is 1, the corresponding input video frame is kept unchanged.
(Mask shape: [1, 4, 21, 60, 104] = [B, C, F, H, W])
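
For reference, here is a small sketch of where that shape could come from. It assumes an 81-frame, 480x832 input and compression factors of 4 (temporal) and 8 (spatial); none of these values are stated above, they are just one setting consistent with the shape, and `mask_shape` is a hypothetical helper, not a pipeline function.

```python
def mask_shape(batch, num_frames, height, width,
               temporal_factor=4, spatial_factor=8):
    """Hypothetical helper: mask shape as [B, C, F, H, W].

    Assumes the first pixel frame gets its own latent frame and every
    later group of `temporal_factor` pixel frames shares one latent frame.
    """
    latent_frames = (num_frames - 1) // temporal_factor + 1
    return [
        batch,
        temporal_factor,            # C: one slot per pixel frame in a group
        latent_frames,              # F
        height // spatial_factor,   # H
        width // spatial_factor,    # W
    ]


# Assumed input size: 81 frames at 480x832.
print(mask_shape(1, 81, 480, 832))  # [1, 4, 21, 60, 104]
```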
  
**But in the FLF2V case,** the mask looks like this:
```
[[1, 0, 0, ... , 0]
[1, 0, 0, ... , 0]
[1, 0, 0, ... , 0]
[1, 0, 0, ... , 1]]
```
Here, **why does the last-frame mask have a 1 only in the last channel?**
Is there anyone who can explain this part?  
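
To make the question reproducible without the screenshot, here is a minimal pure-Python sketch of how I assume the mask is built: one flag per pixel frame, the first frame's flag repeated 4 times, then every 4 consecutive flags packed into the "channel" axis of one latent frame. The function name `frame_mask`, its parameters, and the 81-frame default are my own assumptions, and the spatial dimensions are dropped because the mask is constant over H and W.

```python
def frame_mask(num_frames=81, temporal_factor=4, has_last_frame=False):
    """Hypothetical reconstruction of the mask as rows [C][F]."""
    # 1 marks a given (conditioning) pixel frame, 0 a frame to generate.
    flags = [0] * num_frames
    flags[0] = 1
    if has_last_frame:  # FLF2V: the last pixel frame is also given
        flags[-1] = 1

    # Repeat the first frame's flag so the total length is divisible
    # by temporal_factor (81 -> 84 frames in the assumed setting).
    padded = [flags[0]] * temporal_factor + flags[1:]

    # One group of temporal_factor pixel frames per latent frame.
    groups = [padded[i:i + temporal_factor]
              for i in range(0, len(padded), temporal_factor)]

    # Transpose to [slot within group][latent frame], i.e. [C, F].
    return [list(row) for row in zip(*groups)]


i2v = frame_mask()
flf2v = frame_mask(has_last_frame=True)
for row in flf2v:
    print(row[0], row[1], "...", row[-1])
```

With these assumptions the I2V rows are all `[1, 0, ..., 0]`, and in the FLF2V case only the fourth row ends in 1. If this reading is right, the four "channels" would be the four pixel frames packed into each latent frame, and the last pixel frame would simply land in the last slot of the last group. I have not confirmed this against the pipeline code, so please correct me if the actual logic differs.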

WAN2.1 FLF2V: Incorrect MASK Creation???? #12241