DPE: Disentanglement of Pose and Expression for General Video Portrait Editing


Authors	Pang YX(庞有鑫)2,3; Zhang Y(张勇)1; Quan WZ(全卫泽)2,3; Fan YB(樊艳波)1; Cun XD(寸晓东)1; Ying, Shan 1; Yan DM(严冬明)2,3
Date	2023-03
Conference date	2023-06
Conference location	Canada
Abstract	One-shot video-driven talking face generation aims to produce a synthetic talking video by transferring the facial motion from a video to an arbitrary portrait image. Head pose and facial expression are always entangled in facial motion and are transferred simultaneously. However, this entanglement prevents these methods from being used directly in video portrait editing, which may require modifying only the expression while keeping the pose unchanged. One challenge in decoupling pose and expression is the lack of paired data, such as the same pose with different expressions. Only a few methods attempt to tackle this challenge, relying on 3D Morphable Models (3DMMs) for explicit disentanglement. But 3DMMs are not accurate enough to capture facial details because of their limited number of blendshapes, which has side effects on motion transfer. In this paper, we introduce a novel self-supervised disentanglement framework that decouples pose and expression without 3DMMs or paired data. It consists of a motion editing module, a pose generator, and an expression generator. The editing module projects faces into a latent space where pose motion and expression motion can be disentangled, so that pose or expression transfer can be performed conveniently in the latent space via addition. The two generators render the modified latent codes to images. Moreover, to guarantee the disentanglement, we propose a bidirectional cyclic training strategy with well-designed constraints. Evaluations demonstrate that our method can control pose or expression independently and can be used for general video editing. Code: https://github.com/Carlyx/DPE.
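Content type	Conference paper
Source URL	http://ir.ia.ac.cn/handle/173211/52023
Collection	State Key Laboratory of Multimodal Artificial Intelligence Systems
Corresponding author	Yan DM(严冬明)
Author affiliations	1. Tencent AI Lab 2. School of Artificial Intelligence, University of Chinese Academy of Sciences 3. Institute of Automation
Recommended citation (GB/T 7714)	Pang YX,Zhang Y,Quan WZ,et al. DPE: Disentanglement of Pose and Expression for General Video Portrait Editing[C]. In:. Canada. 2023-06.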
