MM

TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts.

Inspired by the strong ties between vision and language, the two intimate human sensing and communication modalities, our paper aims to explore the generation of 3D human full-body motions from texts, as well as its reciprocal task, shorthanded for …

Action2video: Generating Videos of Human 3D Actions

A temporal VAE archtecture model equipped with Lie Algebra representation for action-conditioned 3D human motion generation.

Dual Learning Music Composition and Dance Choreography

Music and dance have always co-existed as pillars of human activities, contributing immensely to the cultural, social, and entertainment functions in virtually all societies. Notwithstanding the gradual systematization of music and dance into two …

Action2Motion: Conditioned Generation of 3D Human Motions

A temporal VAE archtecture model equipped with Lie Algebra representation for action-conditioned 3D human motion generation.