Paper | Code | FID | ModelName | ReleaseDate |
---|---|---|---|---|
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis | ✓ Link | 122.8 | CaMN | 2022-03-10 |
Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity | ✓ Link | 177.2 | Trimodal | 2020-09-04 |
Audio2Gestures: Generating Diverse Gestures from Speech Audio with Conditional Variational Autoencoders | 223.8 | Audio2Gestures | 2021-08-15 | |
Learning Individual Styles of Conversational Gesture | ✓ Link | 256.7 | Speech2Gestures | 2019-06-10 |
Robots Learning to Say `No': Prohibition and Rejective Mechanisms in Acquisition of Linguistic Negation | 261.3 | Seq2Seq | 2018-10-28 |