Did you ever run a forward pass on the LM with just motion-to-motion or text-to-text inputs, or is the code below stage 1 of training as described in the paper?
From Paper:
"To generalize to various downstream tasks like [7, 37, 38, 28], we follow [38] to design an objective, where a
certain percentage (15%) of tokens in the input tokens Xs are randomly replaced with a special
sentinel token. On the other side, the corresponding target sequence is constructed by extracting
the dropped-out spans of tokens, delimited by the same sentinel tokens used in the input sequence,
along with an additional sentinel token to indicate the end of the target sequence. 2) We then learn
the motion-language relation by the supervision of paired text-motion datasets [11, 33]. We train
MotionGPT on the supervised motion-language translation, where the input is either a human motion
or a text description. After unsupervised and supervised training processes, we aim to equip our model
with the understanding of text and motion relationships."
However, the code comments out condition = random.choice(['text', 'motion', 'supervised', 'supervised', 'supervised']) and uses condition = random.choice(['supervised', 'supervised', 'supervised']) instead. I am not sure whether this is a mistake. If the code is correct, then the LM is pretrained only in a supervised manner, i.e., on motion-to-text, text-to-motion, and motion prediction.
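For reference, the masking objective quoted from the paper can be sketched roughly as follows. This is only an illustration of the T5-style scheme the quote describes, not the MotionGPT implementation: the function name span_corrupt, the <extra_id_N> sentinel naming (borrowed from the T5 tokenizer convention), and the choice to merge adjacent masked tokens into one span are all assumptions.

```python
import random


def span_corrupt(tokens, mask_ratio=0.15, seed=None):
    """Illustrative T5-style masking; returns (corrupted_input, target).

    Hypothetical helper, not taken from the MotionGPT repository.
    """
    rng = random.Random(seed)
    # Mask roughly 15% of the positions (at least one).
    n_mask = max(1, round(len(tokens) * mask_ratio))
    masked = set(rng.sample(range(len(tokens)), n_mask))

    inputs, target = [], []
    sentinel_id = 0
    prev_masked = False
    for i, tok in enumerate(tokens):
        if i in masked:
            if not prev_masked:
                # A new dropped span starts: one sentinel replaces it in the
                # input, and the same sentinel opens the span in the target.
                sentinel = f"<extra_id_{sentinel_id}>"
                sentinel_id += 1
                inputs.append(sentinel)
                target.append(sentinel)
            target.append(tok)
            prev_masked = True
        else:
            inputs.append(tok)
            prev_masked = False
    # An additional sentinel marks the end of the target sequence.
    target.append(f"<extra_id_{sentinel_id}>")
    return inputs, target
```

Calling span_corrupt on a list of text tokens would correspond to a text-to-text example, and calling it on a list of motion tokens to a motion-to-motion example, which is the unsupervised part the question is about.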
From Code:
mgpt_lm.py
condition = random.choice(
    ['text', 'motion', 'supervised', 'supervised', 'supervised'])
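To make the difference between the two lists concrete, here is a small sketch of how often each branch would be picked per call. Since random.choice samples uniformly, repeating 'supervised' three times simply weights that branch. The names full, supervised_only, and branch_probabilities are made up for this illustration, and reading 'text' and 'motion' as the unsupervised text-to-text and motion-to-motion branches is my assumption.

```python
from collections import Counter

# The five-entry list quoted from mgpt_lm.py above.
full = ['text', 'motion', 'supervised', 'supervised', 'supervised']
# The three-entry variant mentioned in this thread.
supervised_only = ['supervised', 'supervised', 'supervised']


def branch_probabilities(choices):
    """Probability of each branch under a uniform random.choice(choices)."""
    counts = Counter(choices)
    return {name: n / len(choices) for name, n in counts.items()}


print(branch_probabilities(full))             # {'text': 0.2, 'motion': 0.2, 'supervised': 0.6}
print(branch_probabilities(supervised_only))  # {'supervised': 1.0}
```

With the five-entry list, about 40% of the sampled conditions would be unsupervised under that reading; with the three-entry list, none would be.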