Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LM Pretrain #99

Open
palmex opened this issue Jun 26, 2024 · 1 comment
Open

LM Pretrain #99

palmex opened this issue Jun 26, 2024 · 1 comment

Comments

@palmex
Copy link

palmex commented Jun 26, 2024

Did you ever run forward on the LM with just motion to motion or text to text or is this (below) the stage 1 of training as described in the paper.

From Paper:
"To generalize to various downstream tasks like [7, 37, 38, 28], we follow [38] to design an objective, where a
certain percentage (15%) of tokens in the input tokens Xs are randomly replaced with a special
sentinel token. On the other side, the corresponding target sequence is constructed by extracting
the dropped-out spans of tokens, delimited by the same sentinel tokens used in the input sequence,
along with an additional sentinel token to indicate the end of the target sequence. 2) We then learn
the motion-language relation by the supervision of paired text-motion datasets [11, 33]. We train
MotionGPT on the supervised motion-language translation, where the input is either a human motion
or a text description.After unsupervised and supervised training processes, we aim to equip our model
with the understanding of text and motion relationships. "

From Code:
mgpt_lm.py

condition = random.choice(
['text', 'motion', 'supervised', 'supervised', 'supervised'])

@2000ZRL
Copy link

2000ZRL commented Oct 11, 2024

But the code comment out condition = random.choice(['text', 'motion', 'supervised', 'supervised', 'supervised']) and use condition = random.choice([''supervised', 'supervised', 'supervised']) instead. I am not sure whether this is a mistake. If the code is correct, then the LM is only pretrained in a supervised manner, i.e., motion-to-text, text-to-motion, and motion prediction.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants