tensorflow-gpu==1.6.0
numpy==1.14.2
pandas==0.22.0
gensim==3.4.0
Bulid word-based dictonary with min_count=20 and add TA's vocab.
python3 build_filter_words.py ./clr_conversation.txt ./vocab.txt ./word_vocab.txt
python3 filter.py ./clr_conversation.txt ./word_vocab.txt ./filtered_clr_conversation.txt
Preprocessing on filtered training data.
python3 data_preprocessing.py ./filtered_clr_conversation.txt ./word_vocab.txt ./w2v_corpus.txt
Pre-train Word2Vec model with size 256, window 5, iteration 300.
python3 train_w2v.py ./w2v_corpus.txt ./w2vModel_256_w5_mc1_iter300.bin
rm ./models/*
python3 train.py
python3 infer.py ./test_input.txt ./test_output.txt
If you just wanna infer without training, please run hw2_seq2seq.sh.
bash ./hw2_seq2seq.sh ./test_input.txt ./test_output.txt
Please download TA's baseline model before evaluating our model.
cd mlds_hw2_2_data/evaluation
python3 main.py ../../test_input.txt ../../test_output.txt
Pre-train W2V | Beam Search (size) | Perplexity | Correlation Score |
---|---|---|---|
No | No | 6.96 | 0.38256 |
No | 7 | 11.83 | 0.49207 |
Yes | No | 9.26 | 0.45864 |
Yes | 7 | 11.83 | 0.53626 |
Pre-train W2V | Beam Search (size) | Perplexity | Correlation Score |
---|---|---|---|
Yes | No | 9.26 | 0.45864 |
Yes | 1 | 9.26 | 0.45864 |
Yes | 2 | 8.05 | 0.47926 |
Yes | 3 | 9.34 | 0.51268 |
Yes | 4 | 10.13 | 0.52513 |
Yes | 5 | 10.87 | 0.52878 |
Yes | 6 | 11.36 | 0.53464 |
Yes | 7 | 11.83 | 0.53626 |
Yes | 8 | 12.01 | 0.53497 |
Yes | 9 | 12.25 | 0.53313 |
Yes | 10 | 12.44 | 0.53358 |
Yes | 11 | 12.55 | 0.53084 |
Yes | 12 | 12.59 | 0.52807 |
Yes | 13 | 12.70 | 0.52643 |
Yes | 14 | 12.80 | 0.52537 |
Yes | 15 | 12.81 | 0.52101 |
Yes | 16 | 12.82 | 0.51790 |
Yes | 17 | 12.87 | 0.51361 |
Yes | 18 | 12.89 | 0.51016 |
Yes | 19 | 12.90 | 0.50686 |
Yes | 20 | 12.91 | 0.50390 |
Yes | 30 | 12.99 | 0.48391 |
Yes | 40 | 13.16 | 0.47974 |
Yes | 50 | 13.40 | 0.47635 |