Source code and datasets of ACL2017 paper: "CANE: Context-Aware Network Embedding for Relation Modeling"
This folder "datasets" contains three datasets used in CANE, including Cora, HepTh and Zhihu. In each dataset, there are two files named "data.txt" and "graph.txt".
- data.txt: Each line represents the text information of a vertex.
- graph.txt: The edgelist file of current social network.
Besides, there is an additional "group.txt" file in Cora.
- group.txt: Each vertex in Cora has been annotated with a label. This file can be used for vertex classification.
Run the following command for training CANE:
python3 run.py --dataset [cora,HepTh,zhihu] --gpu gpu_id --ratio [0.15,0.25,...] --rho rho_value
For example, you can train like:
python3 run.py --dataset zhihu --gpu 0 --ratio 0.55 --rho 1.0,0.3,0.3
The experimental results are generated by the newest version of codes:
0.15 | 0.25 | 0.35 | 0.45 | 0.55 | 0.65 | 0.75 | 0.85 | 0.95 | |
---|---|---|---|---|---|---|---|---|---|
cora | 85.2 | 90.5 | 92.2 | 93.5 | 93.4 | 93.6 | 94.4 | 95 | 92.5 |
HepTh | 85 | 89.7 | 91.7 | 95 | 94.4 | 94.2 | 95.1 | 95.8 | 93.1 |
zhihu | 64.5 | 67.1 | 69.2 | 69.9 | 72 | 72.2 | 72.5 | 72.8 | 73.3 |
- Tensorflow == 1.11.0
- Scipy == 1.1.0
- Numpy == 1.16.2
If you use the code, please cite this paper:
Cunchao Tu, Han Liu, Zhiyuan Liu, Maosong Sun. CANE: Context-Aware Network Embedding for Relation Modeling. The 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017).
For more related works on network representation learning, please refer to my homepage.