SimRank

SimRank is a measure of similarity between nodes in a directed graph, based on the idea that "two objects are similar if they are related to similar objects." This implementation is optimized to run in O(n³), an improvement on the original paper’s O(n⁴). It also takes weighted edges into account, an improvement taken from SimRank++.

I originally wrote it to find similarities between users on Metafilter based on favorites data taken from the Infodump.

The example demonstrates correct output for the Figure 1 graph in [1].

References

[1] G. Jeh and J. Widom. “SimRank: A Measure of Structural-Context Similarity.” In KDD ’02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 538−543. ACM Press, 2002.

[2] D. Lizorkin, P. Velikhov, M. Grinev and D. Turdakov. “Accuracy Estimate and Optimization Techniques for SimRank Computation.” In VLDB ’08: Proceedings of the 34th International Conference on Very Large Data Bases, pages 422−433.

[3] I. Antonellis, H. Garcia-Molina and C.-C. Chang. “Simrank++: Query Rewriting through Link Analysis of the Click Graph.” In VLDB ’08: Proceedings of the 34th International Conference on Very Large Data Bases, pages 408−421.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
doc		doc
example		example
metafilter		metafilter
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
simrank.hpp		simrank.hpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SimRank

References

About

Releases

Packages

Languages

License

roukaour/simrank

Folders and files

Latest commit

History

Repository files navigation

SimRank

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages