Skip to content

token-based code clones detection tool based on YuichiSemura works

License

Notifications You must be signed in to change notification settings

jbanaszczyk/CCFinderSW

 
 

Repository files navigation

CCFinderSW

CCFinderSW (CCFSW) is a token-based code clone detection tool. Japanese Readme is also available.

What is a code clone?

A code clone is a matching or similar piece of code that exists in source code.
The tool detects and outputs clone pairs, which are pieces of code that are similar to each other.
The tool detects and outputs clone pairs that are similar to each other, and excludes clone pairs that contain inserted or deleted sentences.

Latest version

CCFinderSW-1.0.zip

How to use

There are several documents.

Required Environment

  • Java Runtime Environment( >=8 )
    Currently, this program is implemented in Java only.
  • We are planning to reduce the amount of RAM as much as possible.

Test Environment

some quick tests.

  • CCFinderSW
    • Windows 10 Pro
    • Max OS
    • (probably) Linux
  • Icca
    • Windows 10 Pro

Former Versions

Copyright and Licensing

See LICENSE.
This software contains libraries distributed under Apache Licence 2.0.

Other Information

Author: Yuichi Semura Affiliation: Graduate School of Information Science and Technology, Osaka University, Osaka, Japan

Paper Information

The detailed information is described in our paper. The following is our paper information.

@INPROCEEDINGS{8305997,
  author={Semura, Yuichi and Yoshida, Norihiro and Choi, Eunjong and Inoue, Katsuro},
  booktitle={2017 24th Asia-Pacific Software Engineering Conference (APSEC)},
  title={CCFinderSW: Clone Detection Tool with Flexible Multilingual Tokenization},
  year={2017},
  pages={654-659},
  doi={10.1109/APSEC.2017.80}
}

Notes on use

CCFSW performs lexical analysis, but does not have separate lexical analysis mechanisms for each language.
CCFSW performs lexical analysis tailored to each language by using optional files.
(At the same time, this does not mean that the lexer algorithm is optimized for each language.)
This option file can be modified by the user.
Refer OptionFile.md for the notation of the option file.

About

token-based code clones detection tool based on YuichiSemura works

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 98.3%
  • ANTLR 1.7%