Skip to content
/ cmtu Public

A Node.js utility library for easily removing, extracting, and manipulating comments in code strings with support for popular programming languages such as JavaScript, Python, and etc.

License

Notifications You must be signed in to change notification settings

sina-byn/cmtu

Repository files navigation

cmtu - comment utils NPM version NPM monthly downloads NPM total downloads

cmtu - removing comments from code string has never been this easy.

Cmtu is a Node.js package that helps you easily remove, extract, and magic comments from code strings.

Built with TypeScript and has full type support.

Supports the most popular programming languages like Python, JavaScript, etc. out of the box.

Please consider following this project's author, Sina Bayandorian, and consider starring the project to show your ❤️ and support.

Table of Contents

Install

Install with npm:

$ npm install --save cmtu

Usage

const cmtu = require('cmtu');

// * create and configure a cmtu object
const jsCmtu = cmtu(cmtu.Languages.JS.resolver);
const jsCode = `
/*
this is a multi-line JS comment
*/

const callout = 'this is not a comment';

// this is a single line js comment
`;

const codeWithNoComments = jsCmtu.strip(jsCode);

console.log(codeWithNoComments, codeWithNoComments.length);

see built-in languages for a list of all built-in languages

API

returns a cmtu-object configured based on the args passed to it.

Params

  • resolver : Resolver

  • options : optional - Options

    • stringSensitive : boolean | undefined

      • if set to true will ignore strings that include a comment based on the provided string literals
    • stringLiterals : string[] | undefined

      defines the list of string literals - characters that a string start and end with

      • defaults to JavaScript string literals if stringSensitive is set to true
      • only used when stringSensitive is set to true
    • exclude : RegExp[] | undefined

      • an array of regexes
      • the comments that are matched by either one of these regexes are excluded from the output

Returns

Example

// initialize a pre-configured cmtu-object - jsCmtu
const jsCmtu = cmtu(cmtu.Languages.JS.resolver);
const jsCode = `
/*
this is a multi-line JS comment
*/

const callout = '// this is not a comment';

// this is a single line js comment
`;

const { strip, extract, magic } = jsCmtu;
// in this case the callout value will be included
// even though it's not an actual comment and is 
// a string because this instance the cmtu object
// exposes methods that are not string sensitive

// each of these methods is explained below
// at the cmtu-object section

const comments = jsCmtu.extract(jsCode);

console.log(comments);

returns a cmtu-object configured based on the args passed to it - the difference however is that methods exposed by this cmtu-object are stringSensitive meaning that in the rare cases where your strings might include comments themselves, these methods can understand the difference. Take a look at the example below:

Params

  • resolver : Resolver

  • options : optional - Omit<Options, 'stringSensitive'>

    • stringLiterals : string[] | undefined

      defines the list of string literals - characters that a string start and end with

      • defaults to JavaScript string literals if stringSensitive is set to true
      • only used when stringSensitive is set to true
    • exclude : RegExp[] | undefined

      • an array of regexes
      • the comments that are matched by either one of these regexes are excluded from the output

Returns

Example

// initialize a pre-configured cmtu-object - jsCmtu
const jsCmtu = cmtu.stringSensitive(cmtu.Languages.JS.resolver);
const jsCode = `
/*
this is a multi-line JS comment
*/

const callout = '// this is not a comment';

// this is a single line js comment
`;

const { strip, extract, magic } = jsCmtu;
// in this case the callout value won't be included
// as it's not an actual comment and is a comment
// inside of a srting

// each of these methods is explained below
// at the cmtu-object section

const codeWithNoComments = jsCmtu.strip(jsCode);

console.log(codeWithNoComments, codeWithNoComments.length);

Advanced Usage

you can customize cmtu to use it for languages that are not part of cmtu.Languages by default, for example:

// note that python is included in cmtu.Languages, and
// this is just an example to help you understand how
// to customize cmtu for your own use cases

const pyCode = `
# this is a comment in python
callout = "// this is not a comment in python"
#! python comment to be excluded
`;

// in order to customize the returned cmtu object
// we need to pass a proper resolver for python

// regex string to match python comments
const pyResolver = '#.*';
const pyStringLiterals = ["'", '"', "'''", '"""'];

const pyCmtu = cmtu.stringSensitive(
  pyResolver,
  {
    stringLiterals: pyStringLiterals,
    exclude: [/#!.*/] // optional
  }
);

const { strip, extract, magic } = pyCmtu;

console.log(extract(pyCode));
console.log(strip(pyCode));
console.log(magic(pyCode));

Interfaces

type Resolver =
  | string
  | { block: string }
  | { inline: string }
  | { block: string; inline: string };
type LanguageName = 'JS' | 'CSS' | 'HTML' | 'CPP' | 'GO' | 'PYTHON' | 'PHP';

// support for php multi-line strings is lacking
// a good idea is to use a regex to exclude the
// comments that are within multi-line strings
type Options = {
  stringSensitive?: boolean;
  stringLiterals?: string[];
  exclude?: RegExp[];
};
// 1- methods exposed by any configured cmtu object
// 2- return type of cmtu(...) and cmtu.stringSensitive(...)

{
  // returns a string with with its comments stripped
  strip: (code: string) => string;

  // returns an array of the stripped comments
  extract: (code: string) => string[];

  // returns a tuple [comment-stripped string, stripped comments]
  magic: (code: string) => [string, string[]];
}