Skip to content

dump basic information on every Unicode character in the input

License

Notifications You must be signed in to change notification settings

Doroszewski/udump

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

udump.pl – Unicode dump, a Perl script.

This is a little personal tool of mine. I use it when I need to debug my scripts which deal with text in various ways. I release it publicly, as it may become useful for someone else.

It gives very basic information on every Unicode character in the input, be it stdin, a file or a set of files.

Usage

The whole script is just a single Perl file. It should work with a default Perl installation, no additional modules are needed.

If no arguments are passed, the standard input is analyzed. This is very useful in pipes and that's the main way I use the script. Alternatively, filenames can be given.

Things to do

The tool is not finished yet. It should also:

  • treat combining characters properly;
  • treat all control characters properly (have I missed some?);
  • recognize light terminal themes;
  • do something else maybe?

About

dump basic information on every Unicode character in the input

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages