forked from kanongil/telxcc

A utility to extract and convert teletext CC from TS files

beeeeswax/telxcc

telxcc

NOTE: telxcc is very likely no longer maintained. It is licensed under the GPL: anyone can use it for free, and anyone has the right to modify it, provided they do not use the name "telxcc" for further releases. Please rename your clone after forking.

telxcc is a utility that extracts teletext closed captions from Transport Stream binary files (TS & M2TS) into SubRip text files (SRT).

telxcc is:

  • tiny and lightweight (few KiBs binary, no lib dependencies)
  • easy to use
  • open-source
  • multiplatform (Mac, Windows and Linux @ x86, ARM etc.)
  • modern (fully supports UTF-8 (Unicode Normalization Form C (NFC)), colours in SRT files, conforms to ETSI 300 706 Presentation Level 1.5/2.5, able to process TS and M2TS)
  • stable
  • secure (does not require any escalated privileges)
  • high performing (even an SSD is the bottleneck)
  • well tested (every build is automatically tested against English, German, Czech, Italian, Norwegian, Swedish, Finnish, Slovenian and Polish TS files from different TV stations)
  • 100% handcrafted in Prague, CZ. :)

telxcc is easy to use and flexible at the same time:

  • telxcc can be run in "search engine mode", in which case it produces plain text output suitable for indexing (UTC airtime and caption in plain text)
  • telxcc tries to automatically detect all parameters needed (transport stream ID, teletext CC page, timestamps) and the environment (the way it is started on Windows, for instance)
  • it can be easily integrated (files can be redirected or specified on the command line as parameters)
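To give a feel for what detecting stream parameters involves, here is a minimal Python sketch that counts packets per packet identifier (PID) in a Transport Stream buffer. It is not telxcc's detection logic. It assumes the "transport stream ID" above refers to a PID, and it relies only on the generic MPEG-TS packet layout (188-byte packets starting with sync byte 0x47, plus a 4-byte prefix per packet in M2TS). The function name `count_pids` is hypothetical.

```python
from collections import Counter

TS_PACKET_SIZE = 188   # plain TS packet length in bytes
SYNC_BYTE = 0x47       # every TS packet starts with this byte


def count_pids(data: bytes, packet_size: int = TS_PACKET_SIZE) -> Counter:
    """Count packets per PID in a buffer of back-to-back TS packets.

    Hypothetical helper for illustration only. For M2TS, pass
    packet_size=192: each packet carries a 4-byte prefix before
    the 188-byte TS packet.
    """
    prefix = packet_size - TS_PACKET_SIZE
    pids = Counter()
    for start in range(0, len(data) - packet_size + 1, packet_size):
        packet = data[start + prefix:start + packet_size]
        if packet[0] != SYNC_BYTE:
            continue  # out of sync; a real parser would resynchronize
        # The PID is 13 bits: low 5 bits of byte 1 plus all of byte 2.
        pid = ((packet[1] & 0x1F) << 8) | packet[2]
        pids[pid] += 1
    return pids
```

A PID histogram like this only narrows down candidates; finding the teletext stream and the caption page still requires inspecting the payload, which this sketch does not attempt.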

telxcc is the only utility with a correct implementation for the following languages:

  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Italian
  • Lettish
  • Lithuanian
  • Polish
  • Portuguese
  • Rumanian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish
  • Swedish
  • Turkish

telxcc also has a limited/untested implementation of the Cyrillic and Greek alphabets for:

  • Bulgarian
  • Croatian
  • Greek
  • Russian
  • Serbian
  • Ukrainian

and it is also already prepared for Arabic and Hebrew scripts (unfortunately, no such TS samples are available for testing).

Important: telxcc will never ever be like Emacs; it is a simple and specialized utility. Do you need another output format? Just transform the current one. Is online/realtime processing over TCP/IP required? I suggest Node.js as a wrapper…
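As an illustration of transforming the SRT output into another format, here is a minimal Python sketch that converts SubRip text to WebVTT. It is not part of telxcc; the function name `srt_to_vtt` is hypothetical. It assumes well-formed SRT timing lines and only rewrites the timestamps, so any colour markup in the captions is passed through unchanged and may need separate handling.

```python
import re

# SRT timestamps use a comma before milliseconds; WebVTT uses a dot.
_TIMING = re.compile(
    r"(\d{2}:\d{2}:\d{2}),(\d{3}) --> (\d{2}:\d{2}:\d{2}),(\d{3})"
)


def srt_to_vtt(srt_text: str) -> str:
    """Convert SubRip text to WebVTT text (hypothetical helper)."""
    lines = ["WEBVTT", ""]  # mandatory header followed by a blank line
    # Drop a leading byte order mark if the SRT text starts with one.
    for line in srt_text.lstrip("\ufeff").splitlines():
        # Caption numbers are kept; WebVTT accepts them as cue identifiers.
        lines.append(_TIMING.sub(r"\1.\2 --> \3.\4", line))
    return "\n".join(lines) + "\n"
```

Usage could look like `open("out.vtt", "w", encoding="utf-8").write(srt_to_vtt(open("in.srt", encoding="utf-8").read()))`, where both file names are placeholders.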

Unfortunately I am unable to provide you with free support. Please, do not ever ask me to assist you with source code modifications or to make a special build for you etc., if you use telxcc for your business (especially if you have not donated to its development). It is your job you are paid for.

3rd party software known to be using telxcc

Binaries

For precompiled binary files see Releases page.

Build

To install or uninstall telxcc on Linux and Mac:

$ make install ↵

$ make uninstall ↵

To build a binary for the Intel Core 2 processor architecture, just type:

$ make ↵

On Mac you can typically use the clang compiler:

$ make CC=clang ↵

You can also copy any *.ts files into the current directory and build a profiled version (up to 3% performance gain on repeat tasks):

$ make profiled ↵

Or you can disable all optimizations (binary target is any x86 processor):

$ make CCFLAGS="-Wall -pedantic -std=gnu99"

The Windows binary is built in MinGW with the following command (MinGW must be included in PATH):

C:\devel\telxcc> mingw32-make -f Makefile.win strip

telxcc has no lib dependencies and is easy to build and run on Linux, Mac and Windows. (Generic binary files are included.)

Command line params