Skip to content

fordfrog/xml2csv

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 

Repository files navigation

xml2csv

Simple XML to CSV conversion utility.

What it does exactly?

It converts any XML file to CSV:

<root>
    <item>
    	<subitem1>
            <value1>...</value1>
            <value2>...</value2>
    	</subitem1>
    	<subitem2>
            <value3>...</value3>
    	</subitem2>
    </item>
    <item>
        <subitem1>
            <value1>...</value1>
    	</subitem1>
    	<subitem2>
            <value3>...</value3>
    	</subitem2>
    </item>
    ...
</root>

Choose any XML element using XPath expression in order to select XML elements for conversion to CSV file. Only child elements that match the expression will be converted.

Prerequisities

  • JRE or JDK 7+
  • Apache Maven 3+ (just in case you want to compile the application yourself)

Compilation

You are not required to compile the application yourself, you can download latest binary from https://github.com/fordfrog/xml2csv/downloads. Anyway, compilation of xml2csv is easy. Once you install Apache Maven, you just need to run mvn package in the root directory of the sources, where pom.xml file is located.

Running

Here is the usage information that xml2csv outputs if run without parameters:

Usage: java -jar xml2csv-*.jar --columns <columns> --input <file> --output <file> --item-name <xpath>

General command line switches:

--columns <columns>
    List of columns that should be output to the CSV file. These names must
    correspond to the element names within the item element.
--input <file>
    Path to the input XML file.
--item-name
    XPath which refers to XML element which will be converted to a row. It cannot
    end with slash (/).
--join
    Join values of multiple elements into single value using (, ) as a separator.
    By default value of the first element is saved to CSV.
--output <file>
    Path to the output CSV file. Output file content is always in UTF-8 encoding.
--separator <character>
    Character that should be used to separate fields. Default value is (;).
--trim
    Trim values. By default values are not trimmed.

Filtering rows:

--filter-column <name>
    Column on which the filter should be applied. When specifying filter command
    line switches, you must use this switch as the first one as it initializes
    new filter. You can specify more filters, each one beginning with this
    switch. You can filter the rows even on columns that are not part of the
    output. Filtering is performed before remapping.
..filter.values <file>
    Path to file containing values that the filter should use. Empty rows are
    added to the values too.
--filter-exclude
    Excludes all rows where the column value matches one of the specified values.
--filter-include
    Includes all rows where the column value matches one of the specified values.
    This is the default behavior if --filter-exclude|--filter-include is not
    specified.

Remapping (replacing) values:

--remap-column <name>
    Column in which original values should be replaced with values from map
    file. When specifying remapping command line switches, you must use this
    switch as the first one as it initializes new remapping. You can specify
    more remappings, each one beginning with this switch. Remapping is performed
    after filtering.
--remap-map <file>
    Path to file containing original value and new value pairs. The file uses
    CSV format. Values can be escaped either using single-quote (') or
    double-quote ("). Quotes within values can be escaped either doubling them
    ("" and '') or backslash-escaping them (\" and \').

Characters encoding:

Application expects all files being in UTF-8 encoding.

To do

Nothing at this moment.

License

xml2csv is distributed under MIT license.

Changelog

Version 1.2.2

  • Fixed removal of new lines from the strings.

Version 1.2.1

  • Updated plugin and dependency versions.

Version 1.2.0

  • Added support for trimming values.
  • Added support for custom separator.
  • Added support for joining values of repeated XML elements.
  • Added support for handling any XML document.

Version 1.1.0

  • Added support for filtering rows.
  • Added support for remapping (replacing) values.

Version 1.0.0

Initial release.

About

Simple XML to CSV conversion utility

Resources

Stars

Watchers

Forks

Packages

 
 
 

Languages