xml2json

Translate any XML document to corresponding JSON or vice versa

HISTORY

Most of this code was written by Stefan Gössner and published at: https://goessner.net/download/prj/jsonxml/

You can read a description of what the library originally did at: https://www.xml.com/pub/a/2006/05/31/converting-between-xml-and-json.html

I did 2 major changes to the original code:

XML attributes are translated as siblings, rather than children, to their corresponding "parent" element in JSON. This produces more deterministic translation.
Use JSON list structure more frequently. This ensures lossless translation in more cases than the original and also is capable of handling some "mixed" cases that the original would just pass along as untranslated XML.

Details of these changes are explained below. For anything else, read the xml.com article linked above.

USAGE

The xml2json library works both in a browser and node.js. (In node.js, it depends on libxml plugin.)

For example usage in a browser, see xmljson_demo.html

For example usage in node.js, see https://github.com/henrikingo/node-xml2json

DETAILS

An explanation of changes wrt the original library by Stefan Grössner follows:

Consider the following 2 translations in the original library:

<e>text</e>
{
  "e":"text"
}

vs

<e name="value">text</e>
{
  "e":{
    "@name":"value",
    "#text":"text"
  }
}

=> the path to "text" changes depending on whether the element has attributes present or not. Very bad if there could be optional attributes, or attributes added over time.

For more examples, see https://goessner.net/download/prj/jsonxml/xmljson_test.html

Solution is to lift out attributes to be on the same level as the element itself:

<e name="value">text</e>
{
  e@name:"value",
  e:"text"
}

Note that for empty elements, we still explicitly print null for clarity:

<e name="value" />
{
  e@name:"value",
  e:null
}

The other fix is to use lists to more closely resemble original xml structure in case of multiple children with same node name.

Original:

<e> <a>text</a> <a>text</a> </e>
{
  "e":{"a":[
      "text",
      "text"
    ]}
}

Me:

<e> <a>text</a> <a>text</a> </e>
{
  e:[
        {
      a:"text"
    },
        {
      a:"text"
    }
  ]
}

This approach allows to handle "mixed content" elements better, where original gives up:

<a>x<c/>y</a>
{
  "a":"x<c/>y"
}

Me:

<a>x<c/>y</a>
{
  a:[
    "x",
        {
      c:null
    },
    "y"
  ]
}

(Same fix also supports CDATA sections, should there be any.)

TODO

All types are strings. Preferably types would be defined in xml schema, and the xml parser would return node values in correct javascript type. If this is not the case, we can emulate something by converting strings to other types in a separate function to be done later (ie xml schema hard coded into a js function) For database searches, having the correct types will help.

Not yet supported in json->xml:

Multi-dimensional arrays. What is the correct xml presentation for this?

{ e : [["a", "b"], ["c", "d"]] }

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
LICENSE		LICENSE
README.md		README.md
json2xml.js		json2xml.js
node.js		node.js
package.json		package.json
xml2json.js		xml2json.js
xml2json_demo.html		xml2json_demo.html
xmljson_test.html		xmljson_test.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

xml2json

HISTORY

USAGE

DETAILS

TODO

About

Releases

Packages

Languages

License

henrikingo/xml2json

Folders and files

Latest commit

History

Repository files navigation

xml2json

HISTORY

USAGE

DETAILS

TODO

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages