
roboatory/xml_jsonl_converter


Note: To convert documents from XML to JSONL, you must first download the xml_to_json script developed by BlackRock: https://github.com/blackrock/xml_to_json. This repository presents two additional modifications to that script.

How to use:

From the folder where BlackRock's script is located, run the following command to batch-convert a folder of XML files:

python xml_to_json.py -e "INSERT XPATH QUERIES HERE" -x "INSERT PATH TO XSD FILE" "INSERT PATH TO XML FILES"

Additional Modifications to the CLI:

  1. To join the resulting JSONL files into a single file (one document per line), pass the path to the directory containing the files with the -j argument on the command line, and name the output file with -f. The individual JSONL files are also kept in a temp directory.
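
To illustrate what joining JSONL files into one file means, here is a minimal Python sketch. It is not the code from this repository or from BlackRock's script: the function name `join_jsonl_files` and its parameters are hypothetical, and it assumes the individual files use a `.jsonl` extension and sit directly in one directory.

```python
import json
from pathlib import Path


def join_jsonl_files(input_dir, output_path):
    """Concatenate every .jsonl file in input_dir into output_path.

    Hypothetical helper for illustration only. Each non-empty input line
    is checked to be valid JSON and written as one line of the output.
    Returns the number of documents written.
    """
    output_path = Path(output_path)
    count = 0
    with output_path.open("w", encoding="utf-8") as out:
        # Sort for a deterministic order across runs.
        for path in sorted(Path(input_dir).glob("*.jsonl")):
            # Skip the output file itself if it lives in input_dir.
            if path.resolve() == output_path.resolve():
                continue
            with path.open(encoding="utf-8") as src:
                for line in src:
                    line = line.strip()
                    if not line:
                        continue  # ignore blank lines
                    json.loads(line)  # raises ValueError on malformed JSON
                    out.write(line + "\n")
                    count += 1
    return count
```

Calling `join_jsonl_files("temp", "combined.jsonl")` (both paths are placeholders) would write every document found under `temp` to `combined.jsonl`, one per line. Validating each line with `json.loads` is a design choice of this sketch, so that a malformed input file fails loudly instead of corrupting the combined output.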

About

Scripts to Convert XML to JSONL
