Fix character encoding issues

Jonathan Neuhauser requested to merge github/fork/Ede123/encoding into master

Created by: Ede123

This PR fixes character encoding issues (#26, closed) by

  • always reading binary data (both from files and from stdin) and letting the XML parser deal with the decoding
  • always writing binary data (both to files and to stdout), since we assume a UTF-8 encoding for the output and already encode it accordingly
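
As a rough illustration of the two points above (not the actual diff in this PR), the idea can be sketched as follows. The helper names `read_bytes`, `write_bytes`, and `roundtrip` are hypothetical, and the sketch assumes Python 3 and `xml.dom.minidom` as the parser; on Python 2, `sys.stdin` and `sys.stdout` have no `.buffer` attribute, so the binary streams would have to be obtained differently.

```python
import sys
from xml.dom import minidom


def read_bytes(path=None):
    """Return raw bytes from a file, or from stdin when path is None."""
    if path is None:
        # Python 3 only: .buffer is the binary stream under text-mode stdin.
        return sys.stdin.buffer.read()
    with open(path, "rb") as f:
        return f.read()


def write_bytes(data, path=None):
    """Write already-encoded bytes to a file, or to stdout when path is None."""
    if path is None:
        # Python 3 only: bypass the text layer so no second encoding step happens.
        sys.stdout.buffer.write(data)
        return
    with open(path, "wb") as f:
        f.write(data)


def roundtrip(raw):
    """Parse raw XML bytes and serialize the document as UTF-8 bytes."""
    # The parser receives undecoded bytes and picks the encoding itself,
    # for example from the XML declaration.
    doc = minidom.parseString(raw)
    # With an explicit encoding, toxml() returns bytes rather than str.
    return doc.toxml(encoding="utf-8")
```

With helpers like these, a call such as `write_bytes(roundtrip(read_bytes()))` never decodes or encodes text using the platform's default encoding, which is the kind of mismatch this PR is meant to avoid.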

Tests on Windows 7 with Python 3.5.0 and Python 2.7.11, as well as on Ubuntu 15.04 with Python 2.7.9, worked well for files with UTF and ISO 8859-15 encodings. I tested both file input/output and stdin/stdout input/output.
