Error : filename is not valid UTF-8

Comments, questions, bug reports, etc.
Post Reply
Skyblue
Posts: 17
Joined: Fri Apr 08, 2016 7:12 am

Error : filename is not valid UTF-8

Post by Skyblue »

Hi synchronicity,

The latest release of wimlib ( V 1.13.4 ) reports the following error message while capturing a folder on Oracle Linux 5.8 32-bit :

Code: Select all

[ERROR] "/appdata/krbapp/prodappl/xxkrb/11.5.0/bin/file▒200.xls": filename is not valid UTF-8.  This is not supported.
Is there a way to archive those non-UTF-8 files with wimlib? Thanks.

synchronicity
Site Admin
Posts: 335
Joined: Sun Aug 02, 2015 10:31 pm

Re: Error : filename is not valid UTF-8

Post by synchronicity »

I'm afraid not, as the WIM file format stores filenames as Windows-style wide character strings (UTF-16LE, with unpaired surrogates allowed). As a result there is no way to represent a UNIX-style arbitrary byte sequence filename unless it is valid UTF-8 (with unpaired surrogates allowed).

Edit: in principle filenames with a well-defined encoding other than UTF-8, say ISO-8859-1, could be mapped to UTF-16 as well. Almost everyone uses UTF-8 now though, so there hasn't been a need to support this.

Skyblue
Posts: 17
Joined: Fri Apr 08, 2016 7:12 am

Re: Error : filename is not valid UTF-8

Post by Skyblue »

Hi synchronicity,

Thanks for the info. Luckly, I had the option to delete the offending files and wimlib did the job.

Post Reply