13
2
I have many zip-files where there are encoding errors for the German umlauts (äüöÄÜÖß). They show up in both the filename.zip as well as in the included directories and files like this:
- Fünf = Fu╠ênf
- Räuber = Ra╠êuber
- Überfall = U╠êberfall
and so on. Usually I use Linux, but because of this issues I also tried a Windows7 VM but it results in the same encoding mess up. On Linux I played around with convmv and detox, but with no success.
When I use
- convmv -f iso-8859-1 -t utf8 --replace --notest -r *
I get "Skipping, already UTF-8".
Any thoughts about this?
@cider try
find -type f -print0 |xargs -r -n1 -0 convmv -f WINDOWS-1252 -t UTF-8 --notest
This find files from current dir forward and runs convmv separately on each file. Filename is encoed as nullterminated list. – Manwe – 2015-06-26T21:35:34.443What antique system are you using ? All current Linux distributions uses UTF-8 now. – BatchyX – 2013-01-13T15:51:36.530
Could this be a filesystem problem? Perhaps it is not mounted in UTF? – terdon – 2013-01-13T15:59:16.257
I use Linux Mint 13 (based on Ubuntu 12.04 LTS with Kernel 3.2.0-23), so this is far from antique. And as I already wrote I also tried that files on a Windows 7 VM. But of course I don't know what the one who created the zip files used. – cider – 2013-01-13T15:59:52.673
1This encoding seems some kind of DOS encoding. Usually if I see issues with UTF8 encoding the German umlauts look like ä = ä or Ü = Ãœ – cider – 2013-01-13T16:05:16.830