I have so far worked mostly with UTF-8 text files or source code (pure ASCII and so implicit UTF-8). However now I am starting to work with files in CP 1252. Big trouble (and I mean big as in BIG 😢 ) ...
I thought that if I could identify and generate a list of the filenames which could not be encoded to cp1252, I could rename each filename to an acceptable filename ...
Here we explain a little bit about Unicode and why we may encounter UnicodeDecodeError or UnicodeEncodeError exceptions. While much of the world runs on UTF-8 these ...