Version of LINUX Red Hat Enterprise Linux ES release 4
I need to confirm if an extract from a database has correctly output data with UTF-8 encoding. I created the file using the mechanism specified by the database vendor, but when I did
$ file extract.txt
it returned
ASCII text, with very long lines
However when I created a sub file from the main extract file and did
$ file sub_extract.txt
it returned
UTF-8 Unicode text, with very long lines
Therefore is my file actually OK and there is some limitation of the file command? Is there a better way of checking if a file contains UTF-8 data?