Filename type: UNICODE or ANSI

Hi,

I've been trialling Dopus9 for a couple of days and I have to say it is really good:)

However, I'm having trouble with filenames in general. I work with Japanese and Chinese files. Some of them compressed as ZIP or RAR. The zip plugin does not seem to be able to open my ZIP files giving me an error about the system cant find the path specified. The RAR plugin seem to be doing fine. Is there another ZIP plugin that can read into my zip files?

Also, is Dopus able to show me whether a file name is in ANSI or UNICODE? If so, is there a way to convert all the file names from ANSI to UNICODE? I have some files that seem to display correctly no matter what system locale i set it to, but there are some files that gets mangled once I change the locale, so I'm guessing that the file name coding is the problem.

Its rather annoying to change my locale between chinese-japanese-english all the time.

Thank you! :smiley:

The Zip format does not seem to support unicode for filenames. See here:

blogs.msdn.com/michkap/archive/2 ... 16181.aspx

(Assuming that is still accurate. The comments do mention a reserved feature of the Zip format for Unicode filenames but I don't know if anything supports it.)

If you're looking at files on an NTFS disk then the filenames are already Unicode. If it's a FAT disk, or inside an archive format which stores filenames as ANSI, then the filenames get converted to Unicode on the fly based on the current locale/codepage.

There's no way to permanently convert a filename to Unicode if the underlying filesystem or archive format doesn't support it (obviously, since the filesystem cannot store such a filename), and on filesystems that do support Unicode the filenames should already be in Unicode.

Perhaps there are archive formats where filenames can be either "ANSI" or Unicode in which case extracting the files under the correct code page to an NTFS volume would convert their names to the correct Unicode characters, and then re-creating the archive (in a way which stores the filenames as Unicode) should preserve those Unicode characters.

So NTFS stores the filenames as unicode by itself without the need for ANSI code pages? So the best solution for me is to convert the ZIP files to RAR and this should give me the correct encoding?

Thanks for the info:)