View Ticket
2012-11-29
14:50 Ticket [e399bc1edf] Non-ASCII characters in file and folder names are not handled correctly status still Fixed with 3 other changes artifact: a0ad8e4a68 user: drh
14:05 Ticket [e399bc1edf]: 3 changes artifact: 724e5982cf user: anonymous
13:15 Fixed ticket [e399bc1edf]. artifact: 83178da4ee user: drh
13:15 Translate decomposed-UTF8 filenames on MacOS into precomposed-UTF8. Fix for ticket [e399bc1edfe45b] check-in: ca728447a6 user: drh tags: trunk
13:12 Translate decomposed-UTF8 filenames on MacOS into precomposed-UTF8. Fix for ticket [e399bc1edfe45b] Closed-Leaf check-in: 1002e7238c user: drh tags: ticket-e399bc
03:14 Untested changes designed to fix broken filename encodings on MacOS. Move all character-set converting routines into a new file called utf8.c. Add the fossil_filename_to_utf8() routine, which converts Unicode to utf8 on windows and decomposed utf8 into precomposed utf8 on Mac. Ticket [e399bc1edfe45b] check-in: c24e568fb0 user: drh tags: ticket-e399bc
00:54 Ticket [e399bc1edf] Non-ASCII characters in file and folder names are not handled correctly status still Open with 3 other changes artifact: 3872e76295 user: drh
00:52 Ticket [e399bc1edf]: 2 changes artifact: 8f697cecf9 user: drh
2012-11-28
21:54 New ticket [e399bc1edf]. artifact: 7b69597542 user: anonymous

Ticket UUID: e399bc1edfe45b2edb56eb037f63a6bf4cdbc211
Title: Non-ASCII characters in file and folder names are not handled correctly
Status: Fixed
Type: Code_Defect
Severity: Critical
Priority:
Subsystem:
Resolution: Fixed
Last Modified: 2012-11-29 14:50:48
Version Found In: 1.24
Description & Comments:
With the latest version (1.24), the issue occurs after the following steps:

A. On the first machine, create the folder and file:

  1. Create a folder named "MañósosCarácteres"
  2. Create a file named "cönáñón" inside the new folder
  3. fossil addremove
  4. fossil commit -m "Añadí una carpeta y un archivo con tildes"

B. Then, on a different machine:

  1. fossil update (the new folder named "MañósosCarácteres" is now available)
  2. Add a new file called "éstedesdemác.rst" to the folder "MañósosCarácteres"
  3. fossil commit -m "Añadí nuevo archivo con tildes desde mi mac"

C. Back on the first machine:

  1. fossil update

A second folder with the name "MañósosCarácteres" has been added. I do not understand how Fossil can add a folder with exactly the same name and have the file system accept it. This happens on both Ubuntu 12.04 LTS and OS X 10.7.

One folder contains both files, and the other contains only the first file ("cönáñón").
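
A likely explanation, consistent with the fix recorded in the ticket history (translating decomposed UTF-8 filenames on MacOS into precomposed UTF-8), is that the two folder names look identical but are stored as different byte sequences. The Python sketch below is only an illustration and is not Fossil's code: the helper names utf8_forms and same_after_precomposing are hypothetical, and it assumes that "precomposed" and "decomposed" correspond to the Unicode NFC and NFD normalization forms.

```python
import unicodedata


def utf8_forms(name):
    """Return (precomposed_bytes, decomposed_bytes) for a file name.

    Assumption: "precomposed" is Unicode NFC and "decomposed" is NFD.
    """
    nfc = unicodedata.normalize("NFC", name)  # e.g. "ñ" as one code point
    nfd = unicodedata.normalize("NFD", name)  # e.g. "n" + combining tilde
    return nfc.encode("utf-8"), nfd.encode("utf-8")


def same_after_precomposing(a, b):
    """True if two names are equal once both are converted to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


if __name__ == "__main__":
    folder = "MañósosCarácteres"
    nfc_bytes, nfd_bytes = utf8_forms(folder)
    # The two forms render the same on screen but differ byte for byte,
    # so a file system that compares raw bytes can hold both side by side.
    print("precomposed:", nfc_bytes)
    print("decomposed: ", nfd_bytes)
    print("bytes equal:", nfc_bytes == nfd_bytes)
```

Under that assumption, a name committed in one form and checked out next to the other form would show up as a second folder with visually the same name. Comparing names only after converting both to the precomposed form, as same_after_precomposing does, treats them as one name.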

In a batch of files I was working on for a customer, which contained many non-ASCII characters, the file names would sometimes suddenly duplicate with inverted accents (going from á to à). If you need more examples, let me know and I will do a similar run with all the non-ASCII characters that we use in Spanish.