|
2014-03-13
| ||
| 13:43 | • Closed ticket [09dd818564]: "fossil: manifest file" "is malformed" when commiting plus 4 other changes artifact: 0c019e9b1c user: jan.nijtmans | |
|
2012-11-29
| ||
| 14:50 | • Ticket [e399bc1edf] Non-ASCII characters in file and folder names are not handled correctly status still Fixed with 3 other changes artifact: a0ad8e4a68 user: drh | |
| 14:05 | • Ticket [e399bc1edf]: 3 changes artifact: 724e5982cf user: anonymous | |
| 13:15 | • Fixed ticket [e399bc1edf]. artifact: 83178da4ee user: drh | |
| 13:15 | Translate decomposed-UTF8 filenames on MacOS into precomposed-UTF8. Fix for ticket [e399bc1edfe45b] check-in: ca728447a6 user: drh tags: trunk | |
| 13:12 | Translate decomposed-UTF8 filenames on MacOS into precomposed-UTF8. Fix for ticket [e399bc1edfe45b] Closed-Leaf check-in: 1002e7238c user: drh tags: ticket-e399bc | |
| 03:14 | Untested changes designed to fix broken filename encodings on MacOS. Move all character-set converting routines into a new file called utf8.c. Add the fossil_filename_to_utf8() routine, which converts Unicode to utf8 on windows and decomposed utf8 into precomposed utf8 on Mac. Ticket [e399bc1edfe45b] check-in: c24e568fb0 user: drh tags: ticket-e399bc | |
| 00:54 | • Ticket [e399bc1edf] Non-ASCII characters in file and folder names are not handled correctly status still Open with 3 other changes artifact: 3872e76295 user: drh | |
| 00:52 | • Ticket [e399bc1edf]: 2 changes artifact: 8f697cecf9 user: drh | |
|
2012-11-28
| ||
| 21:54 | • New ticket [e399bc1edf]. artifact: 7b69597542 user: anonymous | |
| Ticket Hash: | e399bc1edfe45b2edb56eb037f63a6bf4cdbc211 | ||
| Title: | Non-ASCII characters in file and folder names are not handled correctly | ||
| Status: | Fixed | Type: | Code_Defect |
| Severity: | Critical | Priority: | |
| Subsystem: | | Resolution: | Fixed |
| Last Modified: | 2012-11-29 14:50:48 | Created: | 2012-11-28 21:54:12 |
| Version Found In: | 1.24 | ||
| Description: | ||||
|
Using the latest version 1.24 the issue occurs after the following steps:
A. For the folder and filenames:
B. then on a different machine
C. Back to the first machine
One folder contains both files, and the other contains only the first file ("cönáñón"). In a batch of files I was working on for a customer, which contained many non-ASCII characters, the file names would sometimes suddenly duplicate with inverted accents (going from á to à). If you need more examples, let me know and I will do a similar run with all the non-ASCII characters that we use in Spanish. | ||||
| User Comments: | ||||
drh added on 2012-11-29 00:54:47:
Konstantin Khomoutov writes on the fossil-users mailing list:

I'm just handwaving, but Git's code base recently received some modifications to specifically deal with issues a native Mac OS X filesystem has with regard to UTF-8. AFAIK the deal was about that filesystem performing one of the standard UTF-8 normalizations either when writing or when reading (or both), so that when you create a directory entry and then read it back, you might get an octet string different from the one you wrote. See the extensive commit message in [1], and [2] in general.

1. [https://github.com/git/git/commit/76759c7dff53e8c84e975b88cb8245587c14c7ba]
2. [http://en.wikipedia.org/wiki/HFS_Plus]

anonymous added on 2012-11-29 14:05:10:
I checked this a bit further to include a Windows 7 environment. Win 7 and OSX 10.7 respond differently.

A. On the Ubuntu 12.04LTS machine
# add the folder and "WiThCaPiTals" and "WiThCaPiTalsó"
# In both folders "withcapitals" and "WiThCaPiTalsó" add the file "FilEWithCapItals.rst"
# In the folder "withcapitals" add the file "filewithcapitals.rst"

B. Update the repo on OSX 10.7
# fossil: changes 3 files modified
# fossil: WARNING: 1 unmanaged files were overwritten
# the result is one folder: "WiThCaPiTalsó" containing the file "FilEWithCapItals.rst"
# Two folders and two files disappeared.

B. Update the repo on Windows 7
# fossil: changes 2 files modified
# fossil: WARNING: 1 unmanaged files were overwritten
# the result is two folders: "WiThCaPiTals" and "WiThCaPiTalsó" both containing the file "FilEWithCapItals.rst"
# One folder and one file disappeared.

It does indeed look as if this is caused by how the underlying OS deals with capitalization and non-ASCII characters. I used fossil for a workshop on version control, and some participants had issues like the change of direction of the accent, but I have not been able to replicate those. Neither have I been able to replicate the issues we saw in the Wiki, which is why I have not reported them separately.
drh added on 2012-11-29 14:50:48: See also: http://en.wikipedia.org/wiki/Unicode_equivalence#Errors_due_to_normalization_differences | ||||
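
The two effects discussed in this ticket (decomposed versus precomposed UTF-8 names, and names that differ only by case) can be illustrated with a short Python sketch. This is not Fossil's code, which is written in C; the helper names `to_precomposed`, `collision_key`, and `find_collisions` are hypothetical, and the collision key only approximates what a normalizing, case-insensitive filesystem does.

```python
import unicodedata


def to_precomposed(name: str) -> str:
    """Return the NFC (precomposed) form of a filename.

    Hypothetical helper: it mirrors the idea of translating
    decomposed UTF-8 into precomposed UTF-8, nothing more.
    """
    return unicodedata.normalize("NFC", name)


def collision_key(name: str) -> str:
    """Approximate key under which two names may refer to the same
    entry on a case-insensitive, normalizing filesystem (assumption)."""
    return unicodedata.normalize("NFC", name).casefold()


def find_collisions(names):
    """Group names that share a collision key; return groups of 2+."""
    groups = {}
    for n in names:
        groups.setdefault(collision_key(n), []).append(n)
    return [g for g in groups.values() if len(g) > 1]


# "cönáñón" written with combining marks (decomposed, NFD) ...
decomposed = "co\u0308na\u0301n\u0303o\u0301n"
# ... and with single precomposed code points (NFC).
precomposed = "c\u00f6n\u00e1\u00f1\u00f3n"

# The two strings look identical but differ as code point sequences.
same_before = decomposed == precomposed
same_after = to_precomposed(decomposed) == precomposed

# Folder names taken from the comment above.
collisions = find_collisions(["WiThCaPiTals", "withcapitals", "WiThCaPiTalsó"])

print(same_before, same_after, collisions)
```

Under these assumptions the two spellings of "cönáñón" compare unequal until both are normalized, and "WiThCaPiTals" and "withcapitals" fall into the same group, which is consistent with folders appearing to merge or disappear after an update on OSX or Windows.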