0006789: Allow non-ISO646 (UTF-8 compatible) characters in universal charstrings

(0012224)
Gyorgy Rethy (reporter)
06-10-2014 10:55

*For STF discussion*

(0012323)
Jacob Wieland - Spirent (reporter)
09-10-2014 13:58

As the only escape-character in TTCN-3 charstring literals is the quote-symbol, I guess this would have to be used.

"aaa"0706"bbb", for instance, could then be the same as "aaa" & <unicode of 0706> & "bbb".

As far as I can see, this does not introduce any backward incompatiblity as there is at the moment no grammar rule which allows a number directly behind a charstring literal.

(0012401)
Axel Rennoch (developer)
04-11-2014 14:00

Based on Jacob's idea we may allow different representations, please see examples in the attachment, since characters do not appear in this box. ;-)

(0012446)
Gyorgy Rethy (reporter)
06-11-2014 08:58

We shall not extend the scope of the CR. If more/other feature is needed, another CR shall be submitted.

The standard specifies the TTCN-3 modules to be saved in UTF-8, TTCN-3 editors should support UTF-8 characters (at least a reasoable subset), because they are allowed in comments. So, in principle no technical difficulties to allow their direct use in universal charstring values as well.

The additional syntax brings in new problems:
- in case of "aaa"0706"bbb", how to know what the user wanted to write? it may be a simple typing error and he/she meant "aaa""0706""bbb"! For this reason I strongly oppose this syntax, i.e. to extend the smantics associated with the "
character.

Anyway, UTF-8 today covers wast majority of really used characters, therefore the char(U4E2D, U56FD) syntax will become rarely used or used due to local style guides.

(0012472)
Axel Rennoch (developer)
06-11-2014 15:11

Following the discussion only a note has been added in the attached file to 6.1.1 e).

(0012473)
Axel Rennoch (developer)
06-11-2014 15:12

Please advice if the new note is sufficient.

(0012498)
Jacob Wieland - Spirent (reporter)
07-11-2014 13:37

no problem with me. I just pointed out that a direct inclusion into the charstring literals needs to use the " character as that is the only escape character we have (unfortunately). In principle, I agree with Gyorgys reasoning that mostly, their will be no need for the char-syntax to be used.

The only exception I see is standardization bodies which want to publish their testsuites in a non-UTF8 based format.

(0012649)
Gyorgy Rethy (reporter)
06-01-2015 18:26

Added to draft V4.6.3

Issue History
Date Modified	Username	Field	Change
23-07-2014 10:51	Gyorgy Rethy	New Issue
06-10-2014 10:55	Gyorgy Rethy	Note Added: 0012224
06-10-2014 10:55	Gyorgy Rethy	Target Version	=> v4.7.1 (published 2015-06)
09-10-2014 13:58	Jacob Wieland - Spirent	Note Added: 0012323
03-11-2014 16:33	Gyorgy Rethy	Assigned To	=> Axel Rennoch
03-11-2014 16:33	Gyorgy Rethy	Status	new => assigned
04-11-2014 13:58	Axel Rennoch	File Added: draft-res-6789-v1.docx
04-11-2014 14:00	Axel Rennoch	Note Added: 0012401
06-11-2014 08:58	Gyorgy Rethy	Note Added: 0012446
06-11-2014 15:09	Axel Rennoch	File Added: draft-res-6789-v2.docx
06-11-2014 15:11	Axel Rennoch	Note Added: 0012472
06-11-2014 15:12	Axel Rennoch	Note Added: 0012473
06-11-2014 15:12	Axel Rennoch	Assigned To	Axel Rennoch => Gyorgy Rethy
06-11-2014 15:12	Axel Rennoch	Status	assigned => acknowledged
07-11-2014 11:50	Gyorgy Rethy	Status	acknowledged => confirmed
07-11-2014 13:37	Jacob Wieland - Spirent	Note Added: 0012498
06-01-2015 18:23	Gyorgy Rethy	Status	confirmed => resolved
06-01-2015 18:23	Gyorgy Rethy	Resolution	open => fixed
06-01-2015 18:26	Gyorgy Rethy	Note Added: 0012649
06-01-2015 18:26	Gyorgy Rethy	Status	resolved => closed
06-01-2015 18:26	Gyorgy Rethy	Fixed in Version	=> v4.7.1 (published 2015-06)

Notes
(0012224) Gyorgy Rethy (reporter) 06-10-2014 10:55	For STF discussion

(0012323) Jacob Wieland - Spirent (reporter) 09-10-2014 13:58	As the only escape-character in TTCN-3 charstring literals is the quote-symbol, I guess this would have to be used. "aaa"0706"bbb", for instance, could then be the same as "aaa" & <unicode of 0706> & "bbb". As far as I can see, this does not introduce any backward incompatiblity as there is at the moment no grammar rule which allows a number directly behind a charstring literal.

(0012401) Axel Rennoch (developer) 04-11-2014 14:00	Based on Jacob's idea we may allow different representations, please see examples in the attachment, since characters do not appear in this box. ;-)

(0012446) Gyorgy Rethy (reporter) 06-11-2014 08:58	We shall not extend the scope of the CR. If more/other feature is needed, another CR shall be submitted. The standard specifies the TTCN-3 modules to be saved in UTF-8, TTCN-3 editors should support UTF-8 characters (at least a reasoable subset), because they are allowed in comments. So, in principle no technical difficulties to allow their direct use in universal charstring values as well. The additional syntax brings in new problems: - in case of "aaa"0706"bbb", how to know what the user wanted to write? it may be a simple typing error and he/she meant "aaa""0706""bbb"! For this reason I strongly oppose this syntax, i.e. to extend the smantics associated with the " character. Anyway, UTF-8 today covers wast majority of really used characters, therefore the char(U4E2D, U56FD) syntax will become rarely used or used due to local style guides.

(0012472) Axel Rennoch (developer) 06-11-2014 15:11	Following the discussion only a note has been added in the attached file to 6.1.1 e).

(0012473) Axel Rennoch (developer) 06-11-2014 15:12	Please advice if the new note is sufficient.

(0012498) Jacob Wieland - Spirent (reporter) 07-11-2014 13:37	no problem with me. I just pointed out that a direct inclusion into the charstring literals needs to use the " character as that is the only escape character we have (unfortunately). In principle, I agree with Gyorgys reasoning that mostly, their will be no need for the char-syntax to be used. The only exception I see is standardization bodies which want to publish their testsuites in a non-UTF8 based format.

(0012649) Gyorgy Rethy (reporter) 06-01-2015 18:26	Added to draft V4.6.3

Relationships

ETSI's Bug Tracker