#503 — Issues in 7.8.4 String Literals

bug_id: 503
creation_ts: 2012-07-11 19:12:00 -0700
short_desc: Issues in 7.8.4 String Literals
delta_ts: 2014-07-20 20:48:06 -0700
product: Draft for 6th Edition
component: technical issue
version: Rev 9: July 8, 2012 Draft
rep_platform: All
op_sys: All
bug_status: VERIFIED
resolution: FIXED
priority: Normal
bug_severity: major
everconfirmed: true
reporter: Norbert
assigned_to: Allen Wirfs-Brock

commentid: 1278
comment_count: 0
who: Norbert
bug_when: 2012-07-11 19:12:15 -0700

(1) For string literals, it is essential that we interpret them as Unicode code points, not Unicode characters. The Unicode standard doesn't define the term "Unicode character", but from its usage it's clear that it denotes a subset of Unicode code points that certainly excludes all code points reserved for future allocation. String literals must be able to express all possible String values, i.e., all sequences of 16-bit integers.

(2) First note: "Basic MultilingualPlane" -> "Basic Multilingual Plane"

(3) First note: Representing Unicode code points in the BMP as single code units is part of UTF-16 just like encoding supplementary characters as two code units. The note could just say that the source code point sequence is mapped to its corresponding UTF-16 code unit sequence.
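
As an illustration of points (1) and (3), here is a minimal sketch of the code point to UTF-16 code unit mapping. The helper names `toUtf16CodeUnits` and `codeUnitsOf` are hypothetical and not taken from the draft; the sketch assumes the standard UTF-16 encoding form and treats surrogate code points like any other BMP code point, so every 16-bit value remains expressible.

```javascript
// Hypothetical helper (not from the spec draft): maps one Unicode code
// point to its UTF-16 code unit sequence.
function toUtf16CodeUnits(codePoint) {
  if (!Number.isInteger(codePoint) || codePoint < 0 || codePoint > 0x10FFFF) {
    throw new RangeError("not a Unicode code point: " + codePoint);
  }
  if (codePoint <= 0xFFFF) {
    // BMP code point (surrogate code points included): one code unit.
    return [codePoint];
  }
  // Supplementary code point: split the 20-bit offset into a surrogate pair.
  const offset = codePoint - 0x10000;
  return [0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)];
}

// Hypothetical helper: lists the 16-bit code units of a String value.
function codeUnitsOf(str) {
  const units = [];
  for (let i = 0; i < str.length; i++) {
    units.push(str.charCodeAt(i));
  }
  return units;
}

// U+1F600 is supplementary, so it maps to two code units.
console.log(toUtf16CodeUnits(0x1F600).map((u) => u.toString(16)));
// A lone surrogate written with an escape is still a valid String value,
// although it is not a Unicode character.
console.log(codeUnitsOf("\uD800").map((u) => u.toString(16)));
```

The second call shows why the literal has to be read in terms of code points: a String value holding only the code unit 0xD800 can be written as a literal, even though no Unicode character corresponds to it.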

commentid: 1571
comment_count: 1
who: Allen Wirfs-Brock
bug_when: 2012-08-30 16:32:59 -0700

corrected in editor's draft

commentid: 1677
comment_count: 2
who: Allen Wirfs-Brock
bug_when: 2012-09-28 12:24:10 -0700

fixed in rev10, Sept. 27, 2012 draft

commentid: 9426
comment_count: 3
who: Norbert
bug_when: 2014-07-20 20:48:06 -0700

Verified in rev 26 draft.
