Allow reserved identifiers as Attribute names?

#21 · April 25, 2025, 3:36 am

Quote from Darren Duncan on April 25, 2025, 3:36 am

Quote from dandl on April 25, 2025, 1:44 am

So the aim is (or might be) to allow the database designer total freedom in the choice of table names, column names, etc. The designer can choose select * from payroll where id like '%"%' as the name of the table produced by that query. So then the language needs a way to denote arbitrary strings of characters as identifiers, just to keep the parser happy.

Trivial. The result is a single token, marked out by start and end sequences. The only minor challenge is devising an end sequence in a way that it cannot appear inside the token. Here's one way to do it. If you don't like this one, I've got plenty more.

"John O'Dwyer"

@"John O'Dwyer"

@/"John O'Dwyer"/

@#/"John O'Dwyer/"#

@\v~!@#$%^&*()_+John O'Dwyer-={}[]:";',./<>?\v

I have 2 responses to this:

1. As I said previously, just reuse the existing delimiters and escape sequences we already have for regular string literals; there is no need to invent more like in your examples, or have any of the added complexity. Most application programming languages with quoted identifiers do this very thing already, I believe, reuse their String type or whatever its called for identifiers.

2. That said, regarding your concern about the delimiter not appearing between the delimiters, I happened to make a design decision for my Muldis stuff to have all escape sequences not include the character they are a sequence for, and also I have an escape sequence for quotation marks, so my string literals never include the literal delimiter, and it is in fact illegal. So for example, the escape sequence for the quotation mark and backslash are respectively \q and \k respectively (I avoided using the letters with established meanings for these things, eg \n means what it normally means). I made this choice, rather than say using \" so that the parsing is much simpler. Also I only have a single quoted character string literal that doubles for use with identifiers.

Quote from dandl on April 25, 2025, 1:44 am

So the aim is (or might be) to allow the database designer total freedom in the choice of table names, column names, etc. The designer can choose select * from payroll where id like '%"%' as the name of the table produced by that query. So then the language needs a way to denote arbitrary strings of characters as identifiers, just to keep the parser happy.

Trivial. The result is a single token, marked out by start and end sequences. The only minor challenge is devising an end sequence in a way that it cannot appear inside the token. Here's one way to do it. If you don't like this one, I've got plenty more.

"John O'Dwyer"

@"John O'Dwyer"

@/"John O'Dwyer"/

@#/"John O'Dwyer/"#

@\v~!@#$%^&*()_+John O'Dwyer-={}[]:";',./<>?\v

I have 2 responses to this:

1. As I said previously, just reuse the existing delimiters and escape sequences we already have for regular string literals; there is no need to invent more like in your examples, or have any of the added complexity. Most application programming languages with quoted identifiers do this very thing already, I believe, reuse their String type or whatever its called for identifiers.

2. That said, regarding your concern about the delimiter not appearing between the delimiters, I happened to make a design decision for my Muldis stuff to have all escape sequences not include the character they are a sequence for, and also I have an escape sequence for quotation marks, so my string literals never include the literal delimiter, and it is in fact illegal. So for example, the escape sequence for the quotation mark and backslash are respectively \q and \k respectively (I avoided using the letters with established meanings for these things, eg \n means what it normally means). I made this choice, rather than say using \" so that the parsing is much simpler. Also I only have a single quoted character string literal that doubles for use with identifiers.

#22 · April 25, 2025, 3:47 am

Quote from Erwin on April 24, 2025, 8:37 pm

Quote from Darren Duncan on April 23, 2025, 8:57 pm

... We already have fully solved the problem ...

Good for you. ...

To your opening remark, "good for you", let me be clear that by "we" I mean the industry at large; I'm not talking about myself as an individual or any team I might be part of.

The approach I mention has lots of precedent in many popular programming languages, where identifiers are the same character string data type, or easily convertible with such, as character strings for regular user data, and programmers write both with the same syntax.

So not "good for you".

TTM Forum

The Forum for Discussion about The Third Manifesto and Related Matters

Allow reserved identifiers as Attribute names?