IBM DB2 support #25

ukupat · 2017-06-17T17:58:38Z

I used ftp://ftp.software.ibm.com/ps/products/db2/info/vr105/pdf/en_US/DB2SQLRefVol1-db2s1e1051.pdf for learning and setting up DB2 configuration.

Unfortunately I didn't find any other solution for this...

ukupat · 2017-06-17T17:59:41Z

src/core/Tokenizer.js

@@ -57,11 +63,11 @@ export default class Tokenizer {
    // 5. national character quoted string using N'' or N\' to escape
    createStringPattern(stringTypes) {
        const patterns = {
+            "``": "((`[^`]*($|`))+)",


Reordered them to match documentation order.

ukupat · 2017-06-17T18:00:09Z

src/languages/Db2Formatter.js

+];
+
+const reservedToplevelWords = [
+    "ADD", "AFTER", "ALTER COLUMN", "ALTER TABLE", "DELETE FROM", "EXCEPT", "FETCH FIRST", "FROM", "GROUP BY", "GO", "HAVING",


Only thing that was added is "FETCH FIRST"

So to this group has "FETCH FIRST" compared to StandardSqlFormatter.

Might be worth to document it. Another thing: would be nice if the lines aligned with StandardSqlFormatter, so I could see differences when I just run diff for these files.

Of course a best diff would be one word per line. Perhaps it's actually worth doing that, although it creates pretty long files, it should be easier to maintain. Like currently I see that there are lots of differences in the reservedWords list, but it's really hard to tell what are the exact differences.

I tried making toplevel and newline words easier to compare because these are the most important ones.

I think one word per line is overkill. If you really want to compare differenct dialects config then you can use online tools or write a quick script with Lodash (that's what I did).

coveralls · 2017-06-17T18:01:48Z

Coverage remained the same at 100.0% when pulling a727543 on db2-support into 69a1246 on master.

ukupat · 2017-06-17T18:04:08Z

src/core/Tokenizer.js

     */
    constructor(cfg) {
-        this.WORD_REGEX = /^(\w+)/;
+        this.WORD_REGEX = /^([\w|#|@]+)/;


I tried playing around with word and operators tokenizing order (match operators before words) but that was no-go. Currently I went with the easiest solution that shouldn't break anything...

We should also make this WORD_REGEX configurable. We could have a config option like specialWordChars: ["@", "#"].

Other SQL dialects don't allow # and @ inside identifiers, which currently results in a scenario where formatting the following query with StandardSqlFormatter:

SELECT a#comment, here

results in:

SELECT a#comment, here

while correct formatting would be:

SELECT a #comment, here

nene · 2017-06-30T09:51:15Z

test/Db2FormatterTest.js

+            "  -- This is a comment\n" +
+            "  MyTable;\n"
+        );
+    });


We should also test the formatting of @ and # inside identifiers. This test partially covers the latter, but it would be better to have a separate test with clear description, explaining the @ and # are part of identifiers (e.g. not treated as operators).

…are part of identifiers

coveralls · 2017-07-04T11:41:26Z

Coverage remained the same at 100.0% when pulling 1d3d93e on db2-support into 69a1246 on master.

coveralls · 2017-07-04T11:45:11Z

Coverage remained the same at 100.0% when pulling b9b58bb on db2-support into 69a1246 on master.

nene · 2017-07-06T07:45:08Z

src/core/Tokenizer.js

@@ -49,6 +50,10 @@ export default class Tokenizer {
        return new RegExp(`^(${reservedWordsPattern})\\b`, "i");
    }

+    createWordRegex(specialChars = []) {
+        return new RegExp(`^([\\w|${specialChars.join("|")}]+)`);


nene · 2017-07-06T07:47:43Z

test/Db2FormatterTest.js

@@ -38,7 +38,8 @@ describe("Db2Formatter", function() {

    it("recognizes @ and # as identifiers", function() {


It would be more correct to say: "as part of identifiers"

coveralls · 2017-07-06T08:37:33Z

Coverage remained the same at 100.0% when pulling a0d16a4 on db2-support into 69a1246 on master.

Update CLI for 5.0.0 beta

This got added in the original PR for DB2 support: #25 Don't know how Uku interpreted the DB2 manual, but I for one don't see these characters being allowed in identifiers. Unfortunately I didn't check the manual myself when I originally accepted this PR.

Uku Pattak added 6 commits June 16, 2017 14:58

Make line comments configurable

d71f64a

Setup DB2 reserved words and formatter

dc8f3b2

Add missing semicolon

2de4607

Make col#1 and col@miami words

a37bfc6

Unfortunately I didn't find any other solution for this...

Update README and demo

92fd10e

I wonder why Atom is removing this semicolon

a727543

ukupat requested a review from nene June 17, 2017 17:58

ukupat self-assigned this Jun 17, 2017

ukupat added the feature label Jun 17, 2017

ukupat commented Jun 17, 2017

View reviewed changes

nene reviewed Jun 30, 2017

View reviewed changes

Uku Pattak added 3 commits July 4, 2017 13:44

Write a separate test with clear description, explaining the @ and # …

15eb73f

…are part of identifiers

Use @ and # chars as identifiers only in DB2 dialect formatter

c4c1da2

Remove useless test cfg

1d3d93e

Try to make newline and toplevel words list easier to compare

b9b58bb

nene reviewed Jul 6, 2017

View reviewed changes

Uku Pattak added 2 commits July 6, 2017 11:32

Improve DB2 test description

d64838d

Remove | from word regex

a0d16a4

ukupat merged commit 08af1f6 into master Jul 7, 2017

ukupat deleted the db2-support branch July 7, 2017 12:15

nene pushed a commit that referenced this pull request Apr 27, 2022

Merge pull request #25 from inferrinizzard/dev/update-cli

4b03803

Update CLI for 5.0.0 beta

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

IBM DB2 support #25

IBM DB2 support #25

Uh oh!

ukupat commented Jun 17, 2017

Uh oh!

ukupat Jun 17, 2017

Uh oh!

ukupat Jun 17, 2017

Uh oh!

nene Jun 30, 2017

Uh oh!

ukupat Jul 4, 2017

Uh oh!

coveralls commented Jun 17, 2017

Uh oh!

ukupat Jun 17, 2017

Uh oh!

nene Jun 30, 2017

Uh oh!

nene Jun 30, 2017

Uh oh!

coveralls commented Jul 4, 2017

Uh oh!

coveralls commented Jul 4, 2017

Uh oh!

nene Jul 6, 2017

Uh oh!

nene Jul 6, 2017

Uh oh!

coveralls commented Jul 6, 2017

Uh oh!

Uh oh!

		@@ -38,7 +38,8 @@ describe("Db2Formatter", function() {

		it("recognizes @ and # as identifiers", function() {

IBM DB2 support #25

IBM DB2 support #25

Uh oh!

Conversation

ukupat commented Jun 17, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coveralls commented Jun 17, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coveralls commented Jul 4, 2017

Uh oh!

coveralls commented Jul 4, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coveralls commented Jul 6, 2017

Uh oh!

Uh oh!