Using connection string schema when working with metadata. #1156

cstiborg · 2024-07-23T07:36:52Z

Adding possiblity for database backends to register the schema to use when working with metadata.

Currently this is hardcoded to public.

PostgreSQL and MySQL have been implemented.

… when working with metadata

cstiborg · 2024-07-23T08:19:09Z

I'm trying to figure out what is going on with the PostgreSQL check - I can reproduce the error in a clean SOCI master build - but not in my cstiborg master.

vadz

Sorry if I'm missing something, but I expected this PR to add the possibility to set the schema to use, is this something you plan to do later or did I misunderstand you?

include/soci/soci-backend.h

src/backends/postgresql/session.cpp

cstiborg · 2024-07-23T14:40:55Z

While MySQL and Sqlite seems to be no brainers, PostgreSQL or any of the other supported databases which in fact do support schemas requires some decisions to be made.

I did start out with an attempt of adding an extra parameter to the metadata methods - however, I got discouraged when I realized that:

It's not necessary for MySQL and SQLite
For consistency it would extent to the DDL code as well, despite it not having the same issue of public being hardcoded.

I then thought, maybe foolishly, that when you connect to the database using a search_path, PostgreSQL will (I know PostgreSQL the best of all the backends supported and I lack the infrastructure and experience to test any changes on other platforms) actually do what you want it to do in terms of DDL.

While I can see the purpose of being able to override a schema in the API, I don't believe such a solution should stand alone as everything else seems to be working out of the box, while the metadata currently always reads from public.

So, what I set out to do is mimicking the functionality that I see in the DDL - which of course is governed by the underlying library or RDBMS itself, as I believe this will lead to the least confusion in terms of consistency between the different parts of the SOCI API.

…etadata functions

…orks

cstiborg · 2024-08-11T15:10:34Z

I've added all requirements set up in the comments:

MySQL is now not looking in the public schema for table configuration, instead it looks in the schema for the database.
PostgreSQL will return table names in .<table_name> format.
PostgreSQL will parse the search path coming from the database and use it to look up tables if a table name using the .<table_name> format isn't used.
The PostgreSQL test is updated accordingly.

Apparently there is an effect on Oracle and SQLite3 so I will work on that.

…schema

Work done in soci

…latforms

cstiborg · 2024-08-19T11:19:13Z

It seems that only macOS tests and Windows tests are failing now (macOS due to lack of funds to run the test and windows due to postgresql missing in the test image)

I believe the rest is good.

vadz

Thanks for for the updates!

I've hopefully fixed the CI in #1161 (definitely for macOS, still waiting for AppVeyor), so the next push should run all CI jobs successfully.

But there are some changes needed, notably to make code more obviously safe and correct, i.e. avoid delete[] (by using std::string), PGClear() (by using postgresql_result RAII helper) and, I think, also avoid the linked list of schema table name objects: why do we bother with this instead of just using a std::vector<> of them?

include/private/soci-compiler.h

include/soci/column-info.h

include/soci/mysql/soci-mysql.h

include/soci/session.h

src/backends/postgresql/session.cpp

src/core/session.cpp

Get newest tests

cstiborg · 2024-08-25T14:41:08Z

... and, I think, also avoid the linked list of schema table name objects: why do we bother with this instead of just using a std::vector<> of them?

The point of not using std::vector, is that as it fills up it may move memory around on the heap. The reason for having the data allocated on the heap in the first place is that the prepare_temp_type created to fetch the column data and table data uses the memory addresses of the parameters of the prepared statement, thus these cannot change for the lifetime of the prepare_temp_type object.
I've changed it to a std::forward_list.

vadz

Thanks for the updates, changes look broadly good but could we please have some new tests showing how can this actually be used?

And updating the docs would be useful too.

TIA!

include/private/soci-compiler.h

include/soci/column-info.h

src/backends/postgresql/session.cpp

include/soci/session.h

src/core/session.cpp

tests/postgresql/test-postgresql.cpp

src/backends/postgresql/session.cpp

vadz · 2024-08-25T14:55:18Z

The point of not using std::vector, is that as it fills up it may move memory around on the heap.

Thanks, I've realized this too now, while rereading the code and using std::forward_list is fine, but it would be worth adding a comment saying that we use the list because we need the pointers/references to remain stable as this might not be obvious (as it wasn't do me).

Co-authored-by: VZ <vz-github@zeitlins.org>

cstiborg · 2024-08-25T15:45:04Z

I will get the docs updated as well as adding some extra tests.

cstiborg · 2024-08-26T17:51:05Z

I've added an extra test to PostgreSQL and I've ported both DDL tests from PostgreSQL to MySQL with minor changes.
I've also added a small amendment to the docs - as this PR isn't so much about changing functionality as attempting to do what the documentation promises.
This makes me want to add that I am only changing MySQL and PostgreSQL, which makes the current status look like:

DB2 - Uses the "old" default implementation which will look in the public schema - i.e. highly likely not doing anything constructive.
Firebird - Uses the "old" default implementation which will look in the public schema - i.e. highly likely not doing anything constructive.
MySQL - I've updated the implementation and added tests. Supports schemas.
ODBC - Uses the "old" default implementation which will look in the public schema - i.e. highly likely not doing anything constructive. Also, how would this ever work if you don't know which RDBMS you are working with?
Oracle - Has its own implementation, which I believe will work. It does not take schemas into consideration.
PostgreSQL - I've updated the implementation and added tests. Supports schemas.
Sqlite3 - has it's own implementation for collecting column info, sqlite3 doesn't use schemas.

For the missing RDBMSes, as I've mentioned earlier, I haven't got access to those anywhere, where I can test against them.

…ferent results between Linux MySQL and Windows MySQL)

…test (Due to differences in MySQL between Linux and Windows)

vadz

Thanks, looks good now, except for one simple refactoring that I'd like to do just to avoid duplicating the code dealing with the current user name.

To answer your previous comment: I think nobody cares about DB/2 (or, if somebody still does, I haven't encountered them yet) and Firebird and SQLite don't support schemas at all (they use multiple databases instead, but this is not the same thing), so I am not sure how to interpret "highly likely not doing anything constructive" — do you mean that this doesn't improve anything for, e.g., FB? If so, this is true, of course, but I don't see what else could be done for it, as it doesn't support schemas anyhow.

ODBC could be improved because we can know which database we're connected to, see odbc_session_backend::get_database_product() and it could indeed be nice to have schema support for SQL Server (which can only be used via this backend) and PostgreSQL (which sometimes happens to be used via ODBC rather than natively due to whatever reasons).

Please let me know if you'd like to add support for this to ODBC backend too (this would require some refactoring) or if we should merge this as is.

Thanks again!

src/backends/postgresql/session.cpp

Co-authored-by: VZ <vz-github@zeitlins.org>

cstiborg · 2024-09-01T12:04:30Z

"highly likely not doing anything constructive"

For Firebird this means it is the default code - as it has been since the original implementation. I know nothing about Firebird, except that it doesn't use schemas, as you just taught me. However, without schemas, and the default implementation looking in information_schema.tables and information_schema.columns I am almost certain that it will simply fail when using the soci metadata api. I haven't tested it, so I don't know for sure, but it doesn't add up for me. That's what I meant.

There is a specialisation for SQLite3 - still hasn't tested it so I'm just talking - which I assume will work as it collects table information from the sqlite_master table and some SQLite3 voodoo (pragma_table_info(<table_name>)) for the columns. Again, I don't know anything about Firebird, but possibly something like that could be done to make the metadata API work across the board. If there's no way to get the data, then I guess the error produced by FB is as good as any error.

Please let me know if you'd like to add support for this to ODBC backend too (this would require some refactoring) or if we should merge this as is.

Tricky question. A couple of things: I am working on SOCI because I use it for another project. That project currently supports MySQL and PostgreSQL. Which means I have a setup to test that. I have a requirement to support MSSQL as well, however, I'm not quite there yet. So, I would like to make ODBC work, particularly for MSSQL. But I don't have a test setup yet.

All of this makes me think, that I personally, would appreciate to have the current PR merged, and I will then work on ODBC (MSSQL and PostgreSQL) when I'm there on my other project.

Adding possiblity for database backends to register the schema to use…

83a46f8

… when working with metadata

cstiborg mentioned this pull request Jul 23, 2024

Metadata and schemas other than public #1155

Open

Following coding standard

62224ad

vadz reviewed Jul 23, 2024

View reviewed changes

include/soci/soci-backend.h Outdated Show resolved Hide resolved

src/backends/postgresql/session.cpp Outdated Show resolved Hide resolved

vadz marked this pull request as draft July 23, 2024 16:05

cstiborg added 7 commits August 2, 2024 15:44

MySQL uses the word "int" for integer types

e50e135

Adding functionality for MySQL and PostgreSQL to use schemas in the m…

bca17c1

…etadata functions

Updating tests for metadata

e0efa58

Adding a bit of debug info to figure out how the postgresql version w…

3d8a719

…orks

Always collect schema from DB for metadata

5f42362

Ignore schema for tests

09955e6

Handling memory leak

97e9cbe

cstiborg marked this pull request as ready for review August 11, 2024 12:14

Merge branch 'master' into master

dce77e3

cstiborg marked this pull request as draft August 11, 2024 15:09

cstiborg added 3 commits August 12, 2024 08:40

Handling both cases in prepare_column_descriptions: With and without …

82e317e

…schema

Merge branch 'master' of github.com:cstiborg/soci

f5e0cc7

Work done in soci

Remove unused header

5075c50

cstiborg marked this pull request as ready for review August 12, 2024 09:02

cstiborg added 5 commits August 12, 2024 11:02

Adding support for bigint and double types

9ff2fc0

Using predefined BOOST definition to handle compilation on multiple p…

bd9e7ce

…latforms

Handling memory leak

01e1ce5

Using correct type

cbd81a4

Implementing SOCI_FALLTHROUGH for multiple platforms

7af75a4

vadz requested changes Aug 19, 2024

View reviewed changes

Merge pull request #2 from SOCI/master

3d0265a

Get newest tests

Adding MSVC support for fallthrough

2df0434

vadz reviewed Aug 25, 2024

View reviewed changes

cstiborg and others added 6 commits August 25, 2024 17:01

Update src/backends/postgresql/session.cpp

bb1b16f

Co-authored-by: VZ <vz-github@zeitlins.org>

Update src/backends/postgresql/session.cpp

fca8d97

Co-authored-by: VZ <vz-github@zeitlins.org>

Update include/soci/session.h

04300dc

Co-authored-by: VZ <vz-github@zeitlins.org>

Update src/backends/postgresql/session.cpp

23a70d0

Co-authored-by: VZ <vz-github@zeitlins.org>

Update include/private/soci-compiler.h

90afe84

Co-authored-by: VZ <vz-github@zeitlins.org>

Merge branch 'master' into master

6606faa

Removing usage of heap operations, new and delete

ac5ee7b

cstiborg marked this pull request as draft August 26, 2024 14:40

cstiborg added 2 commits August 26, 2024 16:53

Adding paragraph on how to use schema with table

c60f4f2

Adding new tests for metadata

10cd4ab

cstiborg marked this pull request as ready for review August 26, 2024 16:57

cstiborg added 8 commits August 26, 2024 17:56

Removing nullable test, which is irrelevant (and apparently gives dif…

c568656

…ferent results between Linux MySQL and Windows MySQL)

Debug for test on Windows

6e3b5e6

Using case insensitive checks for table and column names in metadata …

aa2526a

…test (Due to differences in MySQL between Linux and Windows)

Redesigning test for MySQL to discover information_schema.tables first

afcd127

Printing more debug info

34b194a

Attemting more debug output

03705a5

Only table names output may be upper- or lowercase

f8fd4bd

Ignoring case on MySQL metadata column info query

e3f6dfd

cstiborg requested a review from vadz August 28, 2024 07:13

vadz reviewed Aug 29, 2024

View reviewed changes

src/backends/postgresql/session.cpp Outdated Show resolved Hide resolved

src/backends/postgresql/session.cpp Outdated Show resolved Hide resolved

cstiborg and others added 2 commits September 1, 2024 13:14

Update src/backends/postgresql/session.cpp

895f20e

Co-authored-by: VZ <vz-github@zeitlins.org>

Remove duplicate code

400c716

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using connection string schema when working with metadata. #1156

Using connection string schema when working with metadata. #1156

cstiborg commented Jul 23, 2024

cstiborg commented Jul 23, 2024

vadz left a comment

cstiborg commented Jul 23, 2024

cstiborg commented Aug 11, 2024

cstiborg commented Aug 19, 2024

vadz left a comment

cstiborg commented Aug 25, 2024

vadz left a comment

vadz commented Aug 25, 2024

cstiborg commented Aug 25, 2024

cstiborg commented Aug 26, 2024

vadz left a comment

cstiborg commented Sep 1, 2024

Using connection string schema when working with metadata. #1156

Are you sure you want to change the base?

Using connection string schema when working with metadata. #1156

Conversation

cstiborg commented Jul 23, 2024

cstiborg commented Jul 23, 2024

vadz left a comment

Choose a reason for hiding this comment

cstiborg commented Jul 23, 2024

cstiborg commented Aug 11, 2024

cstiborg commented Aug 19, 2024

vadz left a comment

Choose a reason for hiding this comment

cstiborg commented Aug 25, 2024

vadz left a comment

Choose a reason for hiding this comment

vadz commented Aug 25, 2024

cstiborg commented Aug 25, 2024

cstiborg commented Aug 26, 2024

vadz left a comment

Choose a reason for hiding this comment

cstiborg commented Sep 1, 2024