PGAdapter supports SQLAlchemy 2.0 with the psycopg driver. This document shows how to use this sample application and lists the limitations when working with SQLAlchemy 2.x with PGAdapter.
The sample.py file contains a sample application that uses SQLAlchemy 2.x with PGAdapter. Use it as a reference for the SQLAlchemy 2.x features that are supported with PGAdapter. The sample assumes that the reader is familiar with SQLAlchemy 2.x; it is not intended as a tutorial for how to use SQLAlchemy in general.
See Limitations for a full list of known limitations when working with SQLAlchemy 2.x.
You can run the sample directly on the Spanner emulator with the following command. It will automatically start both PGAdapter and the Spanner emulator. This requires Docker on the local machine to work:
```shell
python run_sample.py
```
You can also connect to a real Spanner instance instead of the emulator by running the sample as shown below. The database must already exist; the sample automatically creates the tables that it needs:
```shell
python run_sample.py \
  --project my-project \
  --instance my-instance \
  --database my-database \
  --credentials /path/to/credentials.json
```
You can also start PGAdapter before you run the sample and connect to that PGAdapter instance. The following command shows how to start PGAdapter using the pre-built Docker image and then run the sample against that instance. See Running PGAdapter for more information on other options for how to run PGAdapter.
```shell
export GOOGLE_APPLICATION_CREDENTIALS=/path/to/credentials.json
docker pull gcr.io/cloud-spanner-pg-adapter/pgadapter
docker run \
  -d -p 5432:5432 \
  -v ${GOOGLE_APPLICATION_CREDENTIALS}:${GOOGLE_APPLICATION_CREDENTIALS}:ro \
  -e GOOGLE_APPLICATION_CREDENTIALS \
  -v /tmp:/tmp \
  gcr.io/cloud-spanner-pg-adapter/pgadapter \
  -p my-project -i my-instance \
  -x
python run_sample.py \
  --host localhost \
  --port 5432 \
  --database my-database
```
The sample data model is created automatically by the sample script.
The sample data model contains example tables that cover all data types of the Cloud Spanner PostgreSQL dialect that are supported. It also includes an example of how interleaved tables can be used with SQLAlchemy. Interleaved tables are a Cloud Spanner extension of the standard PostgreSQL dialect.
The corresponding SQLAlchemy model is defined in model.py.
You can also create the sample data model manually, for example with psql. Run the following command in this directory, replacing the host, port, and database name with the actual values for your PGAdapter and database setup:
```shell
psql -h localhost -p 5432 -d my-database -f create_data_model.sql
```
You can also drop an existing data model using the drop_data_model.sql script:
```shell
psql -h localhost -p 5432 -d my-database -f drop_data_model.sql
```
Cloud Spanner supports the following data types in combination with SQLAlchemy 2.x.
PostgreSQL Type | SQLAlchemy type |
---|---|
boolean | Boolean |
bigint / int8 | Integer, BigInteger |
varchar | String |
text | String |
float8 / double precision | Float |
numeric | Numeric |
timestamptz / timestamp with time zone | DateTime(timezone=True) |
date | Date |
bytea | LargeBinary |
jsonb | JSONB |
Arrays | ARRAY |
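For illustration, the mappings above can be combined into a single model like the following sketch. The table and column names here are hypothetical and are not part of the sample:

```python
from sqlalchemy import (
    ARRAY, BigInteger, Boolean, Column, Date, DateTime,
    Float, LargeBinary, Numeric, String,
)
from sqlalchemy.dialects.postgresql import JSONB
from sqlalchemy.orm import declarative_base

Base = declarative_base()

class AllTypes(Base):
    """Hypothetical table that touches each supported type mapping."""
    __tablename__ = "all_types"
    col_bigint = Column(BigInteger, primary_key=True)
    col_bool = Column(Boolean)
    col_varchar = Column(String(100))
    col_float8 = Column(Float)
    col_numeric = Column(Numeric)
    col_timestamptz = Column(DateTime(timezone=True))
    col_date = Column(Date)
    col_bytea = Column(LargeBinary)
    col_jsonb = Column(JSONB)
    col_array_bigint = Column(ARRAY(BigInteger))
```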
The following limitations are currently known:
Limitation | Workaround |
---|---|
Creating and Dropping Tables | Cloud Spanner does not support the full PostgreSQL DDL dialect. Automated creation of tables using SQLAlchemy is therefore not supported. |
metadata.reflect() | Cloud Spanner does not support all PostgreSQL pg_catalog tables. Using metadata.reflect() to get the current objects in the database is therefore not supported. |
DDL Transactions | Cloud Spanner does not support DDL statements in a transaction. Add ?options=-c spanner.ddl_transaction_mode=AutocommitExplicitTransaction to your connection string to automatically convert DDL transactions to non-atomic DDL batches. |
INSERT ... ON CONFLICT | INSERT ... ON CONFLICT is not supported. |
SAVEPOINT | Rolling back to a SAVEPOINT can fail if the transaction contained at least one query that called a volatile function. |
SELECT ... FOR UPDATE | Only SELECT ... FOR UPDATE without additional options is supported. The NOWAIT and SKIP LOCKED options are not supported. |
Server side cursors | Server side cursors are currently not supported. |
Transaction isolation level | Only SERIALIZABLE and AUTOCOMMIT are supported. postgresql_readonly=True is also supported. It is recommended to use either autocommit or read-only for workloads that only read data and/or that do not need to be atomic to get the best possible performance. |
Stored procedures | Cloud Spanner does not support Stored Procedures. |
User defined functions | Cloud Spanner does not support User Defined Functions. |
Other drivers than psycopg 3.x | PGAdapter does not support using SQLAlchemy 2.x with any driver other than psycopg 3.x. |
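As an example of applying one of the workarounds above, the spanner.ddl_transaction_mode option can be appended to the connection string. This is a sketch with placeholder user, password, host, port, and database values:

```python
# Placeholder connection settings; adjust to your PGAdapter setup.
options = "-c spanner.ddl_transaction_mode=AutocommitExplicitTransaction"
conn_string = (
    "postgresql+psycopg://user:password@localhost:5432/my-database"
    "?options=" + options
)
# With SQLAlchemy installed, the engine is then created as usual:
# engine = create_engine(conn_string)
print(conn_string)
```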
Spanner supports the serial data type. Columns with this data type use a backing bit-reversed sequence to generate unique values that are safe to use as primary key values in Spanner. The values are not monotonically increasing.
You must set a default_sequence_kind for your database before you can create a column with data type serial. The create_data_model.sql file includes a statement to set the default.
The TicketSale model in this sample application uses a serial column for auto-generated primary keys:
- See model.py for the model definition.
- See create_data_model.sql for the table definition.
See https://cloud.google.com/spanner/docs/primary-key-default-value#serial-auto-increment for more information on the serial data type in Spanner.
Python model definition:

```python
from sqlalchemy import Column, Integer, String
from sqlalchemy.orm import declarative_base

Base = declarative_base()

class Singer(Base):
    __tablename__ = "singers"
    id = Column(Integer, primary_key=True)
    name = Column(String(100))

singer = Singer(name="Alice")
session.add(singer)
session.commit()
```
Database option and table definition:

```sql
alter database db set spanner.default_sequence_kind='bit_reversed_positive';

create table if not exists singers (
    id serial primary key,
    name varchar not null
);
```
INSERT ... ON CONFLICT ... clauses are not supported by Cloud Spanner and should not be used. Trying to use https://docs.sqlalchemy.org/en/20/dialects/postgresql.html#sqlalchemy.dialects.postgresql.Insert.on_conflict_do_update or https://docs.sqlalchemy.org/en/20/dialects/postgresql.html#sqlalchemy.dialects.postgresql.Insert.on_conflict_do_nothing will fail.
Rolling back to a SAVEPOINT can fail if the transaction contained at least one query that called a volatile function, or if the underlying data that was accessed by the transaction has been modified by another transaction.
Only SELECT ... FOR UPDATE without any additional options is supported. The NOWAIT and SKIP LOCKED options are not supported.
SQLAlchemy will by default use read/write transactions for all database operations, including workloads that only read data. This causes Cloud Spanner to take locks for all data that is read during the transaction. It is recommended to use either autocommit or read-only transactions for workloads that are known to only execute read operations. Read-only transactions do not take any locks. You can create a separate database engine for read-only transactions from your default database engine by adding the postgresql_readonly=True execution option.
```python
read_only_engine = engine.execution_options(postgresql_readonly=True)
```
Using isolation level AUTOCOMMIT will suppress the use of (read/write) transactions for each database operation in SQLAlchemy. Autocommit is more efficient than read/write transactions for workloads that only read and/or do not need the atomicity offered by transactions. You can create a separate database engine for workloads that do not need transactions by adding the isolation_level="AUTOCOMMIT" execution option to your default database engine.
```python
autocommit_engine = engine.execution_options(isolation_level="AUTOCOMMIT")
```
Read-only transactions and database engines using AUTOCOMMIT will by default use strong reads for queries. Cloud Spanner also supports stale reads.
- A strong read is a read at a current timestamp and is guaranteed to see all data that has been committed up until the start of this read. Spanner defaults to using strong reads to serve read requests.
- A stale read is read at a timestamp in the past. If your application is latency sensitive but tolerant of stale data, then stale reads can provide performance benefits.
See also https://cloud.google.com/spanner/docs/reads#read_types
You can create a database engine that will use stale reads in autocommit mode by adding the following to the connection string and execution options of the engine:
```python
conn_string = (
    "postgresql+psycopg://user:password@localhost:5432/my-database"
    "?options=-c spanner.read_only_staleness='MAX_STALENESS 10s'"
)
engine = create_engine(conn_string).execution_options(isolation_level="AUTOCOMMIT")
```