POC table joining #117

NathanQingyangXu · 2025-08-11T19:07:30Z

A little bit simplified than the original POC but given MongoDB's lookup and unwind stages are not identifical to SQL joining, there are still quite some subtle tech challenges.

Just realized this PR only covers basic @ManyToOne (maybe also @OneToOne, but definitively not @OneToMany(mappedBy=...), which requires tons of code refactoring on our MQL translator; I created a broken simple integration testing class so my colleagues who love challenge could try. It is hard, I have to prewarn here). Table joining is the heart and soul of SQL and Hibernate ORM, so we have to build our features little by little.

Take the following enitties with natural hiararchical relationshiop as example:

@Entity
@Table(name = "countries")
class Country { ... }

@Entity
@Table(name = "provinces")
class Province {
   ...
    @ManyToOne
    Country country
}

when we load a province, internally MQL translator is supposed to translate the table joining from provinces to countries; MQL's lookup and unwind will help as below:

during lookup stage, we need to create an array field containing the joining result; a natural way is to use the table alias generated by Hibernate (c1_0 for countries and p1_0 for provinces), so after the joining, provinces doc will contain a new array field named c1_0; given this is a toOne association, the new array field contains only one entry
during the next unwind stage, the array field will be transformed to a non-array field, ending up with SQL's cartisan product analog

but there is big difference between the above transformation and SQL's table joinining. SQL's table joining has no embedding relatinoshiop and each table involved in the join will have its own global namespace (or its table alias created automatically by Hibernate).

Let us consider the project stage which requires irrevocable changes to our existing code for sure. In the SQL AST model, the projection list will be as below:

SELECT p1_0._id, c1_0._id, c1_0.name, p1_0.name
from province as p1_0 left join countries as c1_0 
on p1_0.country__id = c1_0._id

So in our MQL doc, c1 is embedded as a new field of provinces collection, this PR introudces some minor code loigc to translate the above SQL projection list as below:

{
  "aggregate": "provinces",
  "pipeline": [
    {
      "$lookup": {
        "from": "countries",
        "localField": "country_code",
        "foreignField": "_id",
        "as": "c1_0"
      }
    },
    {
      "$unwind": {
        "path": "$c1_0"
      }
    },
    {
      "$match": {
        ... ...
      }
    },
    {
      "$project": {
        "f0": "$_id",
        "f1": "$c1_0._id",
        "f2": "$c1_0.name",
        "f3": "$name",
        "_id": false
      }
    }
  ]
}

Now new project fields need to be accommodated other than the entity fields per se, so the above projection relies on the projection set feature; the new assigned field names doesn't matter for Hibernate v6 only requires the order is aligned correctly (ResultSet JDBC methods accepting field names are not used by Hibernate ORM at all!).

Another complexity is the lookup could be recursive (e.g. cities -> provinces -> countries), but the final project SQL AST uses flat global table namespaces. We need to map the deeply nested lookup new field (e.g. p1_0.c1_0 is the new country nested doc in cities collection), so some transformation logic is required. That is why a new columnQualifierFullPaths map field was introduced in AbstractMqlTranslator, so it could translate c1_0.name SQL projection entry to p1_0.c1_0.name in the cities collection doc.

After the table joining emulation is sorted out, the following typical association relationships integration testing cases all passed:

@OneToOne
@ManyToOne
@OneToMany
@ManyToMany

… IT tests

NathanQingyangXu added 2 commits August 11, 2025 15:11

poc implementation of table joining

68f070c

fix existing testing case (mainly due to the change in project stage)

1401eaf

NathanQingyangXu changed the base branch from main to HIBERNATE-48 August 11, 2025 19:11

NathanQingyangXu force-pushed the poc-table-joining branch from fb554a9 to 1401eaf Compare August 11, 2025 19:34

NathanQingyangXu added 3 commits August 11, 2025 16:04

add broken testing case for @OneToMany

e310f08

add @OnetoOne mapping testing class

b9afa37

implement @onetomany translation

bb25e0b

vbabanin added the POC label Aug 13, 2025

NathanQingyangXu force-pushed the poc-table-joining branch from 6bf3d15 to 358a260 Compare August 14, 2025 02:42

add @manytomany integration testing case

7ed4f24

NathanQingyangXu force-pushed the poc-table-joining branch from 358a260 to 7ed4f24 Compare August 14, 2025 02:45

NathanQingyangXu added 2 commits August 13, 2025 23:00

avoid lookup as field name conflicting with existing entity column name

ea13bcb

enrich logic to cover more corner cases; add laziness verification to…

4009d86

… IT tests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

POC table joining #117

POC table joining #117

Uh oh!

NathanQingyangXu commented Aug 11, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

POC table joining #117

Are you sure you want to change the base?

POC table joining #117

Uh oh!

Conversation

NathanQingyangXu commented Aug 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

NathanQingyangXu commented Aug 11, 2025 •

edited

Loading