Metadata schema #23

Open

ianmcorvidae wants to merge 19 commits into cyverse-de:main from ianmcorvidae:metadata-schema

Member

ianmcorvidae commented Nov 25, 2021

This is not complete, but I was instructed to timebox it to the end of today. I think that these changes should all still be acceptable, but without finishing the remaining namespaces we won't be able to use the merged DB. One possible exception: I suspect that if anything I've changed uses transactions from other parts of the code (i.e. outside the persistence namespaces), those transactions will no longer apply. I was planning to do those after finishing the persistence namespace changes.

ianmcorvidae added 19 commits

October 20, 2021 15:49


          Very first integration of next.jdbc/honeysql, for a small function in favorites

9611ff4


          Use next.jdbc features better

6b017c1


          move select-favorites-of-type to next.jdbc

a90bf99


          migrate insert-favorite

40e7f7a


          Migrate delete-favorite

5eb6de1


          Migrate delete-favorites-of-type

c3b0f17


          Remove korma stuff from metadata.persistence.favorites namespace

7eaa8b8


          Start migrating tags to use next.jdbc/honeysql

ac877f5


          migrate tags-base-query and things using it

ccc6a23


          Migrate more tags stuff

fcba766


          Migrate some insert/update stuff in tags

760ceac


          finish up tags

07a71e5


          partially complete updates to avu persistence

3608ba3


          migrate delete-target-avu

37534cb


          hopefully-final AVUs work

28fd95e


          Migrate comments

e156bc8


          Migrate ontologies-table stuff

971ea25


          Migrate ontologies namespace

bdeeea4


          rename imports as in other persistence namespaces

13ea1ab

ianmcorvidae commented

src/metadata/persistence/avu.clj

+                            (dissoc :select) ;; change to select distinct
+                            (h/select-distinct cols)
+                            (h/where [:in :target_id target-ids]
+                                     [:in :target_type (map jtypes/as-other target-types)]

Member Author

ianmcorvidae Nov 25, 2021

First common change: jtypes/as-other is how we make things with our enums work properly; we don't need our ->enum-val any more.

src/metadata/persistence/avu.clj

Comment on lines +12 to +17

+              (def avu-columns [:id :attribute :value :unit :target_id :target_type :created_by :modified_by :created_on :modified_on])
+              (defn- avus-base-query
+                [cols]
+                (-> (apply h/select cols)
+                    (h/from (t "avus"))))

Member Author

ianmcorvidae Nov 25, 2021

I've been doing a pattern like these two definitions in a bunch of places. Basically, have a plain def with the most common setup of columns, plus a whatever-base-query function that takes a list of columns and selects them from the appropriate table. I'm not sure if it's super useful, but it does cut down a little bit at least. May be worth generalizing, idk
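
One way the generalization could look, as a rough sketch rather than anything in this PR: HoneySQL queries are plain Clojure maps, so a single helper could take the table as well as the columns. `base-query` and the `:avus` table keyword are hypothetical; the real code resolves table names through the `t` helper, which this sketch leaves out.

```clojure
;; Sketch only: builds the same kind of data map that
;; (-> (apply h/select cols) (h/from table)) produces, written with plain
;; Clojure so it runs without HoneySQL on the classpath.
(def avu-columns
  [:id :attribute :value :unit :target_id :target_type
   :created_by :modified_by :created_on :modified_on])

(defn base-query
  "Hypothetical generic helper: select `cols` from `table`."
  [table cols]
  {:select (vec cols)
   :from   [table]})

(defn avus-base-query
  "Per-namespace wrapper, so existing call sites would not change."
  [cols]
  (base-query :avus cols))

(avus-base-query [:id :attribute])
;; => {:select [:id :attribute], :from [:avus]}
```

Each namespace would then keep its own column `def` and a one-line wrapper, and the select/from boilerplate would live in one place.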

src/metadata/persistence/avu.clj

+                                     [:in :target_type (map jtypes/as-other target-types)]
+                                     [:in :attribute attributes]
+                                     [:in :value values]))]
+                  (plan/select! ds cols (sql/format q))))

Member Author

ianmcorvidae Nov 25, 2021

This line means:

turn the query into SQL + parameters
fetch results from the DB defined by ds
on each row, run select-keys cols, and return a collection of the results. If the cols slot is not a vector of keywords, it is applied to each row as a function instead (which also means that passing a single keyword extracts just that column)
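
The two behaviours of the cols slot can be imitated on ordinary maps. This is a plain-Clojure stand-in for illustration, not a call into next.jdbc: `rows` is made-up sample data and `apply-cols` is a hypothetical helper that only mirrors the behaviour described above.

```clojure
;; Made-up rows standing in for what the database would return.
(def rows
  [{:id 1 :attribute "color" :value "red" :unit ""}
   {:id 2 :attribute "size"  :value "10"  :unit "cm"}])

(defn apply-cols
  "Hypothetical helper: a vector of keywords acts like select-keys on
  each row; anything else is called on each row as a function."
  [cols rows]
  (if (and (vector? cols) (every? keyword? cols))
    (mapv #(select-keys % cols) rows)
    (mapv cols rows)))

(apply-cols [:id :value] rows)
;; => [{:id 1, :value "red"} {:id 2, :value "10"}]

(apply-cols :attribute rows)
;; => ["color" "size"]
```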

src/metadata/persistence/avu.clj

-                (first (select :avus (where {:id id}))))
+                (let [q (-> (avus-base-query avu-columns)
+                            (h/where [:= :id id]))]
+                  (plan/select-one! ds avu-columns (sql/format q))))

Member Author

ianmcorvidae Nov 25, 2021

This is exactly like select! but only does it for the first result row, and doesn't return a collection as a result.

Or basically it's the same as running first on the output of the same call with select!, except probably more efficient, I guess.

src/metadata/persistence/avu.clj

Comment on lines +74 to +77

+                  (jsql/insert-multi! ds
+                                      (t "avus")
+                                      cols
+                                      (map fmt-avu avus))))

Member Author

ianmcorvidae Nov 25, 2021

The format for these bulk inserts is different from korma's. Here we pass a collection of columns plus a collection of collections, where each inner collection holds the values for one row, in the same order as the columns listed before.

i.e. (:id :attribute ...) and then [("4609ef48-...." "some-attribute" ...), ...]
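
As a small sketch of getting from maps to that shape: `avu->row` below is a hypothetical stand-in for `fmt-avu` (whose definition is not shown here), and the AVU maps are made-up sample data. The point is only that every row must list its values in the same order as `cols`.

```clojure
;; Column order that the rows must follow.
(def cols [:id :attribute :value :unit])

;; Made-up AVU maps.
(def avus
  [{:id "id-1" :attribute "color" :value "red" :unit ""}
   {:id "id-2" :attribute "size"  :value "10"  :unit "cm"}])

(defn avu->row
  "Returns the values of `avu` in the same order as `cols`."
  [avu]
  (mapv #(get avu %) cols))

(def rows (mapv avu->row avus))

rows
;; => [["id-1" "color" "red" ""] ["id-2" "size" "10" "cm"]]

;; `cols` and `rows` are what would fill the last two argument slots:
;; (jsql/insert-multi! ds (t "avus") cols rows)
```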

src/metadata/persistence/comments.clj

Comment on lines +100 to +105

+                (let [insert-vals (jsql/insert! ds
+                                                (t "comments")
+                                                {:owner_id    owner
+                                                 :target_id   target-id
+                                                 :target_type (jtypes/as-other target-type)
+                                                 :value       comment})]

Member Author

ianmcorvidae Nov 25, 2021

Unlike the multi-insert, the single insert still takes a map of columns to values.

src/metadata/persistence/comments.clj

Comment on lines +137 to +141

+                (jsql/update! ds
+                              (t "comments")
+                              {:retracted false
+                               :retracted_by nil}
+                              {:id comment-id})

Member Author

ianmcorvidae Nov 25, 2021

Hopefully it's fairly self-explanatory how the update works. After the DB and table come the updates, then the conditions.

src/metadata/persistence/favorites.clj

-                      (aggregate (count :*) :cnt)
-                      (where {:target_id target-id :owner_id user}))
-                  first :cnt pos?))
+                (let [q (-> (h/select [[:> :%count.* 0] :is_favorite])

Member Author

ianmcorvidae Nov 25, 2021

This is a bit of a change. I made it do SELECT count(*) > 0 AS is_favorite rather than doing the pos? in clojureland.

src/metadata/persistence/ontologies.clj

Comment on lines 73 to +79

               (defn search-classes-subselect
                 [ontology-version search-term]
-                (-> (search-classes-base ontology-version search-term)
-                    (subselect (fields :iri))))
+                (let [search-term (str "%" (format-query-wildcards search-term) "%")]
+                  (-> (h/select :iri)
+                      (h/from (t "ontology_classes"))
+                      (h/where [:= :ontology_version ontology-version]
+                               [:ilike :label search-term]))))

Member Author

ianmcorvidae Nov 25, 2021

We weren't using search-classes-base anywhere else, and I think it was an artifact of the select* vs. select vs subselect distinction in korma due to it handling both query generation and execution. With honeysql + next.jdbc they're already separate, so h/select handles all three.

Member Author

ianmcorvidae Nov 25, 2021

I also used ilike instead of lowercasing both sides, which is inconsistent with one other spot. I guess we should check if that affects indexing and maybe use the lowercasing if that's better, but dunno

src/metadata/persistence/permanent_id_requests.clj

@@ -198,7 +198,7 @@
               (defn update-permanent-id-request
                 "Records the Permanent ID for a given Request."
                 [request-id permanent-id]
-                (sql/update :permanent_id_requests
+                (ksql/update :permanent_id_requests

Member Author

ianmcorvidae Nov 25, 2021

These updates are the bits in progress. I've been changing things over to use the ksql name for korma, since I'm using jsql for next.jdbc. It should go away ultimately anyway.

psarando reviewed

Member

psarando left a comment

So far so good 👍

src/metadata/persistence/favorites.clj

-                            :owner_id    user})))
+                (-> (h/select :target_id)
+                    (h/from (t "favorites"))
+                    (h/where [:in :target_type [:inline target-types]]

Member

psarando Dec 14, 2021

In the avu namespace [:in :target_type (map jtypes/as-other target-types)] was used.
Is it not needed here, or is this an alternative expression?

Member Author

ianmcorvidae Dec 14, 2021

I'm... actually not sure. I did this namespace first, so it could be that I actually have it wrong in the other namespace. I think they might work as alternates though -- jtypes/as-other will mark them correctly to be included as bound parameters, while :inline will inline them in the expression rather than using bind parameters at all. I think that since in raw SQL without bind parameters you can basically treat enums like strings, it might just work out the same.

I think this inline version is a bit cleaner, so when I get a chance to get back to this and test it out, I might just use that everywhere I can.

src/metadata/persistence/favorites.clj

-                  (where {:owner_id    user
-                          :target_type [in (map db/->enum-val target-types)]})))
+                (let [q (-> (h/delete-from (t "favorites"))
+                            (h/where [:in :target_type [:inline target-types]]

Member

psarando Dec 14, 2021

Same question about [:in :target_type (map jtypes/as-other target-types)] here.

src/metadata/persistence/ontologies.clj

Comment on lines +61 to +65

+                    (let [new-values (mapv #(vector (:ontology_version %) (:iri %) (:label %) (:description %)) class-values)]
+                      (jsql/insert-multi! ds
+                                          (t "ontology_classes")
+                                          [:ontology_version :iri :label :description]
+                                          new-values)))))

Member

psarando Dec 14, 2021

This LGTM as-is, but alternatively, if we wanted to define the set of update columns only once:

Suggested change

-                    (let [new-values (mapv #(vector (:ontology_version %) (:iri %) (:label %) (:description %)) class-values)]
-                      (jsql/insert-multi! ds
-                                          (t "ontology_classes")
-                                          [:ontology_version :iri :label :description]
-                                          new-values)))))
+                    (let [update-cols [:ontology_version :iri :label :description]
+                          new-values (mapv #(mapv (partial get %) update-cols) class-values)]
+                      (jsql/insert-multi! ds
+                                          (t "ontology_classes")
+                                          update-cols
+                                          new-values)))))

I'm not sure if this way is more confusing, though 🤔

Member Author

ianmcorvidae Dec 14, 2021

Hm, I think I like your version a bit more. I was trying to avoid having the same lists multiple times, but I think I was also getting a bit hurried by the time I was doing ontologies and didn't think of doing it that particular way.
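
For anyone comparing the two forms, a quick check on made-up data suggests they build the same rows. The `class-values` maps below are invented for illustration, including the extra `:unused` key and the `nil` description.

```clojure
;; Invented sample input; real class-values come from the caller.
(def class-values
  [{:ontology_version "v1" :iri "http://example.org/a" :label "A"
    :description "first" :unused true}
   {:ontology_version "v1" :iri "http://example.org/b" :label "B"
    :description nil}])

(def update-cols [:ontology_version :iri :label :description])

;; Original form: each column is spelled out by hand.
(def explicit-rows
  (mapv #(vector (:ontology_version %) (:iri %) (:label %) (:description %))
        class-values))

;; Suggested form: the column list drives the lookup.
(def generic-rows
  (mapv #(mapv (partial get %) update-cols) class-values))

(= explicit-rows generic-rows)
;; => true
```

Keys that are not in `update-cols` are ignored and missing or `nil` values come through as `nil` in both forms, so the suggested version should be a drop-in replacement as long as the column vector stays in insert order.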

Labels

None yet