Question loading refactor #544

jsharkey13 · 2023-06-29T15:22:13Z

For historic reasons (MongoDB! 😡), when loading a [question page/topic summary/gameboard] we would load every single question attempt a user had ever made. We did this on question pages because we need both the attempts at the question parts on the question page, but also the attempts at any related questions. The others just inherited the approach.

For registered users we can optimise this. Instead we now load the question attempts for the specific page in full; i.e. the complete QuestionValidationResponses. We need these in full for the bestAttempt attribute on the question page.
For related questions and gameboard items, we can instead load LightweightQuestionValidationResponses since we don't need to deserialise all of the JSON to say whether they are complete or not. We can walk the content and only load the questions for the relevant question page IDs.
Anonymous users have less question attempts and there are no equivalent methods for loading a subset of attempts (they still live in one JSON blob just like MongoDB), so we just load everything for them.

There's my new favourite Java type: Map<String, ? extends Map<String, ? extends List<? extends LightweightQuestionValidationResponse>>> all over the place, instead of Map<String, Map<String, List<QuestionValidationResponse>>> or the lightweight equivalent. I had used this before in some places, but now it is everywhere. This allows us to say "we only need Lightweight validation responses" but accept full validation responses in some cases (like anonymous users for whom we only ever have full responses). It's a pain to read, but very powerful.

The main place left, that this improved approach won't work for, but that still faces the main issue is the generateTemporaryGameboard method; i.e. making a new random gameboard. We try to load questions you have not completed; but how to we do that without loading the questions in advance?

These methods alter the passed in object. We did not use the returned value, but it suggested that the returned object may be different. Remove it so that it is clear this is in-place modification.

For historic reasons (MongoDB!), when loading a question page we would load every single question attempt a user had ever made. We did this because we need both the attempts at the question parts on the question page, but also the attempts at any related questions. For registered users we can optimise this. Instead we now load the question attempts for the specific page in full; i.e. the complete QuestionValidationResponses. We need these for the bestAttempt attribute on the question page. For related questions, we can instead load LightweightQuestionValidationResponses since we don't need to deserialise all of the JSON to say whether they are complete or not. We can walk the content and only load the questions for the relevant question page IDs. Anonymous users have less question attempts and there are no equivalent methods for loading a subset of attempts (they still live in one JSON blob just like MongoDB), so we just load everything for them. The efficiency gains for highly active users are immense; for a user with thousands of question attempts, the page load time speedup is 10x. Most of the gain is from no longer deserialising irrelevant attempts, converting them back to strings for the ETag calculation.

Again, we don't need to load every question a user has ever made when loading a topic summary page.

This likely still isn't right, but it is better than before.

Again, we don't need to load every question attempt for registered users, only those relevant to the board.

codecov · 2023-06-29T15:27:13Z

Codecov Report

Patch coverage has no change and project coverage change: -0.05 ⚠️

Comparison is base (994eb01) 26.48% compared to head (131a34c) 26.43%.

❗ Current head 131a34c differs from pull request most recent head a069383. Consider uploading reports for the commit a069383 to get more accurate results

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #544      +/-   ##
==========================================
- Coverage   26.48%   26.43%   -0.05%     
==========================================
  Files         504      504              
  Lines       22896    22935      +39     
  Branches     2834     2839       +5     
==========================================
  Hits         6064     6064              
- Misses      16281    16320      +39     
  Partials      551      551

Impacted Files	Coverage Δ
...a/uk/ac/cam/cl/dtg/isaac/api/GameboardsFacade.java	`9.79% <0.00%> (-0.07%)`	⬇️
...n/java/uk/ac/cam/cl/dtg/isaac/api/PagesFacade.java	`19.35% <0.00%> (-1.47%)`	⬇️
.../ac/cam/cl/dtg/isaac/api/managers/GameManager.java	`11.72% <ø> (ø)`
...k/ac/cam/cl/dtg/isaac/quiz/PgQuestionAttempts.java	`8.19% <0.00%> (-0.43%)`	⬇️
...cam/cl/dtg/segue/api/managers/QuestionManager.java	`4.73% <0.00%> (-0.03%)`	⬇️

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

jsharkey13 · 2023-06-29T15:54:19Z

~~Do not merge; there seem to be some caching bugs!~~
These issues have been addressed in another PR; this one is good to go.

src/main/java/uk/ac/cam/cl/dtg/segue/api/managers/QuestionManager.java

mlt47 · 2023-07-10T16:41:55Z

src/main/java/uk/ac/cam/cl/dtg/isaac/api/PagesFacade.java

@@ -505,6 +521,7 @@ public final Response getTopicSummaryPage(@Context final Request request,

        // Calculate the ETag on current live version of the content
        // NOTE: Assumes that the latest version of the content is being used.
+        // Should this include the question attempts?


I would have thought so...
I think the reason this isn't a problem is that topic summary pages now don't have any related questions. They might have at some point although I think we moved to related gameboards quite quickly and then dropped them too.

src/main/java/uk/ac/cam/cl/dtg/segue/api/managers/QuestionManager.java

src/main/java/uk/ac/cam/cl/dtg/isaac/api/PagesFacade.java

The names now make the return types much clearer.

jsharkey13 added 5 commits June 28, 2023 16:33

Remove return type of in-place augmentation methods

ec8f7d1

These methods alter the passed in object. We did not use the returned value, but it suggested that the returned object may be different. Remove it so that it is clear this is in-place modification.

Only load relevant questions for topic page views

88fc457

Again, we don't need to load every question a user has ever made when loading a topic summary page.

Adjust topic summary caching behaviour

4b06420

This likely still isn't right, but it is better than before.

Avoid loading unnecessary questions for gameboards

131a34c

Again, we don't need to load every question attempt for registered users, only those relevant to the board.

mlt47 reviewed Jul 11, 2023

View reviewed changes

Rename lightweight question manager methods

a069383

The names now make the return types much clearer.

mlt47 merged commit 1d461b9 into master Jul 11, 2023
3 checks passed

mlt47 deleted the feature/question-loading-refactor branch July 11, 2023 11:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question loading refactor #544

Question loading refactor #544

jsharkey13 commented Jun 29, 2023

codecov bot commented Jun 29, 2023 •

edited

Loading

jsharkey13 commented Jun 29, 2023 •

edited

Loading

mlt47 Jul 10, 2023

Question loading refactor #544

Question loading refactor #544

Conversation

jsharkey13 commented Jun 29, 2023

codecov bot commented Jun 29, 2023 • edited Loading

Codecov Report

jsharkey13 commented Jun 29, 2023 • edited Loading

mlt47 Jul 10, 2023

Choose a reason for hiding this comment

codecov bot commented Jun 29, 2023 •

edited

Loading

jsharkey13 commented Jun 29, 2023 •

edited

Loading