hash un-numeric input when used as PRNG seed, fixes #800 #801

brontolosone · 2024-10-14T18:53:21Z

Closes #800

As discussed on Slack, but there may be more to discuss!

What has been done to verify that this works as intended?

Ran the tests, added some tests too. Ideally I'd put this JR version in an ODKCollect to witness some choice randomization change when altering the seed value in this test form, but I didn't manage that today ;-)

Why is this the best possible solution? Were any other approaches considered?

The nice thing is that it doesn't change existing randomizations that were working well (eg, strings interpretable as numerics). It only changes randomizations for which the seed wasn't working (eg, strings not interpretable as numerics).

How does this change affect users? Describe intentional changes to behavior and behavior that could have accidentally been affected by code changes. In other words, what are the regression risks?

In very specific use cases this can cause an unwelcome change.

For instance: In some survey the randomization of the order of choice fields is part of the survey. Also, it wasn't actually working due to issue #800, but anyway. Now if ODKCollect is updated with this new OpenRosa version halfway through the survey, some users will have seen randomization X for seed Z while others will have seen randomization Y for that same seed Z. If the research draws conclusions based on either X or Y, then the researchers need to know which version of ODKCollect was used so they know (or can deduce) which choice randomization went with seed Z.

Do we need any specific form for testing your changes? If so, please attach one.

as mentioned above, running ODKCollect on this form and witnessing the choice list reshuffle based on string input variations in the seed field, would be extra convincing.

Does this change require updates to documentation? If so, please file an issue here and include the link below.

Maybe. We could say something like:

"As of ODKCollect version X, the choice randomization based on using an RNG seed from some other field actually works for more inputs (notably, any text). If you have ongoing research that depends on this type of randomization, and where the analysis is dependent on the specific choice list ordering that used to go with certain seeds, you may want to test with this new version of ODKCollect to see if your choice list randomization will change, and decide to not upgrade ODKCollect while this research is ongoing".

hash un-numeric input when used as PRNG seed, fixes getodk#800

db109ec

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hash un-numeric input when used as PRNG seed, fixes #800 #801

hash un-numeric input when used as PRNG seed, fixes #800 #801

brontolosone commented Oct 14, 2024

hash un-numeric input when used as PRNG seed, fixes #800 #801

Are you sure you want to change the base?

hash un-numeric input when used as PRNG seed, fixes #800 #801

Conversation

brontolosone commented Oct 14, 2024

What has been done to verify that this works as intended?

Why is this the best possible solution? Were any other approaches considered?

How does this change affect users? Describe intentional changes to behavior and behavior that could have accidentally been affected by code changes. In other words, what are the regression risks?

Do we need any specific form for testing your changes? If so, please attach one.

Does this change require updates to documentation? If so, please file an issue here and include the link below.