New function: gcLocatorCreate #63

camillegiuliano · 2025-08-27T16:58:08Z

This adds a new function called gcLocatorCreate.
This function builds the gcLocator raster layer. It was built around the data we have for Saskatchewan, where we already had productivity class and spatial unit raster layers, along with a growth curve lookup CSV that assigns the correct gcid to each combination of productivity class, spatial unit ID, and leading species for the province. Whenever I tried to run a new data source, I had to rebuild the gcLocator file, so I figured a quick function would be simplest moving forward.

Currently the function only works if all 3 raster layers are in the same CRS and have the same extent. I can either add a check that stops the function and explicitly tells the user when these don't match, or add an if/else that reprojects so that all the layers match. I'm leaning towards just a check, letting users fix their own files rather than forcing a projection change.
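The "check and stop" option could look roughly like the sketch below. It is only an illustration, not the code in this PR: `checkLayersMatch()` is a hypothetical helper, and each layer is represented by a plain list with assumed `crs` and `ext` fields so the example runs in base R. With real rasters, an existing comparison such as `terra::compareGeom()` may serve the same purpose.

```r
# Hypothetical helper: stop with an explicit message if any layer's CRS or
# extent differs from the first layer's. Layers are stand-in lists here,
# not real raster objects.
checkLayersMatch <- function(layers) {
  refName <- names(layers)[1]
  ref <- layers[[1]]
  for (nm in names(layers)[-1]) {
    lyr <- layers[[nm]]
    if (!identical(lyr$crs, ref$crs)) {
      stop("CRS of '", nm, "' does not match '", refName, "'", call. = FALSE)
    }
    if (!isTRUE(all.equal(lyr$ext, ref$ext))) {
      stop("Extent of '", nm, "' does not match '", refName, "'", call. = FALSE)
    }
  }
  invisible(TRUE)
}

# Assumed example metadata (extent as xmin, xmax, ymin, ymax)
layers <- list(
  prodClass   = list(crs = "EPSG:3979", ext = c(0, 100, 0, 100)),
  spatialUnit = list(crs = "EPSG:3979", ext = c(0, 100, 0, 100)),
  leadSpecies = list(crs = "EPSG:4326", ext = c(0, 100, 0, 100))
)

# The mismatched CRS is reported instead of being silently reprojected
result <- tryCatch(checkLayersMatch(layers), error = function(e) conditionMessage(e))
```

Stopping with a message leaves the decision about reprojection to the user, which is the behaviour favoured above.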

With this function, you can build a gcLocator layer as long as you have a growth curve lookup table and raster layers for productivity class, spatial unit IDs, and leading species. This should hopefully make using new data sources in traditional (non-LandR) spadesCBM simpler.
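As a rough illustration of the idea (not the actual gcLocatorCreate code), the lookup can be thought of as a per-cell key match. In this sketch the three rasters are stand-in matrices of identical dimensions, and `buildGcLocator()`, the column names, and all values are hypothetical.

```r
# Stand-in "rasters": matrices with identical dimensions (values are made up)
prodClass   <- matrix(c(1, 2, 1, 2), nrow = 2)
spatialUnit <- matrix(c(27, 27, 28, 28), nrow = 2)
leadSpecies <- matrix(c("Pice_mar", "Pinu_ban", "Pice_mar", NA), nrow = 2)

# Hypothetical growth curve lookup table: one gcid per combination
gcLookup <- data.frame(
  prodClass       = c(1, 2, 1),
  spatial_unit_id = c(27, 27, 28),
  species         = c("Pice_mar", "Pinu_ban", "Pice_mar"),
  gcid            = c(101L, 102L, 103L),
  stringsAsFactors = FALSE
)

buildGcLocator <- function(prodClass, spatialUnit, leadSpecies, gcLookup) {
  # All layers must line up cell for cell
  stopifnot(identical(dim(prodClass), dim(spatialUnit)),
            identical(dim(prodClass), dim(leadSpecies)))
  # Build one key per cell and one key per lookup row, then match them
  cellKey   <- paste(prodClass, spatialUnit, leadSpecies, sep = "|")
  lookupKey <- paste(gcLookup$prodClass, gcLookup$spatial_unit_id,
                     gcLookup$species, sep = "|")
  gcid <- gcLookup$gcid[match(cellKey, lookupKey)]
  # Cells with no matching combination (or missing inputs) stay NA
  matrix(gcid, nrow = nrow(prodClass), ncol = ncol(prodClass))
}

gcLocator <- buildGcLocator(prodClass, spatialUnit, leadSpecies, gcLookup)
```

Cells whose combination is absent from the lookup table come out as NA, which makes missing growth curves easy to spot.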

I'm setting this as a draft for now so I can settle on what to do if extents/CRS don't match and get the documentation in order. The package version numbers will also be wildly different once some of the larger plotting PRs go through later, so I haven't changed that on my end.

suz-estella · 2025-08-27T17:20:56Z

I think there's a chance that we don't need this function since CBM_dataPrep and CBM_vol2biomass_SK can accept multiple columns for curveID. e.g. (with most arguments omitted):

setupProject(
  ## omit the gcIndexLocator argument
  curveID = c("speciesId", "prodclass"),
  cohortLocators = list(
    speciesId = leadSpeciesRaster,
    prodclass = siteProductivityRaster
  )
)

This will give you a cohortDT table that includes the columns spatial_unit_id, speciesId, and prodclass. As long as userGcMeta and userGcM3 also have the speciesId and prodclass columns, CBM_vol2biomass_SK will generate the gcMeta and growth_increments tables with a gcids column holding a unique ID for every spatial_unit_id, speciesId, and prodclass combination defined in the userGcSPU input object (created by CBM_dataPrep). This new gcids column is also added to cohortDT.
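To make the "one ID per combination" idea concrete, here is a minimal base R sketch. It is not how CBM_vol2biomass_SK implements this; the table contents are made up and only the column names come from the comment above.

```r
# Made-up cohort table with the columns described above
cohortDT <- data.frame(
  spatial_unit_id = c(27, 27, 28, 27),
  speciesId       = c(1, 2, 1, 1),
  prodclass       = c("G", "M", "G", "G"),
  stringsAsFactors = FALSE
)

curveID <- c("speciesId", "prodclass")
keyCols <- c("spatial_unit_id", curveID)

# One key per row; rows with the same combination get the same ID
key <- do.call(paste, c(cohortDT[keyCols], sep = "|"))
cohortDT$gcids <- match(key, unique(key))
```

Rows 1 and 4 share a combination and therefore the same gcids value, while swapping one source column (for example speciesId) only changes the keys built from it.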

More info here: PredictiveEcology/CBM_vol2biomass_SK#27

camillegiuliano · 2025-08-27T17:55:40Z

That's a good point. It could work like this too, and I could get rid of this function entirely, unless we want a tangible gcLocator raster for whatever reason (in which case we can probably go about it a different way too). I'll look into doing a run this way with the SCANFI files.

Currently our userGcMeta and userGcM3 tables don't have prodClass (in fact, I don't think productivity class is present anywhere in current spadesCBM runs). I could update them to have that column, though. I've been wanting to update those SK tables to include all the SK gcid options for a while, so that we don't run into missing growth curves whenever we use a new data source, or hit the error Dominique hit last week where the two files didn't match. If I make all those updates, we can rely on the raw productivity class and leading species files when running any data source in SK, and they should all run without too much effort. At that point we could also get rid of the gcIndex file we currently use for CASFRI, since that file was built the same way as the function here anyway.

suz-estella · 2025-08-27T18:32:05Z

Definitely could still be a nice function to have in our repertoire regardless!

One thing that is nice about using multiple columns from "raw" sources is that it leads to more reproducible code - no need to ask "how was this gcIndexLocator raster made?" It also makes it easy to swap just one source - e.g. speciesID - to see how it changes things.

camillegiuliano · 2025-09-02T18:20:54Z

So... weird update here: I ran the SCANFI data with the edited curveID rather than the gcIndex raster I'd built, and it runs with no real issues, EXCEPT that the results somehow differ between the two. I'm investigating what is happening here.
I did find out that the SCANFI age raster was using decimals for ages at some point, so it's possible I'm looking at old results from when I was using the decimal version.

cboisvenue · 2025-09-03T17:22:49Z

Good work here.
Here is my input: I think we absolutely need to use curveID, since what defines which curve gets used where will be different for each study area. As we get better remote sensing information, the columns in curveID may change to things I can't even think of yet.
Note: the changes in results are expected if we change what curve is used where. I would be checking that CBM_vol2biomass is running correctly, with the corrections to curves making sense. I am happy to help with this if needed.

suz-estella · 2025-09-03T17:34:10Z

Looking into this a bit: I put a temporary stop here to ensure that curveID is just 1 column: https://github.com/PredictiveEcology/CBM_vol2biomass_SK/blob/23d767afc9c221bc9ef582d0f79b1060b173e882/CBM_vol2biomass_SK.R#L151

I threw this in because quite a bit of the code that follows in the Init event treats curveID as a length 1 vector. However, it would likely not take much time to update the event to allow for more than 1 column. I think it would be a worthwhile thing to do.
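One possible shape for a relaxed check, sketched in base R: instead of stopping when curveID has more than one element, verify that every curveID column exists in the table. `checkCurveID()` is a hypothetical helper and the table below is made up; this is not the code in CBM_vol2biomass_SK.

```r
# Hypothetical replacement for a "curveID must be length 1" stop:
# accept any number of columns, but require each one to exist in the table.
checkCurveID <- function(curveID, userGcMeta) {
  stopifnot(is.character(curveID), length(curveID) >= 1)
  missingCols <- setdiff(curveID, names(userGcMeta))
  if (length(missingCols) > 0) {
    stop("userGcMeta is missing curveID column(s): ",
         paste(missingCols, collapse = ", "), call. = FALSE)
  }
  invisible(TRUE)
}

# Made-up table that has speciesId but no prodclass column
userGcMeta <- data.frame(speciesId = c(1, 2), species = c("A", "B"))

ok  <- checkCurveID("speciesId", userGcMeta)
msg <- tryCatch(checkCurveID(c("speciesId", "prodclass"), userGcMeta),
                error = function(e) conditionMessage(e))
```

A check like this would also flag the case mentioned earlier in the thread, where the userGcMeta and userGcM3 tables do not yet carry a productivity class column.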

camillegiuliano added 4 commits August 27, 2025 09:02

add param descriptions to cumPoolsCreate

bd026b2

new function: gcLocatorCreate

ca0d03d

gcLocatorCreate: added description

2c590a0

doc update

80a43c7

camillegiuliano requested review from cboisvenue and suz-estella August 27, 2025 16:58

gcLocatorCreate fix

f412e1a

camillegiuliano added 2 commits August 27, 2025 15:15

gcLocatorCreate: adds a check if rasters share the same CRS

a99936a

gcLocatorCreate: doc update

17d86ef

suz-estella mentioned this pull request Sep 3, 2025

curveID can now represent multiple columns in userGcMeta PredictiveEcology/CBM_vol2biomass_SK#28

Merged
