Skip to content

Commit

Permalink
browsertrix: store crawl workflow metadata
Browse files Browse the repository at this point in the history
  • Loading branch information
makew0rld committed Jul 22, 2024
1 parent 143c6e6 commit 139c419
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 0 deletions.
1 change: 1 addition & 0 deletions docs/attributes.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,6 +35,7 @@ The majority of these attributes are set automatically upon ingestion, but all c
- `project_id`: the name for the project this asset was ingested under
- `project_path`: the path for the project within the sync folder
- `asset_origin_sig_key_name`: may exist if the ingestion process involved verifiying a known, named public key
- Browsertrix crawl info: `crawl_workflow_name`, `crawl_workflow_description`, `crawl_workflow_tags`

Encrypted files have the `encryption_type` attribute, currently always set to `secretstream`. See [encryption.md](./encryption.md) for more info.

Expand Down
3 changes: 3 additions & 0 deletions webhook/browsertrix.go
Original file line number Diff line number Diff line change
Expand Up @@ -260,6 +260,9 @@ func handleBrowsertrixEvent(w http.ResponseWriter, r *http.Request) {
metadataMap["asset_origin_type"] = []string{"wacz"}
metadataMap["project_id"] = projectId
metadataMap["file_name"] = e.Resources[0].Name
metadataMap["crawl_workflow_name"] = crawlInfo.Name
metadataMap["crawl_workflow_description"] = crawlInfo.Description
metadataMap["crawl_workflow_tags"] = crawlInfo.Tags

err = util.MoveFile(tempFilePath, filepath.Join(outputDirectory, cid))
if err != nil {
Expand Down

0 comments on commit 139c419

Please sign in to comment.