From 4ab217b268375014c554de2a8e451c91361d5a23 Mon Sep 17 00:00:00 2001 From: Olga Bulat Date: Wed, 13 Mar 2024 20:10:00 +0300 Subject: [PATCH] Update the Additional search views IP (#3868) * Update the Additional search views IP Signed-off-by: Olga Bulat * Update documentation/projects/proposals/additional_search_views/20230719-implementation_plan_additional_search_views.md Co-authored-by: zack <6351754+zackkrida@users.noreply.github.com> * Add changes from code review Signed-off-by: Olga Bulat * Update documentation on parameter validation Signed-off-by: Olga Bulat * Update documentation/projects/proposals/additional_search_views/20230719-implementation_plan_additional_search_views.md Co-authored-by: Krystle Salazar * Add changes from code review Signed-off-by: Olga Bulat --------- Signed-off-by: Olga Bulat Co-authored-by: zack <6351754+zackkrida@users.noreply.github.com> Co-authored-by: Krystle Salazar --- ...roject_proposal_additional_search_views.md | 2 +- ...ementation_plan_additional_search_views.md | 446 +++++++++-------- ...ementation_plan_additional_search_views.md | 450 ++++++++++++++++++ 3 files changed, 710 insertions(+), 188 deletions(-) create mode 100644 documentation/projects/proposals/additional_search_views/SUPERSEDED-20230719-implementation_plan_additional_search_views.md diff --git a/documentation/projects/proposals/additional_search_views/20230424-project_proposal_additional_search_views.md b/documentation/projects/proposals/additional_search_views/20230424-project_proposal_additional_search_views.md index fbd6b35c962..258df41940a 100644 --- a/documentation/projects/proposals/additional_search_views/20230424-project_proposal_additional_search_views.md +++ b/documentation/projects/proposals/additional_search_views/20230424-project_proposal_additional_search_views.md @@ -5,7 +5,7 @@ ## Reviewers - [x] @obulat -- [ ] @krysal +- [x] @krysal ## Project summary diff --git a/documentation/projects/proposals/additional_search_views/20230719-implementation_plan_additional_search_views.md b/documentation/projects/proposals/additional_search_views/20230719-implementation_plan_additional_search_views.md index b0d2195fc47..2bd93f874f7 100644 --- a/documentation/projects/proposals/additional_search_views/20230719-implementation_plan_additional_search_views.md +++ b/documentation/projects/proposals/additional_search_views/20230719-implementation_plan_additional_search_views.md @@ -10,7 +10,11 @@ - [x] @zackkrida -- [ ] @sarayourfriend +- [x] @sarayourfriend + +```{note} +The original version of this plan was significantly revised on 2024-03-01 due to problems discovered during the implementation work. See [Plan revisions](#plan-revisions) section for summary of changes, and [Original plan](/projects/proposals/additional_search_views/SUPERSEDED-20230719-implementation_plan_additional_search_views.md) for the original version. +``` ## Project links @@ -24,41 +28,53 @@ -API endpoints return all media with the selected tag, from the selected source -or by the selected creator, sorted by date added to Openverse. +API returns all media with the selected tag, from the selected source or by the +selected creator within a given source, sorted by date added to Openverse, when +`collection` parameter is set in the search endpoints. Sample API URLs: + +- `/v1//?collection=tag&tag=cat` +- `/v1//?collection=source&source=flickr` +- `/v1//?collection=creator&source=flickr&creator=Photographer` Frontend allows to browse media items by a selected creator, source, or with a -selected tag. +selected tag on `/collection` page. Sample frontend URLs: + +- `//collection?tag=cat`, +- `//collection?source=flickr` and +- `//collection?source=flickr&creator=Photographer`. -The single result pages link to these collection views; the external links are -also updated to clearly show that they are external. +These pages are indexed by search engines, have relevant SEO properties and can +be shared with an attractive thumbnail and relevant data. + +The single result pages are updated to add the links to source, creator and tag +collections. ## Step-by-step plan -1. Update the Elasticsearch index to enable exact matching of the `tag`, - `source` and `creator` fields (both the query analyzer and the index - analyzer). This will require reindexing. -2. Add API endpoints for exact matching of the `tag`, `source` and `creator` - fields. -3. Create the new components: `VCollectionHeader`, `VCollectionLink` and `VTag`. -4. Update the store and utils used to construct the API query to allow for - searching by `tag`, `creator` or `source`, in addition to the current search - by title/description/tags combination. -5. Add a switchable "additional_search_views" feature flag. -6. Create a page for `tag` / `creator` /`source` collections. The page should - handle fetching and updating the search store state. -7. Update the single result pages: tags area, the "creator" and "source" area +1. API changes: Add API collection ES query builder for exact matching of the + `tag`, `source` and `creator` fields to the search controller. Update the + request serializer to validate collection parameters. +2. Add a switchable "additional_search_views" feature flag. +3. Update the store and utils used to construct the API query to allow for + retrieving the collections by `tag`, `creator` or `source`. +4. Create a page for collections that handles parameter validation, media + fetching and setting relevant SEO properties. +5. Create the new components: `VCollectionHeader`, `VCollectionLink` and `VTag`. +6. Update the single result pages: tags area, the "creator" and "source" area under the main media item. -8. Add the Analytics event `VISIT_SOURCE_LINK` and update where - `VISIT_CREATOR_LINK` is sent. -9. Cleanup after the feature flag is removed: - - Remove conditional rendering on the single result pages. +7. Add the Analytics event `VISIT_SOURCE_LINK` and update where + `VISIT_CREATOR_LINK` is sent. Also update the `SELECT_SEARCH_RESULT`, + `REACH_RESULT_END` and `LOAD_MORE` events to include the new views. +8. Cleanup after the feature flag is removed: + - Remove conditional rendering (single result pages, sources page). - Remove the `additional_search_views` feature flag and `VMediaTag` component. + - Stabilize the `collection` query parameters in the API (remove `unstable__` + prefix) ## Step details -### 1. Search controller updates +### 1. API changes Currently, when filtering the search results, the API matches some query parameters in a fuzzy way: an item matches the query if the field value contains @@ -84,12 +100,14 @@ database rather than ES to access anything: - `creator` is not indexed in the API database, so a query against it will be very slow. +#### Search controller updates + To enable exact matching, we don't need any changes in Elasticsearch index because we already have the `.keyword` fields for `creator`, `source` and -`tags`. We just need to use them in the query. This will allow for exact -matching of the values (e.g. `bike` will not match `bikes` or `biking`), and -will probably make the search more performant since the fields and the query -won't need to be analyzed and/or stemmed. +`tags`. Using these fields in the `term` query will allow for exact matching of +the values (e.g. `bike` will not match `bikes` or `biking`), and will probably +make the search more performant since the fields and the query won't need to be +analyzed and/or stemmed. The search controller's `search` method should be refactored to be smaller and allow for more flexibility when creating the search query. The current @@ -111,35 +129,196 @@ for fuzzy matches. **Finally**, we apply the ranking and sorting parameters, and "highlight" the fields that were matched in the results. -The new search controller should allow for using different filters for the first -step and to not use the full-text search. We should also create a new serializer -for collection search requests. It should include the common parameters for -`list` requests, such as `page` and `page_size`, and the parameters for the -exact matches: `tag`, `creator` and `source`. +The search controller needs to be updated to extract the first 2 steps into a +`build_search_query` method. A new `build_collection_query` should be added and +used when the `collection` parameter is present. This method should create a +filter query for the relevant field and value. + +The pagination and dead link cleanup should be the same for additional search +queries as for the default search queries. + +#### Search request serializer updates + +_The initial version of this plan proposed to add new API endpoints and use path +parameters for the values (e.g., `/image/tag/cat`) to make the URLs more +readable, easier to share, will be easier to cache or perform cache invalidation +required by [#1969](https://github.com/WordPress/openverse/issues/1969). +However, since tags and creator names contain characters that are special for +path segments (`/`, `?` and `&`) that cannot be properly encoded, this approach +turned out to be not feasible. See +[PR#3793](https://github.com/WordPress/openverse/pull/3793) for attempts at +making the path parameters work_ + +```{note} +The new query parameters (`collection` and `tag`) will have `unstable__` prefix until this project is +launched. In the text below, for brevity, the prefix is omitted. +``` + +The collections will use the `search` endpoint with `collection` query parameter +set to `tag`, `creator` or `source`, and the following additional query +parameters: + +- `collection=tag` will require `tag` parameter to be set +- `collection=source` will require `source` parameter to be set +- `collection=creator` will require both `source` and `creator` parameter to be + set. + +This means that the existing `source` and `creator` parameters will be reused. +For the tag collection, the new singular `tag` parameter should be used, rather +than the existing plural `tags`, since we are only presenting a single tag. + +`MediaSearchRequestSerializer` should be updated to validate the `collection` +parameter. This validator will check that the request contains the necessary +additional parameters: `source` for `collection=source`, `creator` and `source` +for `collection=creator`, and `tag` for `collection=tag`. It will only validate +that the parameters _have_ values, but not the values themselves. + +The `MediaSearchRequestSourceSerializer` currently splits the `source` parameter +value by `,` and validates that each value is a valid source. This validator +will be updated: + +- If the `collection` parameter is not present, it will behave as before. +- If the `collection` parameter is set to `source` or `creator`, it will take + the value of `source` as is, and check if this value is present in the list of + valid sources, without splitting it by `,`. If the value is not valid, it will + return a 400 error, showing the invalid value, as well as a list of valid + sources for the media type. +- If the `collection` parameter is set to `tag`, it will return the value as-is, +- because it will be ignored in the search controller. + +The documentation for the `source` parameter should be updated to reflect that +it accepts a single source when there is a `collection` parameter set to +`source` or `creator`, and a comma-separated list of sources when there is no +`collection` parameter. Similarly, the documentation for `creator` parameter +should say that it accepts a comma-separated list of values when there is no `q` +parameter, and a single URI-encoded value when the `collection=creator` +parameter is present. The creator serializer does not need any changes as the +parsing (splitting by comma for regular searches and URI-decoding for `creator` +collections) is only done in the search controller. + +The additional documentation for the search parameters should be added as a +draft so that it's not published on the API documentation site until we launch +this project and remove the `unstable__` prefix. + +### 2. Add the `additional_search_views` feature flag + +The flag should be switchable, and off by default. This will mean that the +changes are visible on [staging](https://staging.openverse.org) when the flag is +switched on using the preferences page. When the features are stable, we will +turn the flag on to test the new views in production. After we conclude that the +project is successful, we will remove the flag and the conditional rendering. + +### 3. Nuxt store and API request changes + +We can reuse the search store as is for these pages. + +_Previously, the frontend search store had a `searchBy` filter that allowed to +search within the `creator` field. When `searchBy` value was set, the API `q` +parameter was replaced with the `=` API query parameter. +This filter was removed because `searchBy` is not strictly a filter that can be +toggled on or off_ + +#### Add `strategy` and `collectionParams` to `search` store + +The `strategy` parameter will be used to determine the API request query +parameters. If it is set to `"search"`, then the API query will be created using +the current approach. If it is set to `"collection"`, the query will be +constructed using the new method (`buildCollectionQuery`) to set `collection` +and other relevant parameters using the new `collectionParams` object in the +store. It will _not_ be setting the filter parameters such as license or +category, and will ignore such unsupported query parameters. + +### 4. Create collection pages + +We should add the following pages: + +- `/image/collection.vue` +- `/audio/collection.vue` + +#### Validation of the collection query parameters + +This page will use the `collection` middleware to validate the collection query +parameters: -### 2. New API endpoints +- if `collection` is set to `creator`, there should also exist a `source` + parameter +- `source` parameter should be validated to be an existing source using the + provider store +- `creator` and `tag` parameters should be decoded using `decodeURIComponent`. -The new routes should use path parameters instead of query parameters for the -`tag`, `creator` and `source` values. This will make the URLs more readable, -easier to share, will be easier to cache or perform cache invalidation required -by #1969. The path parameters should be URL encoded to preserve special -characters and spaces. +The middleware validates that the values of parameters is not empty, and that +the `source` parameter is an existing source in the provider store. If the +values are invalid, the 404 yellow error page is shown. -Instead of using query strings, we can describe the resource via the path: -`//source//creator/` is very clean, easy to read -and understand, and very easy to manage the cache for because it is a static -path. The source page can use the same route by leaving off the creator. This -removes the need to manage specific query params as well and would allow us to -add querying within these routes more easily in the future behind the regular q -parameter if we wanted. +#### Fetching the media + +This page should also update the state (`searchType`, `collectionParams` and +`strategy`) in the `search` store and handle fetching using `mediaStore`'s +`fetchMedia` method in the `useFetch` hook. + +The media collections should be updated (not as part of this project) to move +the load more methods to the page that fetches the media (i.e., `search.vue` or +`collection.vue`), and the `mediaStore` should be updated to remove the load +more methods. + +#### SEO + +This page should also have relevant SEO properties, and should be indexed by the +search engines. + +The following titles should be used for the pages: + +- "Images by Olga at Flickr" for the creator page +- "Audio from Wikimedia" for the source page +- "Images with the cat tag" for the tag page + +There are i18n consideration for these titles that we will work on during the +implementation. It is important to make the titles translatable, which is +difficult if the non-translatable dynamic names are used inside the sentence due +to different sentence structures. + +The generic Openverse thumbnail will be used. We could also generate a thumbnail +for the collection pages in the future, but this is not in scope for this +project. + +### 5. Update the single result pages + +All of these changes should be conditional on whether the +`additional_search_views` feature flag is enabled. -For the tag route, the singular `tag` rather than plural `tags` should be used -for legibility since we are presenting a single tag. +The Figma links for new designs: + +- [Image single result](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-63284&mode=dev) +- [Audio single result](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-63285&mode=dev) + +#### Update the `VCollectionLink` area on the single result page + +The content info line under the main item on the single result page should be +replaced with a section that has two buttons: one for a creator link and a +source link. This section should be horizontally scrollable on mobile. +Initially, we planned to implement a scroll-snap (example: +https://play.tailwindcss.com/AbfA33Za50), however, since the source and creator +names can be very long, a scroll snap can make the links unusable, as they will +always snap to the start of the source name and not allow scrolling past it. + +#### Use `VTag` with links for the tags + +The tags should be rendered using the `VTag` component with links to the tag +collection page. + +#### Update the tags area on the single result page + +The tags area should be collapsible to make long lists of tags collapsible: +https://github.com/WordPress/openverse/issues/2589 + +#### Add the information popover next to source and provider links + +The information popover should be added next to the source and provider links +that explains the difference between the source and provider. -The new views should use the same pagination and dead link cleanup as the search -views. +[**Figma link**](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56521&mode=dev) -### 3. New and updated components +### 6. New and updated components #### Extract the `VAudioCollection` component @@ -159,10 +338,12 @@ The header should also display the number of results, "251 audio files with the selected tag", "604 images provided by this source", "37 images by this creator in Hirshhorn Museum and Sculpture Garden". -_Note_: There are sources that only have works by one creator. In this case, we +```{note} +There are sources that only have works by one creator. In this case, we should probably still have two separate pages for the source and the creator, but we might want to add a note that the creator is the only one associated with this source. +``` **Figma links**: **creator** [desktop](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56323&mode=design&t=pN0PPAzlKQEKT9tJ-4) @@ -213,144 +394,16 @@ page and "provider" in the The audio page should be updated to match the image page: the `foreign_landing_url` link should be added to the "source", not provider. -### 4. Nuxt store and API request changes - -We can reuse the search store as is for these pages. Currently, the frontend can -perform searches by `source` parameter. If `searchBy` value is set, then the `q` -parameter is replaced with the `=` query parameter. - -For this project, we should add new values to the `searchBy` filter. - -The API request URL is constructed from the search store state in the -[`prepare-search-query-params` method](https://github.com/WordPress/openverse/blob/b4b46b903731870015c475d2c08eebef7ec6b25b/frontend/src/utils/prepare-search-query-params.ts#L22-L25). -We will need to update this method to use the `searchBy` filter value to -construct the API request path as described in the "API Changes" section. - -#### Update the `searchBy` filter - -The `searchBy` filter will be used to determine the shape of the API request. -While currently these parameters will be mutually exclusive (we can only search -by one of them), we might want to allow searching by multiple parameters in the -future. - -For other filters, we only use `toggle` method to update the value. However, for -`searchBy`, we need to be able to check one of the `searchBy` parameters, and -uncheck the others. To enable that, we should add a new `search` store method. - -If `searchBy` is set to `tag`, `creator` or `source`, then the media store -should create search path instead of the search query. So, instead of calling -`prepareSearchQuery` to create the query parameters, it should call -`prepareSearchPath` to create the path. - -```typescript -const searchPathOrQuery = searchBy - ? prepareSearchPath(searchParams, mediaType) - : prepareSearchQuery(searchParams, mediaType) - -const prepareSearchPath = ( - searchParams: Record, - mediaType: SupportedMediaType -) => { - let path - if (searchBy === "tag") { - path = `${mediaType}/tag/${searchTerm}` - } else { - path = `${mediaType}/source/${searchParams[`${mediaType}Provider`]}` - if (searchBy === "creator") { - path += `/creator/${searchTerm}` - } - } - return path -} -``` - -### 5. Add the `additional_search_views` feature flag - -The flag should be switchable, and off by default. - -### 6. Create a page for `tag` / `creator` /`source` collections. - -Nuxt allows creating nested dynamic routes like -`/pages/_collection/_mediaType/_term`. - -We should add the following pages: - -- `/pages/_mediaType/tag/_tag` -- `/pages/_mediaType/source/_source` -- `/pages/_mediaType/source/_source/creator/_creator` (this page might not be - needed as it might be handled by the source page) - -To make sure that the `mediaType`, `source` and `creator` parameters are valid, -this page should use the -[`validate` method](https://v2.nuxt.com/docs/components-glossary/validate/) to -make sure that and show an error page if necessary. - -```typescript -function validate({ params, $pinia }): boolean { - const { collection, mediaType, term } = params - // Check that collection is one of ["tag", "creator" or "source"], - // and mediaType is one of `supportedMediaTypes`. - // Check that `term` is correctly escaped. - // If the params are not valid, return `false` to show the error page. - return isValid ? true : false -} -``` - -This page should also update the state (`searchType`, `searchTerm` and -`searchBy` and `provider` filters) in the `search` store and handle fetching -using `mediaStore`'s `fetchMedia` method in the `useFetch` hook. - -Since it is not possible to change the path or query parameters from this page -client-side, fetching can be much simpler than on the current search page (that -has to watch for changes in the route and fetch if necessary). - -This page should use `VCollectionHeader` and the image grid or the audio -collection. - -### 7. Update the single result pages - -All of these changes should be conditional on whether the -`additional_search_views` feature flag is enabled. - -The Figma links for new designs: - -- [Image single result](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-63284&mode=dev) -- [Audio single result](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-63285&mode=dev) - -#### Update the `VCollectionLink` area on the single result page - -The content info line under the main item on the single result page should be -replaced with a section that has two buttons: one for a creator link and a -source link. This section should be horizontally scrollable on mobile. It should -implement a scroll-snap (example: https://play.tailwindcss.com/AbfA33Za50) - -#### Use `VTag` with links for the tags - -The tags should be rendered using the `VTag` component with links to the tag -collection page. - -#### Update the tags area on the single result page - -The tags area should be collapsible to make long lists of tags collapsible: -https://github.com/WordPress/openverse/issues/2589 - -#### Add the information popover next to source and provider links - -The information popover should be added next to the source and provider links -that explains the difference between the source and provider. - -[**Figma link**](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56521&mode=dev) - -### 8. Additional analytics events +### 7. Additional analytics events Some existing events will already track the new views events. The views can be tracked as page views, so no separate event is necessary. The only way to access the pages is directly or via links on the single results, which will all be captured by standard page visits. Clicking on the items will be tracked as `SELECT_SEARCH_RESULT` events. These events can be narrowed by pathname -(`/search` or `/tag\*`, for example) to determine where the event occurred. +(`/search` or `/collection`, for example) to determine where the event occurred. -Two analytics events should be added or updated: +Analytics events should be added or updated: - The clicks on external creator link in the `VCollectionHeader` should be tracked as `VISIT_CREATOR_LINK` events. @@ -358,11 +411,18 @@ Two analytics events should be added or updated: - We should also a special event for visiting source `VISIT_SOURCE_LINK`, similar to `VISIT_CREATOR_LINK`. -### 9. Cleanup after the feature flag is enabled in production +- The `REACH_RESULT_END`, `LOAD_MORE` and `SELECT_SEARCH_RESULT` events should + add strategy (`search`/`tag`/`creator`/`source`) and set `query` to tag name, + source name or source/creator pair: `cat`, `flickr` or `flickr/Olga`. + +### 8. Cleanup after the feature flag is enabled in production After the feature flag is enabled in production, we should remove the conditional rendering on the single result pages and remove the -`additional_search_views` feature flag and (old) `VMediaTag` component. +`additional_search_views` feature flag and (old) `VMediaTag` component.` + +Remove the `unstable__` prefix from the `collection` and `tag` query parameters +in the API. ### Tests @@ -414,6 +474,11 @@ not dependent on anything. The work on the single result pages (step 7) can be done in parallel with the work on the collection pages (step 6), but should follow the previous steps. +The new frontend views can use the existing query parameters (`source`, +`creator` and `tags` - instead of `tag`) until the API changes are implemented +since this _will_ return _some_ relevant results, and changing the query +parameter names is easy. + ## Blockers @@ -441,6 +506,13 @@ The biggest risk I see is that this project might be seen as an "invitation" to scraping. Hopefully, frontend rate limiting and the work on providing the dataset would minimize such risks. -## Prior art +## Plan revisions - +This plan was significantly revised on 2024-03-01 due to problems discovered +during the implementation of the path parameters. The API changes were updated +to use query parameters instead of path parameters, and to describe the new +request serializer. The frontend store and API request changes were updated to +use the new query parameters. The collection pages descriptions were updated to +reflect the new query parameters. The steps were reordered to reflect the new +implementation plan. More details on SEO was added in the collection pages +section. The analytics event parameters were updated. diff --git a/documentation/projects/proposals/additional_search_views/SUPERSEDED-20230719-implementation_plan_additional_search_views.md b/documentation/projects/proposals/additional_search_views/SUPERSEDED-20230719-implementation_plan_additional_search_views.md new file mode 100644 index 00000000000..47497d36e7a --- /dev/null +++ b/documentation/projects/proposals/additional_search_views/SUPERSEDED-20230719-implementation_plan_additional_search_views.md @@ -0,0 +1,450 @@ +# SUPERSEDED 2023-07-20 Implementation Plan: Additional Search Views + +**Author**: @obulat + + + + +## Reviewers + + + +- [x] @zackkrida +- [x] @sarayourfriend + +```{warning} +This is the original version. For the current version, see [the updated plan](/projects/proposals/additional_search_views/20230719-implementation_plan_additional_search_views.md). +``` + +## Project links + + + +- [Project Thread](https://github.com/WordPress/openverse/issues/410) +- [Project Proposal](https://docs.openverse.org/projects/proposals/additional_search_views/20230424-project_proposal_additional_search_views.html) +- [Milestone](https://github.com/WordPress/openverse/milestone/17) + +## Expected Outcomes + + + +API endpoints return all media with the selected tag, from the selected source +or by the selected creator, sorted by date added to Openverse. + +Frontend allows to browse media items by a selected creator, source, or with a +selected tag. + +The single result pages link to these collection views; the external links are +also updated to clearly show that they are external. + +## Step-by-step plan + +1. Update the Elasticsearch index to enable exact matching of the `tag`, + `source` and `creator` fields (both the query analyzer and the index + analyzer). This will require reindexing. +2. Add API endpoints for exact matching of the `tag`, `source` and `creator` + fields. +3. Create the new components: `VCollectionHeader`, `VCollectionLink` and `VTag`. +4. Update the store and utils used to construct the API query to allow for + searching by `tag`, `creator` or `source`, in addition to the current search + by title/description/tags combination. +5. Add a switchable "additional_search_views" feature flag. +6. Create a page for `tag` / `creator` /`source` collections. The page should + handle fetching and updating the search store state. +7. Update the single result pages: tags area, the "creator" and "source" area + under the main media item. +8. Add the Analytics event `VISIT_SOURCE_LINK` and update where + `VISIT_CREATOR_LINK` is sent. +9. Cleanup after the feature flag is removed: + - Remove conditional rendering on the single result pages. + - Remove the `additional_search_views` feature flag and `VMediaTag` + component. + +## Step details + +### 1. Search controller updates + +Currently, when filtering the search results, the API matches some query +parameters in a fuzzy way: an item matches the query if the field value contains +the query string as a separate word. When indexing the items, we "analyze" them, +which means that we split the field values by whitespace and stem them. We also +do the same to the query string. This means that the query string "bike" will +match the field value "bikes", "biking", or "bike shop". + +For these pages, however, we need an exact match. + +One alternative implementation considered when writing this plan was to use the +database instead of the Elasticsearch to get the results. This would make it +easy to get the exact matches. However, there are some problems with using the +database rather than ES to access anything: + +- The database does not cache queries in the same way that ES does. Repeated + queries will not necessarily be as efficient as from ES. +- The database does not score documents at all, so the order will different + dramatically to the way that ES would order the documents. That's an issue + with respect to popularity data today already, but will become even more of an + issue if we start to score documents based on other metrics as theorised by + our search relevancy discussions. +- `creator` is not indexed in the API database, so a query against it will be + very slow. + +To enable exact matching, we don't need any changes in Elasticsearch index +because we already have the `.keyword` fields for `creator`, `source` and +`tags`. We just need to use them in the query. This will allow for exact +matching of the values (e.g. `bike` will not match `bikes` or `biking`), and +will probably make the search more performant since the fields and the query +won't need to be analyzed and/or stemmed. + +The search controller's `search` method should be refactored to be smaller and +allow for more flexibility when creating the search query. The current +implementation of query building consists of 3 steps. + +We **first** apply the `filters`: if the query string has any parameters other +than `q`, we use them for exact matches that must be in the search results, or +must be excluded from the search results (if the parameter starts with +`exclude_`). + +**Then**, if `q` parameter is present, we apply the `q` parameter, which is a +full-text search within `tags`, `title` and `description` fields. This is a +fuzzy search, which means that the query string is stemmed and analyzed, and the +field values are stemmed and analyzed, and the documents are scored based on the +relevance of the match. If `q` is not present, but one of the +`creator`/`source`/`tags` parameter is present, we search within those fields +for fuzzy matches. + +**Finally**, we apply the ranking and sorting parameters, and "highlight" the +fields that were matched in the results. + +The new search controller should allow for using different filters for the first +step and to not use the full-text search. We should also create a new serializer +for collection search requests. It should include the common parameters for +`list` requests, such as `page` and `page_size`, and the parameters for the +exact matches: `tag`, `creator` and `source`. + +### 2. New API endpoints + +The new routes should use path parameters instead of query parameters for the +`tag`, `creator` and `source` values. This will make the URLs more readable, +easier to share, will be easier to cache or perform cache invalidation required +by #1969. The path parameters should be URL encoded to preserve special +characters and spaces. + +Instead of using query strings, we can describe the resource via the path: +`//source//creator/` is very clean, easy to read +and understand, and very easy to manage the cache for because it is a static +path. The source page can use the same route by leaving off the creator. This +removes the need to manage specific query params as well and would allow us to +add querying within these routes more easily in the future behind the regular q +parameter if we wanted. + +For the tag route, the singular `tag` rather than plural `tags` should be used +for legibility since we are presenting a single tag. + +The new views should use the same pagination and dead link cleanup as the search +views. + +### 3. New and updated components + +#### Extract the `VAudioCollection` component + +Currently, it is not possible to reuse the audio collection from the audio +search result page because it is a part of the `audio.vue` page. We should +extract the part that shows the loading skeleton, the column of `VAudioTrack` +rows and the Load more section into `VAudioCollection` component. This component +will be reused in the audio search page and on the Additional search views. + +#### Add a `VCollectionHeader` component + +The header should have an icon (tag, creator or source) and the name of the +tag/creator/source. For source and the creator, there should be an external link +button if it's available (not all creators have urls). + +The header should also display the number of results, "251 audio files with the +selected tag", "604 images provided by this source", "37 images by this creator +in Hirshhorn Museum and Sculpture Garden". + +_Note_: There are sources that only have works by one creator. In this case, we +should probably still have two separate pages for the source and the creator, +but we might want to add a note that the creator is the only one associated with +this source. + +**Figma links**: **creator** +[desktop](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56323&mode=design&t=pN0PPAzlKQEKT9tJ-4) +and +[mobile](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56362&mode=design&t=suLIyJHNmZrM0mPH-4), +**source** +[desktop](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56381&mode=design&t=suLIyJHNmZrM0mPH-4) +and +[mobile](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56421&mode=design&t=suLIyJHNmZrM0mPH-4), +**tag** +[desktop](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1216-58402&mode=design&t=suLIyJHNmZrM0mPH-4) +and +[mobile](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56457&mode=design&t=suLIyJHNmZrM0mPH-4). + +#### Add `VCollectionLink` component + +This component should be a `VButton` with `as="VLink"`, should have an icon, and +should accept a localized link to the creator or source page. + +**Figma link**: +[creator and source buttons](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56972&mode=dev) + +#### Create a new `VTag` component to be a `VButton` wrapping a `VLink` + +The +[`VMediaTag` component](https://github.com/WordPress/openverse/blob/c7b76139d5a001ce43bde27805be5394e5732d1a/frontend/src/components/VMediaTag/VMediaTag.vue) +should be updated to be a `VButton` wrapping a `VLink`, and should match the +design in the +[Figma mockups](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56515&mode=design&t=nCX20BtJYqMOFAQm-4). +The component should link to the localized page for the tag collection. + +#### Update links in the "information" section + +The links to creator in the +[image](https://github.com/WordPress/openverse/blob/35f08e5710d59f6c10f0cf54103fcce151adfefe/frontend/src/pages/image/_id/index.vue#L62-L73) +and +[audio](https://github.com/WordPress/openverse/blob/35f08e5710d59f6c10f0cf54103fcce151adfefe/frontend/src/components/VAudioTrack/layouts/VFullLayout.vue#L32-L41) +single result pages Information section should have an "external link" icon. + +Audio creator link should also be updated to match the image creator link. It +should be a conditional component: `VLink` if the `creator_url` is truthy and +`span` if the `creator_url` is falsy. + +Currently, the `foreign_landing_url` is linked to the "source" in the +[image](https://github.com/WordPress/openverse/blob/35f08e5710d59f6c10f0cf54103fcce151adfefe/frontend/src/components/VImageDetails/VImageDetails.vue#L26-L28) +page and "provider" in the +[audio page](https://github.com/WordPress/openverse/blob/35f08e5710d59f6c10f0cf54103fcce151adfefe/frontend/src/components/VAudioDetails/VAudioDetails.vue#L61C1-L63). +The audio page should be updated to match the image page: the +`foreign_landing_url` link should be added to the "source", not provider. + +### 4. Nuxt store and API request changes + +We can reuse the search store as is for these pages. Currently, the frontend can +perform searches by `source` parameter. If `searchBy` value is set, then the `q` +parameter is replaced with the `=` query parameter. + +For this project, we should add new values to the `searchBy` filter. + +The API request URL is constructed from the search store state in the +[`prepare-search-query-params` method](https://github.com/WordPress/openverse/blob/b4b46b903731870015c475d2c08eebef7ec6b25b/frontend/src/utils/prepare-search-query-params.ts#L22-L25). +We will need to update this method to use the `searchBy` filter value to +construct the API request path as described in the "API Changes" section. + +#### Update the `searchBy` filter + +The `searchBy` filter will be used to determine the shape of the API request. +While currently these parameters will be mutually exclusive (we can only search +by one of them), we might want to allow searching by multiple parameters in the +future. + +For other filters, we only use `toggle` method to update the value. However, for +`searchBy`, we need to be able to check one of the `searchBy` parameters, and +uncheck the others. To enable that, we should add a new `search` store method. + +If `searchBy` is set to `tag`, `creator` or `source`, then the media store +should create search path instead of the search query. So, instead of calling +`prepareSearchQuery` to create the query parameters, it should call +`prepareSearchPath` to create the path. + +```typescript +const searchPathOrQuery = searchBy + ? prepareSearchPath(searchParams, mediaType) + : prepareSearchQuery(searchParams, mediaType) + +const prepareSearchPath = ( + searchParams: Record, + mediaType: SupportedMediaType +) => { + let path + if (searchBy === "tag") { + path = `${mediaType}/tag/${searchTerm}` + } else { + path = `${mediaType}/source/${searchParams[`${mediaType}Provider`]}` + if (searchBy === "creator") { + path += `/creator/${searchTerm}` + } + } + return path +} +``` + +### 5. Add the `additional_search_views` feature flag + +The flag should be switchable, and off by default. + +### 6. Create a page for `tag` / `creator` /`source` collections. + +Nuxt allows creating nested dynamic routes like +`/pages/_collection/_mediaType/_term`. + +We should add the following pages: + +- `/pages/_mediaType/tag/_tag` +- `/pages/_mediaType/source/_source` +- `/pages/_mediaType/source/_source/creator/_creator` (this page might not be + needed as it might be handled by the source page) + +To make sure that the `mediaType`, `source` and `creator` parameters are valid, +this page should use the +[`validate` method](https://v2.nuxt.com/docs/components-glossary/validate/) to +make sure that and show an error page if necessary. + +```typescript +function validate({ params, $pinia }): boolean { + const { collection, mediaType, term } = params + // Check that collection is one of ["tag", "creator" or "source"], + // and mediaType is one of `supportedMediaTypes`. + // Check that `term` is correctly escaped. + // If the params are not valid, return `false` to show the error page. + return isValid ? true : false +} +``` + +This page should also update the state (`searchType`, `searchTerm` and +`searchBy` and `provider` filters) in the `search` store and handle fetching +using `mediaStore`'s `fetchMedia` method in the `useFetch` hook. + +Since it is not possible to change the path or query parameters from this page +client-side, fetching can be much simpler than on the current search page (that +has to watch for changes in the route and fetch if necessary). + +This page should use `VCollectionHeader` and the image grid or the audio +collection. + +### 7. Update the single result pages + +All of these changes should be conditional on whether the +`additional_search_views` feature flag is enabled. + +The Figma links for new designs: + +- [Image single result](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-63284&mode=dev) +- [Audio single result](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-63285&mode=dev) + +#### Update the `VCollectionLink` area on the single result page + +The content info line under the main item on the single result page should be +replaced with a section that has two buttons: one for a creator link and a +source link. This section should be horizontally scrollable on mobile. It should +implement a scroll-snap (example: https://play.tailwindcss.com/AbfA33Za50) + +#### Use `VTag` with links for the tags + +The tags should be rendered using the `VTag` component with links to the tag +collection page. + +#### Update the tags area on the single result page + +The tags area should be collapsible to make long lists of tags collapsible: +https://github.com/WordPress/openverse/issues/2589 + +#### Add the information popover next to source and provider links + +The information popover should be added next to the source and provider links +that explains the difference between the source and provider. + +[**Figma link**](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?type=design&node-id=1200-56521&mode=dev) + +### 8. Additional analytics events + +Some existing events will already track the new views events. The views can be +tracked as page views, so no separate event is necessary. The only way to access +the pages is directly or via links on the single results, which will all be +captured by standard page visits. Clicking on the items will be tracked as +`SELECT_SEARCH_RESULT` events. These events can be narrowed by pathname +(`/search` or `/tag\*`, for example) to determine where the event occurred. + +Two analytics events should be added or updated: + +- The clicks on external creator link in the `VCollectionHeader` should be + tracked as `VISIT_CREATOR_LINK` events. + +- We should also a special event for visiting source `VISIT_SOURCE_LINK`, + similar to `VISIT_CREATOR_LINK`. + +### 9. Cleanup after the feature flag is enabled in production + +After the feature flag is enabled in production, we should remove the +conditional rendering on the single result pages and remove the +`additional_search_views` feature flag and (old) `VMediaTag` component. + +### Tests + +We should add visual-regression tests for the new views. To minimize flakiness +due to slow loading of the images, we should probably use the +[ `{ filter: brightness(0%); }` trick](https://github.com/WordPress/openverse/blob/b4b46b903731870015c475d2c08eebef7ec6b25b/frontend/test/playwright/visual-regression/pages/pages.spec.ts#L49-L55) +for the images on the page. + +The search store tests should be updated to reflect the changes to the filters. + +## Dependencies + +### Infrastructure + + + +These views potentially might cause more load on our infrastructure due to +increase in scraping activity. + +### Tools & packages + + + +No new tools or packages are necessary. + +### Other projects or work + + + +Not applicable. + +## Design + + + +[Figma designs in the dev mode](https://www.figma.com/file/niWnCgB7K0Y4e4mgxMrnRC/Additional-search-views?node-id=22%3A13656&mode=dev) + +## Parallelizable streams + + + +The API changes can be done independently of the frontend changes, although they +should be finished before the final testing of the frontend changes. + +Adding the new components (step 3), Nuxt store update (step 4) and the +`additional_search_views` feature flag (step 5) can be done in parallel, and are +not dependent on anything. + +The work on the single result pages (step 7) can be done in parallel with the +work on the collection pages (step 6), but should follow the previous steps. + +## Blockers + + + +The main blocker could be the maintainer capacity. + +## Accessibility + + + +We should make sure that the search titles are accessible, and the pages clearly +indicate the change of context. + +## Rollback + + + +To roll back the changes, we would need to set the feature flag to `OFF`. + +## Risks + + + +The biggest risk I see is that this project might be seen as an "invitation" to +scraping. Hopefully, frontend rate limiting and the work on providing the +dataset would minimize such risks. + +## Prior art + +