Skip to content

Add more info to each Chatbot Document #454

@AdamFipke

Description

@AdamFipke

Idea is to add total number of pages, whether or not parseAsSlides was true, and the size of the document to each document here:

Image

Motivation is that it'd be nice to know what documents are really heavy that a course has uploaded.

For size of the document (in MB), that would be pretty easy to obtain since I think I save that on the chatbot-doc-pdf.entity.

For total number of pages... this might (?) be easy to do, but I think the main challenge here would be the backend since I don't think we collect that data in the chatbot document aggregates right now, so a funky migration could be required that estimates number of pages based on data from its chunks. Most of the work here would be backend work on the chatbot repo. @bhunt02 would know more

For parseAsSlides, I think we only use that during upload and then discard it and I don't think we save that information... but we should!

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions