-
Notifications
You must be signed in to change notification settings - Fork 2
Description
Idea is to add total number of pages, whether or not parseAsSlides was true, and the size of the document to each document here:
Motivation is that it'd be nice to know what documents are really heavy that a course has uploaded.
For size of the document (in MB), that would be pretty easy to obtain since I think I save that on the chatbot-doc-pdf.entity.
For total number of pages... this might (?) be easy to do, but I think the main challenge here would be the backend since I don't think we collect that data in the chatbot document aggregates right now, so a funky migration could be required that estimates number of pages based on data from its chunks. Most of the work here would be backend work on the chatbot repo. @bhunt02 would know more
For parseAsSlides, I think we only use that during upload and then discard it and I don't think we save that information... but we should!