Multiple researchers at CHI suggested replacing the Azure API calls with an on-device model, which may greatly simplify setup without compromising output quality. However, we should investigate feasibility with current consumer devices as on-device models likely have significantly higher performance overhead, degrading VRSight's object detection performance.