Expose cloud and edge inference speed metrics #173
Conversation
As a general comment: in the past, @bibireata and others have asked us to remove these performance metrics from the cloud inference response. Perhaps we should not expose them for cloud inference.
landingai/predict.py
# performance_metrics keeps performance metrics for the last call to _do_inference()
performance_metrics: Dict[str, int] = {}
Why store it as a global variable? Why not associate it with a Predictor instance?
I guess my brain was running low on creativity 😞 ... I changed it to a private variable on the class... thanks!
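A minimal sketch of the resulting shape, assuming hypothetical names (the metric keys and the get_metrics() accessor are illustrative, not confirmed library API): the metrics dict moves from module scope to a private attribute on each Predictor instance.

from typing import Dict

class Predictor:
    def __init__(self) -> None:
        # Per-instance metrics for the last call to _do_inference(),
        # replacing the module-level global.
        self._performance_metrics: Dict[str, int] = {}

    def _do_inference(self) -> Dict[str, int]:
        # ... run inference, timing each stage (keys are illustrative) ...
        self._performance_metrics = {
            "decoding_ms": 0,
            "inference_ms": 0,
            "postprocessing_ms": 0,
        }
        return self._performance_metrics

    def get_metrics(self) -> Dict[str, int]:
        # Expose the metrics recorded by the most recent inference call.
        return self._performance_metrics

Keeping the dict on the instance also avoids cross-talk when multiple Predictor objects run concurrently.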
Regarding the comment about performance metrics: in the long term, I think we need a tutorial on how to optimize models for speed and how to use these metrics to profile and find problems. For now they are a bit obscure, but at least we now expose them as part of the Predictor class 🤷‍♂️
LGTM
This change gives users a breakdown of the time spent during inference, which is useful when experimenting with different inference pipelines.
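A hypothetical usage sketch of what that looks like from the caller's side. The endpoint ID and API key values are placeholders, and the get_metrics() name is an assumption based on this discussion, not confirmed API.

# Assumes a Predictor exposing the metrics from its last inference call.
from PIL import Image

from landingai.predict import Predictor

predictor = Predictor(endpoint_id="<your-endpoint-id>", api_key="<your-api-key>")
frame = Image.open("example.jpg")
predictions = predictor.predict(frame)

# Inspect where inference time was spent in the last call
# (e.g. decoding vs. model inference vs. postprocessing).
print(predictor.get_metrics())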