script to continuously evaluate elser #2670

davidkyle · 2024-05-22T16:06:13Z

First download the elser model locally. Either

The script runs pytorch_inference, loads the model then continuously runs inference on it. Logging is to std out, the model output is written to a json file. Every 100 request the script asks the pytorch_inference how much memory it is using and this is written to the same json file. grep mem out.json will show that data.

Run with

python3 signal9.py '/PATH/TO/elser_2/elser_model_2.pt' --num_allocations=4

--num_threads_per_allocation and --num_allocations are the parameters to tweak. Increasing either of those will make inference faster and changes in memory should be seen sooner.

jonathan-buttner · 2024-06-10T20:21:51Z

bin/pytorch_inference/signal9.py

+    parser.add_argument('--output_file', default='out.json')
+    parser.add_argument('--log_file', default='log.txt')
+    parser.add_argument('--num_threads_per_allocation', type=int, help='The number of inference threads used by LibTorch. Defaults to 1.')
+    parser.add_argument('--num_allocations', type=int, help='The number of allocations for parallel forwarding. Defaults to 1')


We should probably include setting the cache size here. Does it default to 0 if --cacheMemorylimitBytes isn't passed?

Does it default to 0 if --cacheMemorylimitBytes isn't passed?

Yes, the default cache value is 0:

--cacheMemorylimitBytes arg Optional memory in bytes that the inference cache can use - default is 0 which disables caching

script to continuously evaluate elser

0729447

jonathan-buttner reviewed Jun 10, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

script to continuously evaluate elser #2670

script to continuously evaluate elser #2670

davidkyle commented May 22, 2024

jonathan-buttner Jun 10, 2024

edsavage Jun 10, 2024

script to continuously evaluate elser #2670

Are you sure you want to change the base?

script to continuously evaluate elser #2670

Conversation

davidkyle commented May 22, 2024

jonathan-buttner Jun 10, 2024

Choose a reason for hiding this comment

edsavage Jun 10, 2024

Choose a reason for hiding this comment