Python 3 compatibility and t-batch caching. #9

jpalowitch · 2020-03-31T07:03:28Z

The first 3 commits address python 3 compatibility and remove unnecessary imports.

The final commit is an incomplete tbatching optimization. We don't need to recompute tbatches for every epoch, so it makes sense to do some type of caching. Also, we don't need to recompute them for every run either, assuming we can load the entire dict of tbatches into memory and do random access on each dict (needed to account for user changes to timespan).

However, the current code isn't set up to incorporate these changes easily, because chunks of t-batches are computed on-the-fly, trading off with the corresponding chunk of the epoch. So one would have to compute the "start" and "end" points of each tbatch chunk so that the epoch chunk can access the right tbatches.

The code in the 4th commit is not just unoptimized, but buggy. In the first epoch, the tbatch dicts keep growing, as args.cache_tbatches=True removes tbatch reinitialization. But the epoch still iterates over the full length of the tbatch dicts. There are a two competing ways one could fix this:

Revert to reinitializing the tbatch chunk every time, but save the tbatch chunks to disk.
Tell the epoch where to access the tbatch dict, instead of starting from the beginning.

2 seems easier, but I will leave that to the code maintainers' judgement :)

pmixer · 2020-07-07T03:15:35Z

Hi @jpalowitch, I can help finishing&testing cached t-batch feature as synced-up with Prof. Kumar last week, are you still interested/in-need of it?

jpalowitch · 2020-07-08T16:28:11Z

Hi @jpalowitch, I can help finishing&testing cached t-batch feature as synced-up with Prof. Kumar last week, are you still interested/in-need of it?

Yes, thanks!

jpalowitch added 4 commits March 25, 2020 16:07

Change all non-paren print statements to Python3 print.

12bd3bd

Remove cPickle imports

0700918

Use list() to unroll yields

77c1e8e

A start on caching tbatches

18d341b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python 3 compatibility and t-batch caching. #9

Python 3 compatibility and t-batch caching. #9

jpalowitch commented Mar 31, 2020

pmixer commented Jul 7, 2020

jpalowitch commented Jul 8, 2020

Python 3 compatibility and t-batch caching. #9

Are you sure you want to change the base?

Python 3 compatibility and t-batch caching. #9

Conversation

jpalowitch commented Mar 31, 2020

pmixer commented Jul 7, 2020

jpalowitch commented Jul 8, 2020