- Using Dlib python javascript 0 Using Chrome FaceDetection API sample
- Web Audio ML Features - link
- Audio processing applications - link
- Speech Recognition Engine using ConvNet in Keras - link
- Speacker Recognition: training audio books, testing on youtube videos - link
Use existing callback for saving activations - or at least simplify code using HooksCallback Untested, but something like:
class StoreHook(HookCallback):
def on_train_begin(self, **kwargs):
super().on_train_begin(**kwargs)
self.acts = []
def hook(self, m, i, o): return o
def on_batch_end(self, train, **kwargs): self.acts.append(self.hooks.stored)
Pass a list of modules to the ctor to hook whatever layers you like.
Several approaches can be tested, and the one that works better is one where time series are mapped to an image by means of a polar coordinate transformation called the Gramian Angular Field (GAF). This creates a rich graphical representation of a univariate time series. If you are interested you can read this paper (Encoding Time Series as Images for Visual Inspection and Classification Using Tiled Convolutional Neural Networks). Based on this the univariate time series on the left is transformed into the image on the right. notebook
- pyts for timeseries transformation and classification - documentation
- Get started on RL link