
from-151-Ternary-dense-try-different-normalization-to-optimize-outcome #152

Open
david-thrower opened this issue Jun 13, 2024 · 1 comment

Comments

@david-thrower
Owner

Kind of issue: Feature development

Issue described: We have a successful implementation of a Ternary replacement for Dense layers. The metrics are not quite what we want on some problems.

One possible culprit is the default batch normalization behavior; Dropout or layer normalization may work better.
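
For context, a minimal sketch of what a ternary replacement for Dense typically looks like: latent float weights quantized to {-1, 0, +1} on the forward pass, with a straight-through estimator for gradients. The class name, the quantization threshold, and the layer structure are illustrative assumptions, not the project's actual implementation.

```python
import tensorflow as tf
from tensorflow import keras

class TernaryDense(keras.layers.Layer):
    """Illustrative sketch of a ternary-weight dense layer.

    Forward pass uses weights quantized to {-1, 0, +1}; gradients
    flow to the latent float weights via a straight-through estimator.
    Not the project's actual implementation.
    """

    def __init__(self, units, threshold=0.05, **kwargs):
        super().__init__(**kwargs)
        self.units = units
        self.threshold = threshold  # |w| below this quantizes to 0

    def build(self, input_shape):
        self.w = self.add_weight(
            name="latent_weights",
            shape=(input_shape[-1], self.units),
            initializer="glorot_uniform",
            trainable=True,
        )
        self.b = self.add_weight(
            name="bias", shape=(self.units,),
            initializer="zeros", trainable=True,
        )

    def call(self, inputs):
        # Quantize: sign(w) where |w| exceeds the threshold, else 0.
        w_ternary = tf.where(
            tf.abs(self.w) > self.threshold,
            tf.sign(self.w),
            tf.zeros_like(self.w),
        )
        # Straight-through estimator: forward uses w_ternary, while the
        # backward pass treats the quantization as identity.
        w_ste = self.w + tf.stop_gradient(w_ternary - self.w)
        return tf.matmul(inputs, w_ste) + self.b
```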

@david-thrower
Owner Author

david-thrower commented Jun 13, 2024

Tried thus far:

  • No normalization and no dropout: essentially no predictive value on tabular data; RMSE was roughly what you would get by always predicting the mean.
  • Layer normalization: some improvement over no normalization, but worse than the default batch normalization on tabular data. However, it produced exceptional results on the Ham / Spam problem. The disparity between the tabular and text-classification results is strange. It may be that the hyperparameter ranges for tabular are simply way off for the Ternary network. Alternatively, we could define separate nodes for Ternary and Dense, keeping Dense with batch normalization for tabular tasks and using Ternary with layer normalization for NLP tasks (see the sketch below). There is probably some configuration that will make both work.
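
A minimal sketch of how the regularization stage could be made swappable per task, so batch normalization, layer normalization, and dropout can be compared under one interface. The function name, argument names, and the plain `Dense` stand-in (which the Ternary layer would replace) are assumptions for illustration, not the project's API:

```python
from tensorflow import keras
from tensorflow.keras import layers

def ternary_block(units, regularization="batch_norm", dropout_rate=0.1):
    """One hidden block with a swappable regularization stage.

    Hypothetical helper: `layers.Dense` stands in for the Ternary layer.
    """
    stack = [layers.Dense(units)]
    if regularization == "batch_norm":    # current default; best on tabular so far
        stack.append(layers.BatchNormalization())
    elif regularization == "layer_norm":  # best on Ham / Spam so far
        stack.append(layers.LayerNormalization())
    elif regularization == "dropout":     # next experiments; rates listed below
        stack.append(layers.Dropout(dropout_rate))
    stack.append(layers.Activation("relu"))
    return keras.Sequential(stack)
```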

To try (a minimal sweep sketch follows the list):

  • Dropout 0.1
  • Dropout 0.5
  • Dropout 0.8
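
A sketch of the sweep itself, reusing the hypothetical `ternary_block` helper above. The synthetic regression data, network shape, and epoch count are placeholder assumptions standing in for the real tabular benchmark:

```python
import numpy as np
from tensorflow import keras

# Synthetic tabular regression data as a placeholder for the real benchmark.
rng = np.random.default_rng(42)
x = rng.normal(size=(1000, 20)).astype("float32")
y = (x @ rng.normal(size=(20, 1))).astype("float32")

results = {}
for rate in (0.1, 0.5, 0.8):
    model = keras.Sequential([
        keras.Input(shape=(20,)),
        ternary_block(64, regularization="dropout", dropout_rate=rate),
        keras.layers.Dense(1),
    ])
    model.compile(optimizer="adam", loss="mse",
                  metrics=[keras.metrics.RootMeanSquaredError()])
    hist = model.fit(x, y, validation_split=0.2, epochs=20, verbose=0)
    results[rate] = min(hist.history["val_root_mean_squared_error"])

print(results)  # lowest validation RMSE indicates the best dropout rate
```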
