Skip to content

Attention doesn't work well for downsample_step=1 and outputs_per_step=1 #24

@r9y9

Description

@r9y9

Noticed while working on #21.

Trained 300k steps, but the model was not generalized well. Need to figure out how we can improve.

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions