Skip to content

Commit

Permalink
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[GPU] Optimized operations in the blas kernels with the latest buffe…
Browse files Browse the repository at this point in the history
…r changes.

    Updated the pipeline for both fp32 and fp16.
    SwiGLU, RmsNorm and Concat ops updated.

        **Self evaluation:**
        1. Build test:   [X]Passed [ ]Failed [ ]Skipped
        2. Run test:     [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Niket Agarwal <niket.a@samsung.com>
niket-agarwal committed Jan 6, 2025
1 parent bed236e commit 15f9374
Showing 5 changed files with 246 additions and 213 deletions.
Loading

0 comments on commit 15f9374

Please sign in to comment.