Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Break up cutlass_extensions.cu, pt1 (#2944)
Summary: X-link: facebookresearch/FBGEMM#47 Pull Request resolved: #2944 Break up cutlass_extensions.cu Currently, `//deeplearning/fbgemm/fbgemm_gpu/experimental/gen_ai:quantize_ops_cuda#compile-cutlass_extensions.cu` is the second most expensive target in fbcode, at 1.77 EYS (see https://www.internalfb.com/intern/unidash/dashboard/ads_serving_platform/build_speed/). As such, this is the first of N diffs to break up the source file into smaller chunks to parallelize the compilation and reduce the build times. Before this diff - https://www.internalfb.com/buck2/4569ea23-35d0-4086-b7e1-ba30f1e665f8: (24:12.8s) After this diff - https://www.internalfb.com/buck2/fffc0745-446d-4b22-bff9-3bff0c74eb58 (22:58.5s) Reviewed By: jianyuh, spcyppt Differential Revision: D60874248 fbshipit-source-id: 345ef2e579b391049aee8adf210d24b6e3190775
- Loading branch information