Add support for running without zero optimizer sharding #1

@ahoffman-aws

Description

It would be nice to compare the memory footprint with and without ZeRO stage 1. Running without it is currently not implemented:

if model_repr.parallelism_cfg.zero_level != ParallelConfig.ZeroLevel.PARTITION_OPTIMIZER:
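
To illustrate the comparison being asked for, here is a rough sketch. It assumes the mixed-precision Adam accounting from the ZeRO paper (2 bytes of fp16 weights, 2 bytes of fp16 gradients, and 12 bytes of fp32 optimizer state per parameter) and ignores activations and buffers. `ZeroLevel.NONE`, `bytes_per_device`, and the standalone `ZeroLevel` enum are hypothetical names for illustration only, not this repository's API.

```python
from enum import Enum


class ZeroLevel(Enum):
    # Hypothetical stand-in for ParallelConfig.ZeroLevel; NONE is an assumed new member.
    NONE = 0
    PARTITION_OPTIMIZER = 1  # ZeRO stage 1: optimizer state sharded across data-parallel ranks


# Assumed per-parameter byte costs for mixed-precision Adam (ZeRO paper accounting).
WEIGHT_BYTES = 2      # fp16 weights
GRAD_BYTES = 2        # fp16 gradients
OPTIMIZER_BYTES = 12  # fp32 master weights + momentum + variance


def bytes_per_device(num_params: int, dp_degree: int, zero_level: ZeroLevel) -> int:
    """Estimate model-state bytes held by one data-parallel rank."""
    if dp_degree < 1:
        raise ValueError("dp_degree must be >= 1")
    optimizer = OPTIMIZER_BYTES * num_params
    if zero_level is ZeroLevel.PARTITION_OPTIMIZER:
        # Each rank keeps only its shard of the optimizer state (rounded up).
        optimizer = -(-optimizer // dp_degree)
    return (WEIGHT_BYTES + GRAD_BYTES) * num_params + optimizer


if __name__ == "__main__":
    params, dp = 1_000_000_000, 8
    for level in ZeroLevel:
        gib = bytes_per_device(params, dp, level) / 2**30
        print(f"{level.name}: {gib:.2f} GiB per device")
```

Under these assumptions the only term that changes between the two modes is the optimizer state, so supporting the unsharded case would mostly mean skipping the division by the data-parallel degree.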
