Mimic `adamw_torch_4bit` and have `adamw_torch_8bit`

### Feature request

Hi thanks for the lib! Currently there is `adamw_torch_4bit`, but I hope to mimic it to have a `adamw_torch_8bit` that uses 8bit torchao adamw.

The reason is that, I would like to use deepspeed cpu offload for the optimizer, and also use 8bit adamw. However, the 8bit one in current hf transformers does not support cpu, so I need to use the torchao one.

### Motivation

-

### Your contribution

yes, willing to PR

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mimic `adamw_torch_4bit` and have `adamw_torch_8bit` #34893

Feature request

Motivation

Your contribution

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Mimic adamw_torch_4bit and have adamw_torch_8bit #34893

Description

Feature request

Motivation

Your contribution

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

Mimic `adamw_torch_4bit` and have `adamw_torch_8bit` #34893