Unstable Training #29

@wenjunli-0

After switching to areal v0.3.3 and running the asearcher_local experiment, training became very unstable and could not continue because rollout time became very high.

Below are the hyperparameters I used to run the experiment, followed by the logs. (The blue curves are relatively stable; the green and grey curves are unstable and were produced after I switched to areal v0.3.3.)
epochs=10, 4p1t1+4p1t1, batch_size=128, max_concurrent_rollouts=48, mem_fraction_static=0.80

[Three images attached]
