Skip to content

#bins recommendation for large dataset... #8

@coforfe

Description

@coforfe

Hello,

Firstly thanks a lot for your package.

I think it is just the only one that allows to calulate drift concept between two dataset.
I plan to use it for a large dataset (some millions rows) and I would like to know which is your recommendation for the # bins parameter in either calculate_covariate_drift() or calculate_distance() functions.

Thanks in anticipation,
Carlos

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions