Skip to content
This repository was archived by the owner on Mar 12, 2024. It is now read-only.
This repository was archived by the owner on Mar 12, 2024. It is now read-only.

Auto-ranking of most explicative features #228

@danthe3rd

Description

@danthe3rd

Scenario:
I have a grid-search on parameters A, B and C.
For each sample, I have an associated loss which I try to minimize.

I want to know which parameter (A, B or C) has the most influence on the loss automatically.

In python: This can be done by learning a simple RandomForestRegressor (or Classifier depending on the target value type), and then calling permutation_importance to get an importance score for each parameter.
For this to be embedded in HiPlot, it would need to be done in JS (for example with this library?)

https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestRegressor.html
https://scikit-learn.org/stable/modules/permutation_importance.html

UI: This could be triggered by right-clicking a column. The result could be displayed by ordering the column by relative importance.
Need a way to select which columns to include/exclude from the calculation,
and to display the correlation score

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions