Hands on in in-house dataset - Bad performance on M1 Max #94

gonzalalGFM · 2024-05-06T13:40:09Z

Testing on a in-house dataset which we have in our research group as a benchmark. It has lots of inputs to one output but to keep it simple now we have 5 inputs and 1 output and like 100K observations.

Model conversion from pandas Dataframe

my_ds= {"train_input":torch.from_numpy(np.array(train_data_x)[:, :5]),
"test_input":torch.from_numpy(np.array(test_data_x)[:, :5]),
"train_label":torch.from_numpy(np.array(train_data_y)),
"test_label":torch.from_numpy(np.array(test_data_y))}

Model creation

kan_model = KAN(width=[1,1,1], grid=2, k=3, seed=0)

Model fit

kan_model.train(my_ds, opt="LBFGS", steps=2, lamb=0.01, lamb_entropy=10.)

Perhaps am I missing something, some parameter I don't know.

Pd: also tested for 10k observations and got the same behaviour.

Idk if it is important due to optimization and that stuff but I'm using an Apple M1 Max w/ 64GB.
The versions used are:
torch- > 2.3.0
numpy -> 1.24.4

So I left the training on a Jupyter notebook for more than an hour and it haven't pass from the 0%, tbh I think is something related to the chip architecture. But also could be to the data... I do not know if 10k obervations are a lot for this stage of the code.

KindXiaoming · 2024-05-06T13:56:59Z

Hi, if your input has 5D, then

kan_model = KAN(width=[1,1,1], grid=2, k=3, seed=0)
kan_model.train(...)

should incur an error immediately, because your KAN takes only one inputs. valid widths are e.g., width=[5,1], width=[5,3,1] etc.

gonzalalGFM · 2024-05-06T14:18:50Z

Mmm, indeed there was no error but that was the problem. Everything is rolling ;)
Are there any guide for searching the very best hyper parameters configuration? Are some hyperparameters more influenceable for the final performance? Trying to change grid as it is done in some examples, then also the width. Everything w/ optuna

KindXiaoming · 2024-05-06T16:45:59Z

Hi I have some general advice on hyperparameter tuning here.

brayevalerien · 2024-05-07T11:59:33Z

Providing data with a dimension that differs from the width of the first KAN layers should maybe raise an Error?

KindXiaoming closed this as completed Jul 14, 2024

KindXiaoming reopened this Jul 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hands on in in-house dataset - Bad performance on M1 Max #94

Hands on in in-house dataset - Bad performance on M1 Max #94

gonzalalGFM commented May 6, 2024

KindXiaoming commented May 6, 2024 •

edited

Loading

gonzalalGFM commented May 6, 2024

KindXiaoming commented May 6, 2024

brayevalerien commented May 7, 2024

Hands on in in-house dataset - Bad performance on M1 Max #94

Hands on in in-house dataset - Bad performance on M1 Max #94

Comments

gonzalalGFM commented May 6, 2024

Model conversion from pandas Dataframe

Model creation

Model fit

KindXiaoming commented May 6, 2024 • edited Loading

gonzalalGFM commented May 6, 2024

KindXiaoming commented May 6, 2024

brayevalerien commented May 7, 2024

KindXiaoming commented May 6, 2024 •

edited

Loading