Miscellaneous housekeeping improvements #513

Merged: 15 commits into tensorly:main from fix-exports-plsr-perf, Aug 2, 2023

Conversation

@aarmey (Contributor) commented Jul 22, 2023

Apologies that this does not involve just one change. This PR resolves various minor inconsistencies, deprecations, and performance issues:

  • In both the PLSR and HALS implementations, indexing is reduced, which improves performance, particularly on the Jax and TensorFlow backends
  • Some methods were not being exported, or had arguments missing from their docstrings
  • Most backends provide the np.linalg.lstsq interface, so all four of its return values are now passed through (see the sketch below)
  • As suggested in Improving the test suite #448, this adds pytest-randomly, which seeds the default random generators and reports the seed in the test output. I've found this very helpful when dealing with intermittent failures.
  • Python v3.8 is now EOL, so I removed code that existed only to support it. The default Python version in testing is now 3.11.

I will annotate each of the changes below so that this is simpler to review.
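For context, a minimal sketch of the lstsq change (assuming the NumPy backend; tl.lstsq mirrors np.linalg.lstsq):

```python
import numpy as np
import tensorly as tl

a = tl.tensor(np.random.randn(6, 3))
b = tl.tensor(np.random.randn(6, 2))

# All four np.linalg.lstsq outputs now pass through the backend,
# instead of only the solution and residuals.
solution, residuals, rank, singular_values = tl.lstsq(a, b)
```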

@aarmey self-assigned this Jul 22, 2023
+ones,
+zeros,
+zeros_like,
+eye,
+where,
+conj,
@aarmey (author):

These methods were not being exported, and I didn't see a reason for leaving them out.
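For illustration, a minimal sketch (assuming the NumPy backend) of the newly exported functions being reachable from the top-level namespace:

```python
import numpy as np
import tensorly as tl

a = tl.ones((2, 3))
b = tl.zeros_like(a)             # also: tl.zeros((2, 3))
c = tl.where(a > 0.5, a, b)      # elementwise selection
d = tl.eye(3)
e = tl.conj(tl.tensor(np.array([1 + 2j])))
```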

codecov bot commented Jul 22, 2023

Codecov Report

Merging #513 (b5161a7) into main (94c2ff2) will increase coverage by 0.07%.
Report is 3 commits behind head on main.
The diff coverage is 90.62%.

@@            Coverage Diff             @@
##             main     #513      +/-   ##
==========================================
+ Coverage   86.71%   86.78%   +0.07%     
==========================================
  Files         120      118       -2     
  Lines        7535     7515      -20     
==========================================
- Hits         6534     6522      -12     
+ Misses       1001      993       -8     
Files Changed Coverage Δ
tensorly/__init__.py 100.00% <ø> (ø)
tensorly/utils/__init__.py 100.00% <ø> (ø)
tensorly/tenalg/proximal.py 67.68% <71.42%> (+0.23%) ⬆️
tensorly/backend/core.py 67.67% <80.00%> (-0.01%) ⬇️
tensorly/backend/__init__.py 89.83% <100.00%> (ø)
tensorly/base.py 95.83% <100.00%> (ø)
tensorly/decomposition/_cp.py 83.71% <100.00%> (-0.11%) ⬇️
tensorly/decomposition/_tr.py 85.24% <100.00%> (ø)
tensorly/decomposition/tests/test_cmtf_als.py 100.00% <100.00%> (ø)
tensorly/metrics/entropy.py 86.36% <100.00%> (-0.60%) ⬇️
... and 4 more

... and 1 file with indirect coverage changes


"""
raise NotImplementedError

@staticmethod
def min(tensor):
def min(tensor, axis=None):
@aarmey (author):

The axis argument is actively used, so it should be represented in the signature here.
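For illustration, a minimal sketch (assuming the NumPy backend) of how the axis argument is used:

```python
import numpy as np
import tensorly as tl

t = tl.tensor(np.arange(12.0).reshape(3, 4))

tl.min(t)          # global minimum: 0.0
tl.min(t, axis=0)  # per-column minima, shape (4,)
```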

from . import backend as tl
-from .utils import prod
@aarmey (author):

prod was only here for Python v3.8 support.

@@ -426,8 +426,6 @@ def parafac(
tl.solve(tl.conj(tl.transpose(pseudo_inverse)), tl.transpose(mttkrp))
)
factors[mode] = factor
-if normalize_factors and mode != modes_list[-1]:
@aarmey (author):

I think the decision to do this only at the end came out of #264, but it was in a closed PR; I need to dig it up.

-K = 8
-M = 7
+K = 16
+M = 14
@aarmey (author):

To help with occasional testing failures.

# Normalize separately—this was profiling very slow in parafac
Z_comp = cp_normalize(Z_comp_CP)[1]
for mode in range(len(Z_comp)):
factor = multi_mode_dot(Z, Z_comp, skip=mode)
@aarmey (author):

Tucker is equivalent to CP at rank 1. This is more performant, and this spot was consistently one of the slowest parts of the test suite, particularly on indexing-slow backends.

Member:

What was the main bottleneck in cp_normalize? Was it just the part making sure the scales are non-zero?
Or was the slow part just calling the CP where it's not needed?

@aarmey (author):

Here, I think this mostly came down to calling the full CP routine, with all the extra code it contains, when it is not strictly needed. The call sits inside two loops, so it can run thousands of times. In the profiling, most of the time was spent on the MTTKRP, but the other parts of the CP call weren't negligible. See the sketch below.
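For context, a minimal sketch of the rank-1 identity this change relies on (shapes and names illustrative): contracting the tensor with every component vector except one is the rank-1 update that a full CP call would otherwise compute.

```python
import numpy as np
from tensorly.tenalg import multi_mode_dot

Z = np.random.randn(4, 5, 6)
u, v, w = np.random.randn(4), np.random.randn(5), np.random.randn(6)

# Contract every mode except mode 0 with its component vector.
update = multi_mode_dot(Z, [u, v, w], skip=0)

# Equivalent explicit contraction over the remaining modes.
expected = np.einsum("ijk,j,k->i", Z, v, w)
assert np.allclose(update, expected)
```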

)
V = tl.index_update(V, tl.index[k, :], V[k, :] + deltaV)
# Modifying the function for sparsification
if sparsity_coefficient is not None:
@aarmey (author):

This code should behave the same, but (1) it reuses some indexing results to help performance on indexing-slow backends, and (2) I find it a bit more readable with the repetitive segments removed and the expressions broken into logical units.

Member:

Agreed, this is much more readable and concise
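As a hypothetical illustration of the pattern (variable names are made up, not the actual diff): repeated index lookups are hoisted into locals, so indexing-slow backends evaluate each lookup only once.

```python
import numpy as np
import tensorly as tl

# Hypothetical stand-ins for the HALS inner-loop variables.
rank, n = 3, 5
V = tl.tensor(np.random.rand(rank, n))
UtU = tl.tensor(np.random.rand(rank, rank)) + tl.eye(rank)
UtM = tl.tensor(np.random.rand(rank, n))
k = 0

# Hoist the repeated lookups; each indexing op now runs once.
UtU_k = UtU[k, :]
UtM_k = UtM[k, :]
deltaV = (UtM_k - tl.dot(UtU_k, V)) / UtU[k, k]
V = tl.index_update(V, tl.index[k, :], V[k, :] + deltaV)
```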

-tl.index[i, j, k],
-tl.sum(product * tl.eye(product.shape[0])),
-)
+tensor = tl.einsum("iaj,jbk,kci->abc", *factors)
@aarmey (author):

We have trace in the backend now (see the TODO comment), but I thought this was cleaner.
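For reference, a small NumPy check (core shapes illustrative) that the einsum matches the per-entry trace the replaced loop computed with tl.sum(product * tl.eye(...)):

```python
import numpy as np

# Three ring cores with shapes (r0, n1, r1), (r1, n2, r2), (r2, n3, r0).
f1 = np.random.randn(2, 4, 3)
f2 = np.random.randn(3, 5, 4)
f3 = np.random.randn(4, 6, 2)

# One einsum contracts the whole ring at once.
tensor = np.einsum("iaj,jbk,kci->abc", f1, f2, f3)

# Entry (a, b, c) is the trace of the product of the matching slices.
a, b, c = 1, 2, 3
product = f1[:, a, :] @ f2[:, b, :] @ f3[:, c, :]
assert np.isclose(tensor[a, b, c], np.trace(product))
```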

@aarmey marked this pull request as ready for review, July 22, 2023 19:47
@JeanKossaifi (Member) left a comment:

Thanks @aarmey! Always great to have improvements and more readable code!

@@ -216,7 +216,7 @@ def tensor_ring_als(

if ls_solve == "lstsq":
# Solve least squares problem directly
-sol, _ = tl.lstsq(design_mat, tensor_unf)
+sol = tl.lstsq(design_mat, tensor_unf)[0]
Member:

Is it better to do this? If so, why not *_?

@aarmey (author):

I didn't know *_ was possible! Agreed, this will be more readable.
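For illustration (array names hypothetical), the two equivalent ways of keeping only the solution:

```python
import numpy as np
import tensorly as tl

design_mat = tl.tensor(np.random.randn(8, 3))
tensor_unf = tl.tensor(np.random.randn(8, 5))

sol = tl.lstsq(design_mat, tensor_unf)[0]    # index into the 4-tuple
sol, *_ = tl.lstsq(design_mat, tensor_unf)   # extended unpacking, as suggested
```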


@@ -407,21 +407,21 @@ def test_lstsq():

# test dimensions
a = T.randn((m, n))
b = T.randn((m, k))
-x, res = T.lstsq(a, b)
-assert_equal(x.shape, (n, k))
+ret = T.lstsq(a, b)
Member:

Same question as above for these: I find it typically slightly more readable to explicitly catch return values or use _ rather than index into tuples.

@aarmey merged commit a982cf9 into tensorly:main on Aug 2, 2023
9 of 10 checks passed
@aarmey deleted the fix-exports-plsr-perf branch on October 30, 2023