Skip to content

v3.5.0

Compare
Choose a tag to compare
@github-actions github-actions released this 20 Jun 11:51
· 78 commits to main since this release

v3.5.0 (2024-06-20)

Feature

Unknown

  • Performance improvements + using multiple GPUs. (#189)

  • fix: no grads when filling cache

  • trainer should put activations on sae device

  • hack to allow sae device to be specific gpu when model is on multiple devices

  • add some tests (not in CI, which check multiple GPU performance

  • make formatter typer happy

  • make sure SAE calls move data between devices as needed (400474e)