Skip to main content

Showing 1–2 of 2 results for author: Lyubomirsky, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2006.09616  [pdf, other

    cs.LG cs.PL stat.ML

    Dynamic Tensor Rematerialization

    Authors: Marisa Kirisame, Steven Lyubomirsky, Altan Haan, Jennifer Brennan, Mike He, Jared Roesch, Tianqi Chen, Zachary Tatlock

    Abstract: Checkpointing enables the training of deep learning models under restricted memory budgets by freeing intermediate activations from memory and recomputing them on demand. Current checkpointing techniques statically plan these recomputations offline and assume static computation graphs. We demonstrate that a simple online algorithm can achieve comparable performance by introducing Dynamic Tensor Re… ▽ More

    Submitted 18 March, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 31 pages, 12 figures, implementation available here: https://github.com/uwsampl/dtr-prototype, OpenReview: https://openreview.net/forum?id=Vfs_2RnOD0H

    ACM Class: C.3

  2. arXiv:1904.08368  [pdf, other

    cs.LG cs.PL stat.ML

    Relay: A High-Level Compiler for Deep Learning

    Authors: Jared Roesch, Steven Lyubomirsky, Marisa Kirisame, Logan Weber, Josh Pollock, Luis Vega, Ziheng Jiang, Tianqi Chen, Thierry Moreau, Zachary Tatlock

    Abstract: Frameworks for writing, compiling, and optimizing deep learning (DL) models have recently enabled progress in areas like computer vision and natural language processing. Extending these frameworks to accommodate the rapidly diversifying landscape of DL models and hardware platforms presents challenging tradeoffs between expressivity, composability, and portability. We present Relay, a new compiler… ▽ More

    Submitted 24 August, 2019; v1 submitted 17 April, 2019; originally announced April 2019.