improve performance of TOML parsing by reading from memory instead of from file #1751

KristofferC · 2020-04-06T14:00:13Z

Reading a bunch of small things from a file tends to be much slower than just reading the whole file in one chunk and operating on memory (especially with Julia now locking the stream for every read). None of the files we parse are big enough that operating on a file makes sense.

Benchmark:

function parse_registry()
    path = joinpath(homedir(), ".julia/registries/General")
    for (root, dirs, files) in walkdir(path)
       for file in files
           if endswith(file, ".toml")
               TOML.parsefile(joinpath(root, file))
           end
       end
   end
end

Before:

696.417 ms (2111063 allocations: 157.45 MiB)

After:

355.687 ms (2164815 allocations: 166.51 MiB)

… file

KristofferC · 2020-04-06T15:11:19Z

Ah, this caught a too loose try catch.

… from file (#1751) * improve performance of TOML parsing by reading from memory instead of file (cherry picked from commit 47784de)

improve performance of TOML parsing by reading from memory instead of…

c4000fd

… file

KristofferC added performance TOML labels Apr 6, 2020

StefanKarpinski approved these changes Apr 6, 2020

View reviewed changes

also use parsefile when reading manifest

eb6309c

KristofferC force-pushed the kc/parse_speed branch from 1341073 to eb6309c Compare April 6, 2020 14:25

tweak some path opes

8dac789

KristofferC added 3 commits April 6, 2020 17:17

fixup using parsefile for manifest

af63d15

fixup

017a24a

fixup

8182c60

KristofferC merged commit 47784de into master Apr 7, 2020

KristofferC deleted the kc/parse_speed branch April 7, 2020 07:10

KristofferC added the backport pending 1.4 label May 4, 2020

KristofferC added a commit that referenced this pull request May 10, 2020

improve performance of TOML parsing by reading from memory instead of…

ae6e8ac

… from file (#1751) * improve performance of TOML parsing by reading from memory instead of file (cherry picked from commit 47784de)

fredrikekre removed the backport pending 1.4 label May 11, 2020

KristofferC mentioned this pull request May 25, 2020

Use the TOML stdlib in code loading JuliaLang/julia#36018

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve performance of TOML parsing by reading from memory instead of from file #1751

improve performance of TOML parsing by reading from memory instead of from file #1751

KristofferC commented Apr 6, 2020 •

edited

Loading

KristofferC commented Apr 6, 2020

improve performance of TOML parsing by reading from memory instead of from file #1751

improve performance of TOML parsing by reading from memory instead of from file #1751

Conversation

KristofferC commented Apr 6, 2020 • edited Loading

KristofferC commented Apr 6, 2020

KristofferC commented Apr 6, 2020 •

edited

Loading