Permit FFI files in subdirectories #3601

PgBiel · 2024-09-08T23:11:03Z

Closes #1562

~~Still WIP (finishing the details regarding the duplicate module check)~~

Changes:

FFI files in subdirectories are now copied. For JavaScript specifically, they can be used as @external(javascript, "./file.mjs", "function"). For Erlang, you'd use the module's name as usual.
An additional check was included to ensure no .erl files with the same name exist within the project, as Erlang cannot have duplicate modules.
Added support for directories to the in-memory filesystem. (Otherwise, the is_directory function would return true for files and read_dir would also list the directory itself, causing tests to enter infinite recursion and crash.)

Potential improvements:

I'd like to add explicit integration tests for using @external with relative paths in JS (I tested locally with .mjs with node.js, Deno and Bun and it worked, but could be worth it to test other potential edge cases). Let me know where would be the best spot in the codebase to do so.
Consider having an error when a native file would overlap with a .gleam file's generated code (.mjs or .erl).

lpil

Thank you!

You'll see there's integration tests in the test directory which can be updated with nested FFI files

compiler-core/src/build/native_file_copier.rs

compiler-core/src/io/memory.rs

PgBiel · 2024-09-15T18:02:25Z

Problem: My handmade recursive directory walker is too naive and can enter infinite symlink loops. I'll probably add a new method to FileSystemReader where we can address this at the IO boundary, similarly to how it's done for gleam_source_files.

PgBiel · 2024-09-16T02:24:50Z

Alright, I've made the implementation more robust (it also now supports copying nested Erlang modules), added detection of conflicting .gleam and .mjs files (but not of conflicting .gleam and .erl files yet due to the issue outlined here: #1562 (comment) - we could leave this to a future PR), added detection of conflicting .erl files in separate subpaths, added more unit tests and added integration tests. 😄

inoas · 2024-09-16T13:45:12Z

Hello,

thanks for taking this on, this is lovely <3 for FFI interop!

In case this is desired and-or not yet considered:

Can we enforce some namespacing on Erlang, Elixir and JavaScript modules that maps to the directory structure and else throw a compiler error?

Edit:

... if possible, this might also make this check obsolete?

An additional check was included to ensure no .erl files with the same name exist within the project, as Erlang cannot have duplicate modules.

PgBiel · 2024-09-16T16:12:28Z

I'm not sure that would be desirable since the main point of FFI is to have full control over what the final transpiled code does. Besides, we would have to parse all Erlang files, though I guess we could do with some regex in this case. But I think it's more a matter of flexibility, and the error just tells you if you ever mess it up somehow.

The error is also not perfect since I guess it's theoretically possible for you to shadow some module from some other package. But it's an attempt.

inoas · 2024-09-16T18:54:41Z

I'm not sure that would be desirable since the main point of FFI is to have full control over what the final transpiled code does. Besides, we would have to parse all Erlang files, though I guess we could do with some regex in this case. But I think it's more a matter of flexibility, and the error just tells you if you ever mess it up somehow.

The error is also not perfect since I guess it's theoretically possible for you to shadow some module from some other package. But it's an attempt.

maybe it could warn.
I don't know... I think these FFI files will all be new and setting up strictness is nice there. You can still build any erlang/elixir library as a deprendency.

PgBiel · 2024-09-16T19:38:00Z

I think this is fine to be honest... Forcing people to name their modules like folder@folder2@something in every folder doesn't sound good or useful realistically. And it can clash either way.

lpil

Very nice, thank you.

It seems now that there's a concept of directories in the in memory file system and also the OS file system, and each is responsible for implementing traversal. I think it would make sense to remove this duplication and instead to have the trait know nothing about directory walking and then the walking logic depends on the trait and is shared between all implementations.

PgBiel · 2024-09-20T20:38:50Z

I think it would make sense to remove this duplication and instead to have the trait know nothing about directory walking and then the walking logic depends on the trait and is shared between all implementations.

If I understood correctly, you are proposing making a dir walking function which uses only the trait's read_dir function (and other trait functions if needed), right? Though I'm not sure if we would want to reinvent the wheel here, given we already have a package which implements graph traversal logic to avoid loops etc. What if we compromise and simply add a walk_dir function to the trait and use that where applicable?

Although I guess a re-implementation will be needed if we add symlinks to the in-memory FS down the road, so we could alternatively consider searching for existing platform agnostic implementations and using that, or even just re-implement while attempting to avoid reinventing the wheel as much as possible, e.g. by using special data structure crates if needed. Still, I'm not fully aware of all edge cases to know whether we would be able to match existing OS-specific implementations in terms of not only performance but also correctness.

lpil · 2024-09-21T20:16:02Z

Yes, that's right. We should not have two implementations of the same thing, and removing a dependency is beneficial for maintenance. Having one less dependency to verify for the EU Cyber Resilience Act is nice too!

I'm not worried about correctness seeing as this is a trivial algorithm, will be fine.

PgBiel · 2024-10-07T00:55:25Z

By the way, I believe this is very close to completion (the dir walker algorithm was already pushed, but I haven't tested it well yet) - I'll be picking this up again very soon, once I have a bit more time :)

PgBiel · 2024-11-06T01:35:08Z

Ready for another look 😄

The dir walker has been implemented with symlink loop detection. I've switched other places using recursive search to use the dir walker. Try it out :)

I've also made a few fixes and explicit design decisions in the implementation of directories in IMFS. Of note, I now ensure the root path always exists, for consistency. (We don't really check for this anywhere at the moment though, so I can remove that, but I felt like not leaving a time bomb for the future here 😄.)

lpil

Thank you!!

Ideally, `src/abc.gleam` and `src/abc.erl` would cause an error. However, this can actually occur legitimately when downloading a Hex package with precompiled `.erl` source files, which would trigger false positives. Wonder what would be the best way to detect this situation, but doesn't seem trivial to solve.

- Test header file in separate subdir - Test parent folder ffi in Erlang

- Prevent problems with infinite symlink loops

add mjs bug fix to changelog update changelog to include erl ffi files update changelog with js ffi change

We were previously including all paths in the filesystem, which included, in particular, the root. Therefore, the "initial files" included the root directory, causing everything to be deleted, not only the initial files.

lpil reviewed Sep 10, 2024

View reviewed changes

compiler-core/src/build/native_file_copier.rs Outdated Show resolved Hide resolved

compiler-core/src/io/memory.rs Outdated Show resolved Hide resolved

compiler-core/src/io/memory.rs Outdated Show resolved Hide resolved

compiler-core/src/io/memory.rs Outdated Show resolved Hide resolved

lpil marked this pull request as draft September 10, 2024 15:22

PgBiel force-pushed the nested-js-ffi branch from 29b67b1 to 0880a75 Compare September 12, 2024 23:31

PgBiel changed the title ~~Permit JS FFI files in subdirectories~~ Permit FFI files in subdirectories Sep 13, 2024

PgBiel force-pushed the nested-js-ffi branch from a7903ba to cb3980e Compare September 15, 2024 17:36

PgBiel force-pushed the nested-js-ffi branch from 5da6877 to df8cbd8 Compare September 16, 2024 01:42

PgBiel marked this pull request as ready for review September 16, 2024 02:22

lpil reviewed Sep 20, 2024

View reviewed changes

lpil marked this pull request as draft September 20, 2024 13:53

PgBiel force-pushed the nested-js-ffi branch 5 times, most recently from ea4fbd3 to 7410d22 Compare October 24, 2024 04:39

PgBiel force-pushed the nested-js-ffi branch 2 times, most recently from 3808565 to 977041b Compare November 5, 2024 15:02

PgBiel marked this pull request as ready for review November 6, 2024 01:34

PgBiel force-pushed the nested-js-ffi branch 2 times, most recently from c4a6cf1 to 9769b61 Compare November 8, 2024 02:34

lpil approved these changes Nov 22, 2024

View reviewed changes

PgBiel added 26 commits November 22, 2024 14:20

detect clash between gleam and erl modules

dfb2cc0

detect clashing gleam and javascript modules

fc5945c

separate .gleam codepath

738054f

add subdir_ffi integration tests

a3a63d1

add subdir_ffi test to ci

08b8b49

use EcoString, add subdir Erlang file tests

120e29c

more robust subdir_ffi tests

40d3392

- Test header file in separate subdir - Test parent folder ffi in Erlang

use walkdir for native file search

3db62f3

- Prevent problems with infinite symlink loops

add module and native file clash error

d6de026

add error for duplicate native erlang module

2d8b038

update changelog with ffi subdir info

a4d5ec6

add mjs bug fix to changelog update changelog to include erl ffi files update changelog with js ffi change

final nested ffi improvements

ba54aa1

custom dirwalker

8f969ac

fix test project compiler

541c13f

dir walker: don't canonicalize returned paths

0789110

replace reader methods by dirwalker

c2d5718

add dir walking test

4c2308f

fix phony in subdir_ffi test

e083a8a

add trace back to gleam_source_files

770baba

improve dir walker comments

76474ba

use imfs subpaths for initial files in tests

210c20f

We were previously including all paths in the filesystem, which included, in particular, the root. Therefore, the "initial files" included the root directory, causing everything to be deleted, not only the initial files.

add note about root path

1313797

ensure imfs has root

45af9bf

cannot delete root from imfs

5c78e5a

fix tests by using 'files'

1959f4a

lpil force-pushed the nested-js-ffi branch from 9769b61 to 1959f4a Compare November 22, 2024 14:22

Remove some indentation

974abaf

lpil enabled auto-merge (rebase) November 22, 2024 14:34

lpil merged commit 3107320 into gleam-lang:main Nov 22, 2024
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Permit FFI files in subdirectories #3601

Permit FFI files in subdirectories #3601

PgBiel commented Sep 8, 2024 •

edited

Loading

lpil left a comment

PgBiel commented Sep 15, 2024 •

edited

Loading

PgBiel commented Sep 16, 2024 •

edited

Loading

inoas commented Sep 16, 2024 •

edited

Loading

PgBiel commented Sep 16, 2024 •

edited

Loading

inoas commented Sep 16, 2024

PgBiel commented Sep 16, 2024

lpil left a comment

PgBiel commented Sep 20, 2024 •

edited

Loading

lpil commented Sep 21, 2024

PgBiel commented Oct 7, 2024

PgBiel commented Nov 6, 2024 •

edited

Loading

lpil left a comment

Permit FFI files in subdirectories #3601

Permit FFI files in subdirectories #3601

Conversation

PgBiel commented Sep 8, 2024 • edited Loading

lpil left a comment

Choose a reason for hiding this comment

PgBiel commented Sep 15, 2024 • edited Loading

PgBiel commented Sep 16, 2024 • edited Loading

inoas commented Sep 16, 2024 • edited Loading

PgBiel commented Sep 16, 2024 • edited Loading

inoas commented Sep 16, 2024

PgBiel commented Sep 16, 2024

lpil left a comment

Choose a reason for hiding this comment

PgBiel commented Sep 20, 2024 • edited Loading

lpil commented Sep 21, 2024

PgBiel commented Oct 7, 2024

PgBiel commented Nov 6, 2024 • edited Loading

lpil left a comment

Choose a reason for hiding this comment

PgBiel commented Sep 8, 2024 •

edited

Loading

PgBiel commented Sep 15, 2024 •

edited

Loading

PgBiel commented Sep 16, 2024 •

edited

Loading

inoas commented Sep 16, 2024 •

edited

Loading

PgBiel commented Sep 16, 2024 •

edited

Loading

PgBiel commented Sep 20, 2024 •

edited

Loading

PgBiel commented Nov 6, 2024 •

edited

Loading