feat: Encoding API Improvements #498

nabetti1720 · 2024-07-26T15:25:31Z

Description of changes

To improve compatibility with encodings supported by Node.js, the following actions have been taken.
https://nodejs.org/api/util.html#whatwg-supported-encodings

For encodings supported by LLRT, modified so that they can also be specified by aliases.
Reduced excessive normalization and made label judgments a little stricter.
Since iso-8859-1 was an alias, it was managed by windows-1252.
ENCODING_MAP was introduced with the expectation of improved maintenance ~~and lookup performance~~.

Checklist

Created unit tests in tests/unit and/or in Rust for my feature if needed
Ran make fix to format JS and apply Clippy auto fixes
Made sure my code didn't add any additional warnings: make check
Updated documentation if needed (API.md/README.md/Other)

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

richarddavison

Hi @nabetti1720! I like the aliasing and more simple approach. Not sure if this yields much performance benefit (except for maybe one less string allocation to normalize the key) but it's more maintainable for sore.

llrt_utils/src/encoding.rs

nabetti1720 · 2024-07-27T00:48:05Z

Hi @richarddavison .
Yes. In the previous implementation, it was necessary to create a new enum when dealing with aliases that exceeded the normalization range (e.g., treating unicode-1-1-utf8 as utf-8), and this proposal was made to avoid this.

Also, we believe that by minimizing the normalization, we can now evaluate with the same string as the actual label.

Lastly, sorry. Regarding lookup performance, I believe it is equivalent to O(1) both previously and now. I will correct what I described as an improvement. :)

types/buffer.d.ts

feat: Encoding API Improvements

51a4884

richarddavison approved these changes Jul 26, 2024

View reviewed changes

llrt_utils/src/encoding.rs Outdated Show resolved Hide resolved

Use HashMap::with_capacity

c2e5893

Sytten reviewed Jul 27, 2024

View reviewed changes

types/buffer.d.ts Outdated Show resolved Hide resolved

nabetti1720 added 2 commits July 27, 2024 11:49

Fix types

7649549

Fix API.md

ca9415a

richarddavison approved these changes Jul 27, 2024

View reviewed changes

richarddavison merged commit 701ce32 into awslabs:main Jul 27, 2024
8 checks passed

nabetti1720 deleted the feat/encoding-api branch July 27, 2024 05:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Encoding API Improvements #498

feat: Encoding API Improvements #498

nabetti1720 commented Jul 26, 2024 •

edited

Loading

richarddavison left a comment

nabetti1720 commented Jul 27, 2024 •

edited

Loading

feat: Encoding API Improvements #498

feat: Encoding API Improvements #498

Conversation

nabetti1720 commented Jul 26, 2024 • edited Loading

Description of changes

Checklist

richarddavison left a comment

Choose a reason for hiding this comment

nabetti1720 commented Jul 27, 2024 • edited Loading

nabetti1720 commented Jul 26, 2024 •

edited

Loading

nabetti1720 commented Jul 27, 2024 •

edited

Loading