Handle non-ASCII charater filenames in EnrollWithRequestIdBean #175

SirTediousOfFoo · 2022-11-24T14:22:19Z

We had issues with issuing certs containing Croatian symbols - žđšćč as mentioned in #174 so I've looked into the parts that handle it and found slightly differing logic in the downloadToken function between the EnrollWithRequestIdBean and EnrollMakeNewRequestBean.java where one uses a getFileName() function to handle the name generation and the other one does it inline.
The one using the function also encodes the filename to a Base64 string which bypasses the issues with non-ASCII characters.

Resolve ECA-10481 "Fb configdump protocol configs" Closes ECA-10481 See merge request ejbca/ejbca!402

ECA-10793: Fix transaction handling in AdminPreferenceSessionBean Closes ECA-10793 See merge request ejbca/ejbca!405

…-alias' into 'main' ECA-10443 Added DNS identifier challenge type selection to ACME Closes ECA-10443 See merge request ejbca/ejbca!406

ECA-10623 Allow quotation marks in the CDP Closes ECA-10623 See merge request ejbca/ejbca!380

ECA-10811: Modified the CertificateCrlReader so that status gets updated Closes ECA-10811 See merge request ejbca/ejbca!385

ECA-10853: Handle failure to load PKCS#11 library, to avoid partial lock-out from GUI Closes ECA-10853 See merge request ejbca/ejbca!408

…'main' ECA-10801: Modified IKB UI page to not display crypto tokens without Closes ECA-10801 See merge request ejbca/ejbca!394

Resolve ECA-10730 "Fb acme mode md issues" Closes ECA-10730 See merge request ejbca/ejbca!409

…atabase)

…b End Entity page

…nd Entity page

Conflicts: build.xml modules/build-properties.xml modules/build.xml modules/ejbca-rest-configdump/src/org/ejbca/ui/web/rest/api/resource/ConfigdumpRestResource.java

…eparate-edition Conflicts: build.xml

…oval-pages' into 'main' Resolve ECA-10786 "Fb extra fields in ra web end entity and approval pages" Closes ECA-10786 See merge request ejbca/ejbca!412

…'fb-ECA-10843-wsracli-stress-param-no-tests' ECA-10843: ctb EjbcaWsRaCli stress: Make available to specify the number of tests to be run See merge request ejbca/ejbca!403

For easier element selection in DOM.

…-existing CA

…ad' messages)

… title for 1st column)

…ypo) Noticket: English language files fixed (MS Auto-Enrollment: error of button) Noticket: English language files fixed (OAuth Key Management: typo, and spaces, for l10n people) Noticket: English language files fixed (ACME protocol: spaces, typo, format titles) Noticket: Dummy zz language file fixed (remove Id, UNIX End of Line) Noticket: English language files fixed (ACME protocol: minor typo) Noticket: English language files fixed (duplicated message, minor fixes) Noticket: English language files updated (CA Structure & CRLs: better title for 1st column)

…o l10n-en-english-main-7.10-2

…anslated) Noticket: French language files updated (OAuth Key Management: fully translated) Noticket: French language files updated (OAuth Key Management: fixes) Noticket: French language files updated (ACME protocol: spaces) Noticket: French language files updated (ACME protocol: fully translated) Noticket: French language files updated (Azure CRL Publisher) Noticket: French language files updated (Intune Certificate Revocation) Noticket: French language files updated (C-ITS certificates) Noticket: French language files updated (CA Structure & CRLs: 'Download' messages)

… l10n-fr-french-main-7.10-2

….10-2 L10n: English fixes for the main branch (based on 7.10.0.2)

…10-2 L10n: French updates for the main branch (based on 7.10.0.2) Fully translated

modules/ra-gui/src/org/ejbca/ra/EnrollWithRequestIdBean.java

SirTediousOfFoo · 2022-11-25T07:29:26Z

modules/ra-gui/src/org/ejbca/ra/EnrollWithRequestIdBean.java

- if(fileName == null){
- fileName = "certificatetoken";
- }
+ final String fileName = getFileName();


Removed filename handling logic outside of the downloadToken function

SirTediousOfFoo · 2022-11-25T07:31:03Z

modules/ra-gui/src/org/ejbca/ra/EnrollWithRequestIdBean.java

+ *
+ * @return the file name to use in the content disposition header, filename safe characters
+ */
+ private String getFileName() {


Added finemane handling function like the one in EnrollMakeNewRequestBean

primetomas · 2022-11-25T13:29:07Z

I'm not too fond of base64 as filename, it's very user non-friendly. Isn't there an apache string function somewhere that makes strings "filename" friendly?

primetomas · 2022-11-25T13:34:31Z

Ah I see, you just copied the behavior that was already part of another piece in EJBCA....

primetomas · 2022-11-25T13:50:19Z

Base64 encoding as the last resort is definitely a good way. One potential improvement, that could preserve some utf-8 characters would be to base64 encode it if it doesn't pass:
java.nio.file.Paths.get(filename);
If I understand it correctly it would check the filename for validity without doing any IO or anything costly like that.

SirTediousOfFoo · 2022-11-25T14:13:29Z

Uh yeah, It's unwieldy and I didn't really like it either but seeing as it was already there for a different enrollment option I went with it as the obvious solution.
Best solution I found with apache string stuff was just checking if it's ASCIIPrintable and then dropping the special characters but that also results in weirdness since then you get a file where Šandor Štefanović would get a cert named andor tefanovi.pem

I'll check out your suggestion and get back to you here, I'm looking at what else could be done

dobicinaitis · 2022-11-25T15:02:17Z

A nice solution could be replacing UTF-8 characters with corresponding ASCII ones - https://stackoverflow.com/a/4122207/19848036

SirTediousOfFoo · 2022-11-25T15:52:16Z

About the time you posted this I found https://commons.apache.org/proper/commons-lang/apidocs/org/apache/commons/lang3/StringUtils.html#stripAccents-java.lang.String- which eluded me at first, I'll test it out and update the PR although I wonder if the solution you posted would cover a wider range of edge cases

SirTediousOfFoo · 2022-11-25T17:20:41Z

Alright, I've switched up the filename logic so now it uses the Apache commons lang3 StringUtils which implements a stripAccents function that does exactly what I wanted to achieve here.

I tested out this change with the names Lars Űmlaöt d'Ăȯçěny Fųßßb¤łł and Stevo Štefanovčić which yielded a functional filename of Lars Umlaot dAoceny Fußßb¤ll.pem and Stevo Stefanovcic.pem respectively which make the Apache server happy enough.

If nothing else this'll remedy some issues for users with less-than-standard alphabets

SirTediousOfFoo · 2022-11-25T17:23:29Z

modules/ra-gui/src/org/ejbca/ra/EnrollWithRequestIdBean.java

@@ -621,7 +620,8 @@ private String getFileName() {
 if (StringUtils.isAsciiPrintable(commonName)) {
 return StringTools.stripFilename(commonName);
 }
- return Base64.encodeBase64String(commonName.getBytes());
+ return org.apache.commons.lang3.StringUtils.stripAccents(commonName);


The StringUtils from org.apache.commons.lang doesn't have a stripAccents method so in order to keep everything else in the code as it was I just went with using the lang3 StringUtils like this

Realiserad · 2022-12-11T16:07:26Z

Unfortunately, StringUtils.stripAccents does not guarantee that the output is ASCII printable, as demonstrated by the following unit test:

@Test
public void testTextNormalisation() {
    assertTrue(StringUtils.isAsciiPrintable(StringUtils.stripAccents("Test CA")));
    assertTrue(StringUtils.isAsciiPrintable(StringUtils.stripAccents("malmö.se")));
    assertTrue(StringUtils.isAsciiPrintable(StringUtils.stripAccents("Га́рри Ки́мович Каспа́ров")));
    assertTrue(StringUtils.isAsciiPrintable(StringUtils.stripAccents("Mǎ Yún")));
    assertTrue(StringUtils.isAsciiPrintable(StringUtils.stripAccents("马云")));
}

This will cause problems for anyone using Chinese or Russian characters in their CN. I don't know how common that is, but there are a couple of people in China using EJBCA.

The standards don't really say what to do, so non-ASCII filenames are a bit of a mess, but my understanding is that most (all?) modern web browsers supports UTF-8 character in the Content-Disposition header. The trick is to URL-encode the filename. This should be done in DownloadHelper.sendFile 🔗 link. For example:

final String encodedFilename = URLEncoder.encode(filename, StandardCharsets.UTF_8);
ec.setResponseHeader("Content-Disposition", "attachment; filename*=UTF-8''" + encodedFilename);

As a sidenote, I asked ChatGPT and it seems to agree, and provides some more details:

To create a Content-Disposition header with a filename containing a UTF-8 string in Java, you can use the URLEncoder.encode() method to encode the filename according to the rules specified in the RFC 6266. The resulting string can then be used as the value of the filename parameter in the Content-Disposition header. Here is an example:

import java.net.URLEncoder;                                         

String filename = "my file with åäö characters.pdf";                    
String encodedFilename = URLEncoder.encode(filename, "UTF-8");                                                                                String contentDisposition = "attachment; filename*=UTF-8''" +           
String contentDisposition = "attachment; filename*=UTF-8''" + encodedFilename;                                                  
System.out.println(contentDisposition);

This will output the following Content-Disposition header:

┌──────────────────────────────────────────────────────────────────────────┐ 
│ attachment;                                                              │ 
│ filename*=UTF-8''my%20file%20with%20%C3%A5%C3%A4%C3%B6%20characters.pdf  │ 
└──────────────────────────────────────────────────────────────────────────┘

This should work in most major browsers. Note that the actual behavior may vary depending on the browser and its settings.

To keep backwards compatibility with older browsers, one could look at the UserAgent string and fall back to the ugly, but reliable base64-encoding if needed. There are still some people using EJBCA with Internet Explorer, probably because that's what their smartcard solution supports.

Tanmoy Kundu and others added 30 commits July 6, 2022 11:49

Resolve ECA-10481 "Fb configdump protocol configs"

a096a72

Merge branch 'fb-ECA-10481-configdump-protocol-configs' into 'main'

c34129d

Resolve ECA-10481 "Fb configdump protocol configs" Closes ECA-10481 See merge request ejbca/ejbca!402

Merge branch 'fb-ECA-10793-fix-adminpreference-transactions' into 'main'

adb6d98

ECA-10793: Fix transaction handling in AdminPreferenceSessionBean Closes ECA-10793 See merge request ejbca/ejbca!405

ECA-10801: small change

0c23768

ECA-1062: Move variable (review comment)

d3f0f04

Merge branch 'fb-ECA-10443-Make-challenge-types-configurable-per-ACME…

9734397

…-alias' into 'main' ECA-10443 Added DNS identifier challenge type selection to ACME Closes ECA-10443 See merge request ejbca/ejbca!406

ECA-10730: review comments

9f2a0bc

Merge branch 'ECA-10623-CRL-dist-point' into 'main'

e393ec9

ECA-10623 Allow quotation marks in the CDP Closes ECA-10623 See merge request ejbca/ejbca!380

Merge branch 'fb-ECA-10811-certificate_crl_reader_fix' into 'main'

15b1429

ECA-10811: Modified the CertificateCrlReader so that status gets updated Closes ECA-10811 See merge request ejbca/ejbca!385

Merge branch 'fb-ECA-10853-cryptotoken-library-path-lockout' into 'main'

65b65e5

ECA-10853: Handle failure to load PKCS#11 library, to avoid partial lock-out from GUI Closes ECA-10853 See merge request ejbca/ejbca!408

ECA-10801: addressed review comments

bc137de

Merge branch 'fb-ECA-10801-Possible_to_create_ikb_without_keys' into …

cbdc9fa

…'main' ECA-10801: Modified IKB UI page to not display crypto tokens without Closes ECA-10801 See merge request ejbca/ejbca!394

Merge branch 'fb-ECA-10730-acme-mode-md-issues' into 'main'

29b9ea4

Resolve ECA-10730 "Fb acme mode md issues" Closes ECA-10730 See merge request ejbca/ejbca!409

ECA-10841: Turn KeyfactorEnrollUtils into interface + impl class

574783a

ECA-10839 Changed field types, annotations and form layout

17ce3e7

ECA-10841: Fix NPE when CA type is unknown (e.g. deploying CE on EE d…

2218377

…atabase)

ECA-10841: Hide toggle button for Proxy CA if not available

ef7ed93

ECA-10817: Some refactoring and code cleanup first.

ab06d73

ECA-10841: Add proxy-ca build variant and exclude in other variants

31041c6

ECA-10786: Added PSD2 Qualified Certificate statement fields to RA We…

433c81e

…b End Entity page

ECA-10786: Added CA/B Forum Organization Identifier field to RA Web E…

7badfbe

…nd Entity page

Merge branch 'main' into fb-ECA-10681-proxy-ca-epic

54b622f

Conflicts: build.xml modules/build-properties.xml modules/build.xml modules/ejbca-rest-configdump/src/org/ejbca/ui/web/rest/api/resource/ConfigdumpRestResource.java

Merge branch 'fb-ECA-10681-proxy-ca-epic' into fb-ECA-10841-proxyca-s…

e5c908d

…eparate-edition Conflicts: build.xml

Merge branch 'fb-ECA-10786-extra-fields-in-ra-web-end-entity-and-appr…

e982faf

…oval-pages' into 'main' Resolve ECA-10786 "Fb extra fields in ra web end entity and approval pages" Closes ECA-10786 See merge request ejbca/ejbca!412

Merge branch 'fb-ECA-10843-ctb-EjbcaWsRaCli-stress-nr-of-tests' into …

9bae56a

…'fb-ECA-10843-wsracli-stress-param-no-tests' ECA-10843: ctb EjbcaWsRaCli stress: Make available to specify the number of tests to be run See merge request ejbca/ejbca!403

NoTicket: Added a CSS id to some HTML elements.

d90c1b3

For easier element selection in DOM.

ECA-10839 Changed more field types.

6b5416e

ECA-10663:end entity notification over REST

3228e50

ECA-10865: Fix NPE during certificate search when authorized to a non…

49726e1

…-existing CA

ECA-10817: Add FQDN in the cert SAN.

e91cb35

dcarella and others added 12 commits November 16, 2022 18:11

Noticket: French language files updated (Intune Certificate Revocation)

61ac0d4

Noticket: French language files updated (C-ITS certificates)

dc422de

Noticket: English language files fixed (duplicated message, minor fixes)

5120c0a

Noticket: French language files updated (CA Structure & CRLs: 'Downlo…

ef1a896

…ad' messages)

Noticket: English language files updated (CA Structure & CRLs: better…

fd6c597

… title for 1st column)

Merge remote-tracking branch 'origin/l10n-en-english-main-7.10-2' int…

59d597c

…o l10n-en-english-main-7.10-2

Merge remote-tracking branch 'origin/l10n-fr-french-main-7.10-2' into…

2dd9efe

… l10n-fr-french-main-7.10-2

Merge pull request Keyfactor#166 from dcarella/l10n-en-english-main-7…

4262306

….10-2 L10n: English fixes for the main branch (based on 7.10.0.2)

Merge pull request Keyfactor#167 from dcarella/l10n-fr-french-main-7.…

782fc79

…10-2 L10n: French updates for the main branch (based on 7.10.0.2) Fully translated

changed cert issuing logic to match other other classes

134dd50

SirTediousOfFoo commented Nov 25, 2022

View reviewed changes

modules/ra-gui/src/org/ejbca/ra/EnrollWithRequestIdBean.java Outdated Show resolved Hide resolved

SirTediousOfFoo commented Nov 25, 2022

View reviewed changes

Dropped Base64Encode in favour of stripAccents

a450776

SirTediousOfFoo commented Nov 25, 2022

View reviewed changes

SirTediousOfFoo changed the title ~~Add base64 filename encoding to EnrollWithRequestIdBean~~ Handle non-ASCII charater filenames in EnrollWithRequestIdBean Nov 25, 2022

hesunmark force-pushed the main branch from 696b6c0 to bc595b1 Compare January 12, 2023 11:58

mike-agrenius-kushner force-pushed the main branch from 6a106af to ce7f5e7 Compare June 30, 2023 09:30

hesunmark force-pushed the main branch from 3f863c6 to 39b6c93 Compare December 7, 2023 12:29

hesunmark force-pushed the main branch from 641901b to fb6acdc Compare June 13, 2024 14:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle non-ASCII charater filenames in EnrollWithRequestIdBean #175

Handle non-ASCII charater filenames in EnrollWithRequestIdBean #175

SirTediousOfFoo commented Nov 24, 2022

SirTediousOfFoo Nov 25, 2022

SirTediousOfFoo Nov 25, 2022

primetomas commented Nov 25, 2022

primetomas commented Nov 25, 2022

primetomas commented Nov 25, 2022

SirTediousOfFoo commented Nov 25, 2022

dobicinaitis commented Nov 25, 2022

SirTediousOfFoo commented Nov 25, 2022

SirTediousOfFoo commented Nov 25, 2022

SirTediousOfFoo Nov 25, 2022

Realiserad commented Dec 11, 2022 •

edited

Loading

Handle non-ASCII charater filenames in EnrollWithRequestIdBean #175

Are you sure you want to change the base?

Handle non-ASCII charater filenames in EnrollWithRequestIdBean #175

Conversation

SirTediousOfFoo commented Nov 24, 2022

SirTediousOfFoo Nov 25, 2022

Choose a reason for hiding this comment

SirTediousOfFoo Nov 25, 2022

Choose a reason for hiding this comment

primetomas commented Nov 25, 2022

primetomas commented Nov 25, 2022

primetomas commented Nov 25, 2022

SirTediousOfFoo commented Nov 25, 2022

dobicinaitis commented Nov 25, 2022

SirTediousOfFoo commented Nov 25, 2022

SirTediousOfFoo commented Nov 25, 2022

SirTediousOfFoo Nov 25, 2022

Choose a reason for hiding this comment

Realiserad commented Dec 11, 2022 • edited Loading

Realiserad commented Dec 11, 2022 •

edited

Loading