Ere 398 #22

xbarrelet · 2021-06-18T06:51:08Z

Practitioner information is now carried from extraction through to the bundle, and this works with the DENS pdf1 and both PDFs from CGM. Tests for the parser and ExtractionToBundleWorkflowTest are completed.

Also fixed some fields (including birthdate and authoredOn) in RegexParser. Last time I fixed them in Muster16SvgExtractorParser because I didn't know the RegexParser existed. Javadoc added in the extractor parser to indicate it's deprecated.

I've attached to the Jira task the test1.pdf with the number removed from the practitioner name, as we shouldn't support numbers in this field (or should we?). I couldn't push to the secret repo, so just add it to the same folder as test1.pdf.


cartel1 · 2021-06-18T16:31:58Z

This looks OK to me.

@OmarMalass please confirm so we can merge this. Thanks.

OmarMalass · 2021-06-19T00:11:05Z

.../java/health/ere/ps/service/muster16/parser/rgxer/delegate/pattern/PractitionerPatterns.java

+import java.util.regex.Pattern;
+
+public class PractitionerPatterns extends Patterns {
+ public final Pattern NAME_PREFIX = Pattern.compile("(Dr)\\.");


Other prefixes are possible ("med.", "prof."), and the prefix can be composite, e.g. "Dr. med."

OK, support for them added.
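For illustration, here is a minimal sketch of how a composite prefix such as "Dr. med." could be matched. It is not the pattern that was actually committed: the class name `NamePrefixSketch`, the helper methods, and the token list (`Prof`, `Dr`, `med`, `dent`) are assumptions made for this example only.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch, not the project's PractitionerPatterns class.
class NamePrefixSketch {

    // One or more title tokens, each followed by a dot and optional whitespace.
    // The token list is an assumption; extend it as real documents require.
    static final Pattern NAME_PREFIX =
            Pattern.compile("^\\s*((?:(?:Prof|Dr|med|dent)\\.\\s*)+)");

    // Returns the full (possibly composite) prefix, e.g. "Dr. med.".
    static Optional<String> extractPrefix(String nameLine) {
        Matcher m = NAME_PREFIX.matcher(nameLine);
        return m.find() ? Optional.of(m.group(1).trim()) : Optional.empty();
    }

    // Returns the name line with any leading prefix removed.
    static String stripPrefix(String nameLine) {
        return NAME_PREFIX.matcher(nameLine).replaceFirst("").trim();
    }

    public static void main(String[] args) {
        System.out.println(extractPrefix("Dr. med. Hans Meier").orElse("<none>"));
        System.out.println(stripPrefix("Dr. med. Hans Meier"));
        System.out.println(extractPrefix("Hans Meier").orElse("<none>"));
    }
}
```

Capturing the whole run of tokens in one group keeps the prefix in a single field, so the remaining text can be passed to the name parsing unchanged.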

OmarMalass · 2021-06-19T00:14:20Z

.../java/health/ere/ps/service/muster16/parser/rgxer/delegate/pattern/PractitionerPatterns.java

+public class PractitionerPatterns extends Patterns {
+ public final Pattern NAME_PREFIX = Pattern.compile("(Dr)\\.");
+ public final Pattern NAME_LINE = Pattern.compile("(Dr\\.)*([a-z A-Z]+)(-){0,1}([a-z A-Z]+)");
+ public final Pattern STREET_LINE = Pattern.compile("^[a-z A-Zß]+(\\.{1})?.*[0-9]+");


There are German characters that won't be captured by the letter pattern. You can use \w instead.
I propose using \s to capture whitespace instead of a literal space.

\w actually matches only [A-Za-z0-9_] by default, so it doesn't cover German characters either. I've added äöüÄÖÜß to the list of accepted letters; that will do the trick :)
I don't think we should match arbitrary whitespace (\s) here, as such characters would cause errors or weird behavior in our parsing.
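A small sketch of the difference discussed above, using only `java.util.regex`. The class name `GermanLetterSketch` and the constant names are made up for this example. The explicit character list mirrors the fix described in the reply, while `\p{L}` and `Pattern.UNICODE_CHARACTER_CLASS` are shown only as possible alternatives, not as what the PR uses. The umlauts are written as Unicode escapes so the file compiles regardless of source encoding.

```java
import java.util.regex.Pattern;

// Hypothetical sketch comparing ways to match German letters.
class GermanLetterSketch {

    // Default \w is ASCII-only in Java: [a-zA-Z_0-9].
    static final Pattern ASCII_WORD = Pattern.compile("\\w+");

    // Explicit list: a-z, A-Z plus a/o/u umlauts (both cases) and sharp s.
    static final Pattern EXPLICIT_GERMAN =
            Pattern.compile("[a-zA-Z\u00e4\u00f6\u00fc\u00c4\u00d6\u00dc\u00df]+");

    // Alternative: any Unicode letter.
    static final Pattern ANY_LETTER = Pattern.compile("\\p{L}+");

    // Alternative: make \w Unicode-aware via a flag (also admits digits and _).
    static final Pattern UNICODE_WORD =
            Pattern.compile("\\w+", Pattern.UNICODE_CHARACTER_CLASS);

    public static void main(String[] args) {
        String name = "M\u00fcller"; // "Mueller" spelled with u-umlaut
        System.out.println(ASCII_WORD.matcher(name).matches());
        System.out.println(EXPLICIT_GERMAN.matcher(name).matches());
        System.out.println(ANY_LETTER.matcher(name).matches());
        System.out.println(UNICODE_WORD.matcher(name).matches());
    }
}
```

The explicit list is the narrowest option and still rejects digits, which matches the earlier remark that numbers should not be supported in the name field.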

OmarMalass · 2021-06-19T01:05:37Z

...e/ps/service/muster16/parser/rgxer/delegate/practitioner/PractitionerEntryParseDelegate.java

+ matchAndExtractLine(lines, patterns.PHONE_LINE).ifPresent(this::parsePhoneNumber);
+ matchAndExtractLine(lines, patterns.CITY_LINE).ifPresent(this::parseAddressLine);
+ matchAndExtractLine(lines, patterns.STREET_LINE).ifPresent(this::parseStreetLine);
+ matchAndExtractLine(lines, patterns.NAME_LINE).ifPresent(this::parseNames);


Can we also consider extracting the doctor's qualifications? Is there a set of possible values?

You mean the type of doctor, like dentist or general practitioner? I'd say let's wait until we actually need it, so we have a clearer idea of what we need to support. I've heard that dentists will be one of our main users soon.

xbarrelet added 4 commits June 17, 2021 17:46

End of Day intermediary commit, not done yet.

b6d2b55

Added practitioner info into bundle + fixed the date including birthdate and authoredOn. Now works fine with Dens pdf.

1df7981

Now also working with both pdfs from CGM, tests completed

abf33d1

Updated the pull request CI pipeline as only the push one works with all tests

bc43936

OmarMalass reviewed Jun 19, 2021


Better regex to match German special characters and more prefixes

8383404

xbarrelet merged commit ea38e3d into main Jun 21, 2021

xbarrelet deleted the ERE-398 branch June 28, 2021 10:44
