MODLD-586: LCCN Normalization | Auto-add white spaces by askhat-abishev · Pull Request #64 · folio-org/mod-linked-data

askhat-abishev · 2024-12-02T12:16:09Z

No description provided.

src/main/java/org/folio/linked/data/validation/dto/LccnPatternValidator.java

src/test/java/org/folio/linked/data/preprocessing/lccn/SpaceAdderStructureaTest.java

pkjacob · 2024-12-04T17:42:24Z

src/main/java/org/folio/linked/data/preprocessing/lccn/LccnNormalizer.java

+package org.folio.linked.data.preprocessing.lccn;
+
+@FunctionalInterface
+public interface LccnNormalizer<T> {


Do we need to overcomplicate this by adding <T>? LCCN is always going to be a string. So, instead of T, we can hardcode the type as String right? Or do you foresee any need to support other datatypes for LCCN in future?

Yes, I thought that in future we can normalize something other than just String. It should have been called just Normalizer but I forgot to rename it properly.

pkjacob · 2024-12-04T19:03:29Z

src/main/java/org/folio/linked/data/validation/dto/LccnPatternValidator.java

+
+  public LccnPatternValidator(SpecProvider specProvider, List<LccnNormalizer<String>> lccnNormalizers) {
+    this.specProvider = specProvider;
+    this.lccnNormalizer = lccnNormalizers.stream().reduce(LccnNormalizer.identity(), LccnNormalizer::andThen);


Hi @askhat-abishev,
Although this code will work, I think we can improve it. Currently, we first apply the StructureA normalizer and then pass the output to the StructureB normalizer (or vice versa). While this approach might work in this case, it isn’t logically correct. We should apply only one of the normalizers, determined by the pattern of the incoming LCCN.

So, I think a more clean structure for LccnNormalizer is as follows

public interface LccnNormalizer extends Predicate<String>, UnaryOperator<String> { // Return normalized LCCN value if it is valid, otherwise return empty Optional default Optional<String> normalize(String t) { if (this.test(t)) { // In `test` method, check if LCCN's pattern match corresponding regex return Optional.of(this.apply(t)); // Do actual normalization in `apply` method. } return Optional.empty(); } }

Then apply the normalization in LccnPatternValidator as follows

private String normalize(String lccn) { return lccnNormalizers .stream() .flatMap(normalizer -> normalizer.normalize(lccn).stream()) .findFirst() .orElse(lccn); }

What do you think?

pkjacob · 2024-12-04T19:08:34Z

src/main/java/org/folio/linked/data/preprocessing/lccn/impl/AbstractSpaceAdder.java

+import java.util.regex.Pattern;
+import org.folio.linked.data.preprocessing.lccn.LccnNormalizer;
+
+public abstract class AbstractSpaceAdder implements LccnNormalizer<String> {


minor - AbstractLccnNormalizer is a better name I think.
Similarly, LccnNormalizerStructrueA and LccnNormalizerStructrueB

sonarqubecloud · 2024-12-05T14:23:22Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
91.5% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

askhat-abishev requested review from AndreiBordak, PBobylev and pkjacob December 2, 2024 12:16

askhat-abishev self-assigned this Dec 2, 2024

PBobylev reviewed Dec 3, 2024

View reviewed changes

src/main/java/org/folio/linked/data/validation/dto/LccnPatternValidator.java Outdated Show resolved Hide resolved

src/test/java/org/folio/linked/data/preprocessing/lccn/SpaceAdderStructureaTest.java Outdated Show resolved Hide resolved

askhat-abishev force-pushed the MODLD-586 branch from fd1a425 to 5954251 Compare December 4, 2024 10:57

askhat-abishev requested a review from PBobylev December 4, 2024 11:30

AndreiBordak approved these changes Dec 4, 2024

View reviewed changes

pkjacob reviewed Dec 4, 2024

View reviewed changes

askhat-abishev requested a review from pkjacob December 5, 2024 12:07

pkjacob approved these changes Dec 5, 2024

View reviewed changes

askhat-abishev added 3 commits December 5, 2024 18:59

MODLD-586: LCCN Normalization | Auto-add white spaces

c27a5a4

MODLD-586: fix review remarks

a1c6d4d

MODLD-586: fix review remarks

b5bcf2b

askhat-abishev force-pushed the MODLD-586 branch from e395f92 to b5bcf2b Compare December 5, 2024 13:59

askhat-abishev merged commit 4000b75 into master Dec 5, 2024

askhat-abishev deleted the MODLD-586 branch December 5, 2024 14:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MODLD-586: LCCN Normalization | Auto-add white spaces#64

MODLD-586: LCCN Normalization | Auto-add white spaces#64
askhat-abishev merged 3 commits intomasterfrom
MODLD-586

askhat-abishev commented Dec 2, 2024

Uh oh!

Uh oh!

Uh oh!

pkjacob Dec 4, 2024 •

edited

Loading

Uh oh!

askhat-abishev Dec 5, 2024

Uh oh!

pkjacob Dec 4, 2024 •

edited

Loading

Uh oh!

askhat-abishev Dec 5, 2024

Uh oh!

pkjacob Dec 4, 2024

Uh oh!

askhat-abishev Dec 5, 2024

Uh oh!

sonarqubecloud bot commented Dec 5, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

askhat-abishev commented Dec 2, 2024

Uh oh!

Uh oh!

Uh oh!

pkjacob Dec 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

askhat-abishev Dec 5, 2024

Choose a reason for hiding this comment

Uh oh!

pkjacob Dec 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

askhat-abishev Dec 5, 2024

Choose a reason for hiding this comment

Uh oh!

pkjacob Dec 4, 2024

Choose a reason for hiding this comment

Uh oh!

askhat-abishev Dec 5, 2024

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Dec 5, 2024

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pkjacob Dec 4, 2024 •

edited

Loading

pkjacob Dec 4, 2024 •

edited

Loading