Skip to content

testthat tests fail with the new stringi using ICU 63.1 #1604

@gagolews

Description

@gagolews

Hi there!

Running against the most recent stringi 1.3.1 (devel version https://github.com/gagolews/stringi) gives:

══ Failed ══════════════════════════════════════════════════════════════════════
── 1. Failure: summary.character works with character objects (#1285) (@test-sum
as.character(summary(txt)) not equal to c("2", "character", "character").
Lengths differ: 2 is not 3

── 2. Failure: tokens works for strange spaces (#796) (@test-tokens.R#301)  ────
ntoken(txt, remove_punct = FALSE, remove_separators = FALSE) not equal to c(text1 = 18).
1/1 mismatches
[1] 17 - 18 == -1

── 3. Failure: tokens works for strange spaces (#796) (@test-tokens.R#302)  ────
as.character(tokens(txt, remove_punct = FALSE, remove_separators = FALSE))[16:18] not equal to c("variationselector16", " ", ".").
3/3 mismatches
x[1]: " "
y[1]: "variationselector16"

x[2]: "."
y[2]: " "

x[3]: NA
y[3]: "."

── 4. Failure: tokens works for strange spaces (#796) (@test-tokens.R#306)  ────
ntoken(txt, remove_punct = TRUE, remove_separators = FALSE) not equal to c(text1 = 16).
1/1 mismatches
[1] 15 - 16 == -1

── 5. Failure: tokens works for strange spaces (#796) (@test-tokens.R#310)  ────
as.character(tokens(txt, remove_punct = TRUE, remove_separators = FALSE))[15:16] not equal to c("variationselector16", " ").
2/2 mismatches
x[1]: " "
y[1]: "variationselector16"

x[2]: NA
y[2]: " "

This happens on a Ubuntu 18.10 system with libicu-dev version 63.1 (stringi compiled against system ICU)

Metadata

Metadata

Assignees

Labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions