Subsetting changes document names #2127

koheiw · 2021-08-12T06:21:59Z

Here, I am selecting the second and third sentences (.2 and .3) by name, but the resulting documents are renamed to .1 and .2.

> require(quanteda)
> corp <- corpus_reshape(data_corpus_inaugural)
> corp[c("1789-Washington.2", "1789-Washington.3")]
Corpus consisting of 2 documents and 4 docvars.
1789-Washington.1 :
"On the one hand, I was summoned by my Country, whose voice I..."

1789-Washington.2 :
"On the other hand, the magnitude and difficulty of the trust..."

> corpus_subset(corp, docnames(corp) %in% c("1789-Washington.2", "1789-Washington.3"))
Corpus consisting of 2 documents and 4 docvars.
1789-Washington.1 :
"On the one hand, I was summoned by my Country, whose voice I..."

1789-Washington.2 :
"On the other hand, the magnitude and difficulty of the trust..."
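
The expected behavior could be checked with something like the following. This is only a minimal sketch: the variable names (`target`, `sub_index`, `sub_subset`, `names_kept`) are illustrative, and it assumes that subsetting should leave the selected document names unchanged. It uses only the functions already shown above plus `docnames()` and `identical()`.

```r
library(quanteda)

# Reshape the inaugural corpus into sentences, as in the report above.
corp <- corpus_reshape(data_corpus_inaugural, to = "sentences")

# Document names selected in the report (illustrative variable name).
target <- c("1789-Washington.2", "1789-Washington.3")

# Subset in the two ways shown above.
sub_index  <- corp[target]
sub_subset <- corpus_subset(corp, docnames(corp) %in% target)

# TRUE if subsetting kept the original names, FALSE if they were renumbered.
names_kept <- c(
  index  = identical(docnames(sub_index), target),
  subset = identical(docnames(sub_subset), target)
)
print(names_kept)
```

With the behavior reported here, both comparisons would presumably come out FALSE, since the documents are renamed to .1 and .2.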

koheiw added the bug label Aug 12, 2021

koheiw self-assigned this Aug 12, 2021

koheiw mentioned this issue Aug 12, 2021

Issue 2127 #2128

Merged

kbenoit closed this as completed Aug 14, 2021
