rollup and start preparing for a 1.0 release #104

BurntSushi · 2022-07-04T15:51:04Z

Closes #106

I chose this because it's what appears to be the version of Rust that is in the current version of Debian stable (which is bullseye). I don't necessarily plan to always track Debian stable, but in the absence of any other constraint (like a strong desire to use something stabilized in a more recent Rust release), it seems like a fine place to sit.

I'm also trying to fix CI, which appears broken on macOS. The build fails with zero logs, so that means it's time to just start changing things until it works.

Allowing access to the buffer itself may make some use cases simpler when access to the original buffer after consuming a few lines is preferable. Without it, one would have to find a way to count the consumed bytes themselves. Closes #101
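To illustrate why exposing the remaining buffer helps, here is a minimal sketch. It assumes a hypothetical `SketchLines` type written against the standard library only; it is not bstr's actual line iterator, and it only splits on `\n`.

```rust
/// Hypothetical stand-in for a line iterator over bytes (not bstr's type).
/// It only recognizes `\n` as a line terminator.
#[derive(Clone, Debug)]
struct SketchLines<'a> {
    rest: &'a [u8],
}

impl<'a> SketchLines<'a> {
    fn new(bytes: &'a [u8]) -> Self {
        SketchLines { rest: bytes }
    }

    /// Returns the bytes that have not been consumed yet.
    fn as_bytes(&self) -> &'a [u8] {
        self.rest
    }
}

impl<'a> Iterator for SketchLines<'a> {
    type Item = &'a [u8];

    fn next(&mut self) -> Option<&'a [u8]> {
        if self.rest.is_empty() {
            return None;
        }
        match self.rest.iter().position(|&b| b == b'\n') {
            Some(i) => {
                // Yield the line without its terminator and skip past it.
                let line = &self.rest[..i];
                self.rest = &self.rest[i + 1..];
                Some(line)
            }
            None => {
                // Final line without a trailing terminator.
                let line = self.rest;
                self.rest = &[];
                Some(line)
            }
        }
    }
}
```

After reading a header line with `next()`, a caller can pass `as_bytes()` straight to another parser instead of summing the lengths of consumed lines and re-slicing the original buffer.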

This re-generates all of the Unicode DFAs with Unicode 14. Doing this update was a major pain:

1) Update ucd-generate to Unicode 14, cut release
2) Update regex-syntax to Unicode 14, cut release
3) Update ucd-generate to new regex-syntax, cut release
4) Update bstr by re-running Unicode generation shell script

*phew*

This is a more specific name given that we know we're dealing with a '&[u8]'.

Closes #96

Closes #97

Fixes #93, Closes #94

This implements 'ByteSlice::repeatn' with '<[u8]>::repeat'. 'repeat' on slices is stable since Rust 1.40.0 and does an exponential 'ptr::copy_nonoverlapping'. The implementation in 'alloc' also correctly handles the capacity overflow when 'len * n' is more than 'usize::MAX'. Closes #91
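As a rough sketch of the delegation described above, assuming a hypothetical free function named `repeatn` in place of the trait method:

```rust
/// Hypothetical free-function stand-in for `ByteSlice::repeatn`.
/// It simply delegates to the standard library's `<[u8]>::repeat`,
/// which panics if the resulting capacity would overflow.
fn repeatn(bytes: &[u8], n: usize) -> Vec<u8> {
    bytes.repeat(n)
}
```

For example, `repeatn(b"ab", 3)` yields the bytes of `ababab`, and repeating zero times yields an empty vector.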

Closes #85

Previously, they would just return 'Vec<u8>' as the error type. But a 'FromUtf8Error' gives strictly more information, so we return that instead. Closes #52
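The "strictly more information" point can be seen with the standard library alone. The sketch below uses `String::from_utf8` rather than bstr's OsStr/Path conversions, and the helper names are hypothetical: a `FromUtf8Error` still hands back the original bytes, and also reports where validation failed.

```rust
use std::string::FromUtf8Error;

/// Hypothetical helper: convert owned bytes to a `String`.
fn bytes_to_string(bytes: Vec<u8>) -> Result<String, FromUtf8Error> {
    String::from_utf8(bytes)
}

/// Hypothetical helper: extract both the number of valid leading bytes
/// and the original bytes from a failed conversion.
fn describe_failure(err: FromUtf8Error) -> (usize, Vec<u8>) {
    let valid_up_to = err.utf8_error().valid_up_to();
    (valid_up_to, err.into_bytes())
}
```

A caller that only wants the old behavior can still call `into_bytes()` on the error to recover the `Vec<u8>`.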

This commit enables bstr to build and test without a dependency on std. This change was mostly munging 'use' statements to prefer 'core' and 'alloc' and changing around some conditional compilation. This change also enables bstr to successfully test in 'no_std' and 'alloc' configurations. This includes doc tests. Previously, neither the library nor doc tests worked right if 'std' was disabled. Fixes #79, Closes #83

This doesn't change any behavior (I was wrong in #87), but instead clarifies that empty bytesets are valid and never match anything. Fixes #87
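A naive sketch of the semantics, assuming a hypothetical free function `find_byteset` built on the standard library (bstr's real search is implemented differently): with an empty byteset, no byte can be a member, so nothing ever matches.

```rust
/// Hypothetical, naive stand-in for a "find the first byte that is in
/// this set" search. Returns the index of the first matching byte.
fn find_byteset(haystack: &[u8], byteset: &[u8]) -> Option<usize> {
    haystack.iter().position(|b| byteset.contains(b))
}
```

Here `find_byteset(b"foo bar", b" ")` finds the space, while `find_byteset(b"foo bar", b"")` returns `None`.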

In a few places I must have got lazy when defining the iterator types and forced haystacks and needles/splitters to always have the same lifetime. This works in most cases, but #45 shows a case where it breaks down. To fix it, we just make sure we represent all of our lifetimes in our types. Note that #45 was reported against our split types, but we also have this issue with our find types too. Indeed, our split types are built on top of our find types, so we just fix everything. This is a breaking change since we are adding a new lifetime parameter to several public API types. Fixes #45
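The shape of the fix can be sketched with a hypothetical `SketchSplit` type (not bstr's actual split iterator) that carries one lifetime for the haystack and another for the splitter. The naive search and the non-empty splitter requirement are assumptions of this sketch.

```rust
/// Hypothetical split iterator. `'h` is the haystack lifetime and `'s`
/// is the splitter lifetime; they are deliberately independent.
struct SketchSplit<'h, 's> {
    rest: Option<&'h [u8]>,
    splitter: &'s [u8],
}

impl<'h, 's> SketchSplit<'h, 's> {
    fn new(haystack: &'h [u8], splitter: &'s [u8]) -> Self {
        // This sketch does not define behavior for empty splitters.
        assert!(!splitter.is_empty());
        SketchSplit { rest: Some(haystack), splitter }
    }
}

impl<'h, 's> Iterator for SketchSplit<'h, 's> {
    // Items borrow from the haystack only, never from the splitter.
    type Item = &'h [u8];

    fn next(&mut self) -> Option<&'h [u8]> {
        let rest = self.rest?;
        // Naive substring search over fixed-size windows.
        let found = rest
            .windows(self.splitter.len())
            .position(|w| w == self.splitter);
        match found {
            Some(i) => {
                self.rest = Some(&rest[i + self.splitter.len()..]);
                Some(&rest[..i])
            }
            None => {
                self.rest = None;
                Some(rest)
            }
        }
    }
}

/// Returns the first comma-separated field. The splitter is a local that
/// is dropped when this function returns, yet the result may outlive it
/// because it is tied to `'h` alone.
fn first_field<'h>(haystack: &'h [u8]) -> &'h [u8] {
    let splitter = vec![b','];
    let first = SketchSplit::new(haystack, &splitter).next();
    first.unwrap_or(haystack)
}
```

If both fields shared a single lifetime parameter, the items would be limited to the shorter of the two borrows, and `first_field` would be rejected by the borrow checker.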

Our pattern has been to group forward/reverse APIs together, but the split_once APIs were put between 'split_str' and 'rsplit_str'. So we move them both below 'rsplit_str'.

It currently uses 'char::is_whitespace', but this is more of an implementation detail. While 'char::is_whitespace' is available in 'core', it's plausible that we might use our own data some day. In particular, 'trim' already uses its own data. I believe this is the only routine that makes direct use of some kind of Unicode data that wasn't previously gated behind the 'unicode' feature. Ref #40

This switches over to using fewer 'use' statements. We don't go with the minimal number though, since I still find it useful to split 'use' statements into logical blocks: core, alloc, std, third party, crate.

It's nice to reserve 'pub' strictly for things that are part of the public API, as a way of making it easy to see which things are and aren't part of the API. I'm sure there are more 'pub' things that we should make 'pub(crate)', but this one stood out to me.

BurntSushi force-pushed the ag/work branch 4 times, most recently from 51405a9 to 1f77e5c Compare July 4, 2022 18:02

BurntSushi added the rollup label Jul 5, 2022

atouchet and others added 5 commits July 5, 2022 14:11

cargo: use SPDX license format

7dedaa4

Closes #106

doc: fix crates.io badge and update links

f409b56

ci: upgrade various 'actions'

8b89968

I'm also trying to fix CI, which appears broken on macOS. The build fails with zero logs, so that means it's time to just start changing things until it works.

api: add 'as_bytes()' to line iterators

a6a676e

Allowing access to the buffer itself may make some use cases simpler when access to the original buffer after consuming a few lines is preferable. Without it, one would have to find a way to count the consumed bytes themselves. Closes #101

BurntSushi force-pushed the ag/work branch from 1f77e5c to 4cdce09 Compare July 5, 2022 22:57

BurntSushi and others added 12 commits July 5, 2022 19:21

BREAKING: rename 'Bytes::as_slice' to 'Bytes::as_bytes'

7b992bd

This is a more specific name given that we know we're dealing with a '&[u8]'.

api: impl Clone and Debug for Lines and LinesWithTerminator

7d2a23a

Closes #96

api: impl DoubleEndedIterator and FusedIterator for line iterators

9b57bd3

Closes #97

api: impl From<&BStr> for &[u8]

c5f06c9

Fixes #93, Closes #94

api: add ByteSlice::r?split_once_str

50856f7

Closes #85

BREAKING: OsStr/Path methods now use FromUtf8Error

f5ce5e3

Previously, they would just return 'Vec<u8>' as the error type. But a 'FromUtf8Error' gives strictly more information, so we return that instead. Closes #52

doc: add examples using an empty byteset

90749c5

This doesn't change any behavior (I was wrong in #87), but instead clarifies that empty bytesets are valid and never match anything. Fixes #87

doc: move 'r?split_once_str' routines

3256d07

Our pattern has been to group forward/reverse APIs together, but the split_once APIs were put between 'split_str' and 'rsplit_str'. So we move them both below 'rsplit_str'.

BurntSushi force-pushed the ag/work branch from 4cdce09 to a73aab1 Compare July 5, 2022 23:46

BurntSushi force-pushed the ag/work branch from a73aab1 to 1d92c84 Compare July 5, 2022 23:51

BurntSushi added 3 commits July 5, 2022 20:17

style: switch import style

3a6ca7a

This switches over to using fewer 'use' statements. We don't go with the minimal number though, since I still find it useful to split 'use' statements into logical blocks: core, alloc, std, third party, crate.

doc: updates for 1.0 release

5c57bfa

BurntSushi merged commit 0d9d222 into master Jul 6, 2022

BurntSushi deleted the ag/work branch July 6, 2022 00:40

This was referenced Jul 6, 2022

RFC: 1.0 release? #40

Closed

Fix crates.io badge and update links #100

Closed
