Tracking issue for string patterns #27721

alexcrichton · 2015-08-12T17:59:54Z

(Link to original RFC: rust-lang/rfcs#528)

This is a tracking issue for the unstable pattern feature in the standard library. We have many APIs which support the ability to search with any number of patterns generically within a string (e.g. substrings, characters, closures, etc), but implementing your own pattern (e.g. a regex) is not stable. It would be nice if these implementations could indeed be stable!

Some open questions are:

Have these APIs been audited for naming and consistency?
Are we sure these APIs are as conservative as they need to be?
Are we sure that these APIs are as performant as they can be?
Are we sure that these APIs can be used to implement all the necessary forms of searching?

cc @Kimundi

The text was updated successfully, but these errors were encountered:

bluss · 2015-08-16T20:08:16Z

String Patterns RFC tracking issue: #22477

Stabilization not only impacts implementing the pattern traits, but also of course detailed use of the Searcher trait, .next_match(), .next_reject() and so on.

jneem · 2015-09-17T19:16:52Z

I'm having trouble seeing the purpose of the next() method; in all the examples I look at, it's faster and cleaner to just implement next_match() and next_reject() individually. For example, these two functions can be implemented for CharEqSearcher as one-liners using Iterator::find. Moreover, if you want an optimized implementation of Searcher for an ASCII char then in implementing next() you need to choose between an implementation that returns rejections quickly and an implementation that skips quickly over the input (e.g. using SIMD) in order to quickly find a match.

bluss · 2015-09-17T19:32:43Z

@jneem the StrSearcher uses the same code for next() and next_match(), but specialized for each case, so that the next_match() case is much faster. It works well.

jneem · 2015-09-17T19:57:20Z

@bluss I think that helps make my point: AFAICT, all the users of Searcher only call next_match() and next_reject(). Therefore, the only purpose of next() AFAICT is to make implementing Searcher easier -- you only need to implement one function instead of two. But that benefit isn't born out in practice. StrSearcher implements two methods anyway, and it would be simpler to implement next_reject() instead of next(). CharEqSearcher implements next() only, but it would be simpler and cleaner to optimize if it implemented next_match() and next_reject() instead.

jneem · 2015-09-18T09:52:24Z

Oops, I missed one: Pattern uses Searcher::next() for the default implementation of is_prefix_of.

mkpankov · 2015-11-17T16:21:18Z

This is useful, I'd like to use it on stable.

bluss · 2015-11-17T16:38:33Z

cc @BurntSushi, Regex is the major pattern user and it would be great for everyone if it was stable because of that.

bluss · 2015-11-17T16:39:19Z

@BurntSushi Are the Pattern traits sufficient for Regex? Any design issues?

BurntSushi · 2015-11-17T16:50:19Z

I wrote the impl for Regex a while back, which does indeed only implement next: https://github.com/rust-lang-nursery/regex/blob/master/src/re.rs#L1104 I seem to recall that it was tricky to get the corner cases right, but that isn't too surprising. I otherwise found it pretty straight forward.

One thing that I don't think is representable with the Pattern API is fast retrieval of all matches from a suffix table. In particular, matches are reported as part of lexicographically sorted suffixes rather than order of occurrence in the string, so it can't satisfy the contract of Searcher without an additional cost. (I've already talked to @Kimundi about this, and I don't think it's a major issue. Suffix tables are a pretty niche data structure.)

aturon · 2016-01-27T19:08:26Z

Nominating for discussion. Not sure whether this is ready for stabilization, but I'd like for the team to dig into it a bit.

alexcrichton · 2016-01-29T07:33:33Z

Unfortunately the libs team didn't get a chance to talk about this in terms of stabilization for 1.8, but I'm going to leave the nominated tag as I think we should chat about this regardless.

Kimundi · 2016-02-17T21:11:31Z

Today I investigated a bit what would need to be done to generalize the Pattern API to arbitrary slice types. As part of that I also took a more pragmatic route to the involved types and interfaces based on these assumptions:

Generally, the API should suit the std library, with more complex scenarios being handled by external libraries.
Thus, providing the same feature set for libstd slice types like str, [T] or OsStr is more important than having a API that can accommodate the most general kind of "slice type".
Having an more simple to maintain API suitable for the existing consumers like find, split and match_indices is a higher priority than having a more complicated API that could accommodate exotic consumers.
Being able to reuse the API for mutable slices is worth a moderate increase in API verbosity and unsafety-based interface surfaces.

A very rough sketch of the result can be seen below. Note that this is only the Pattern traits itself, without actual integration in the std lib or comprehensive re implementation of the existing types.

https://github.com/Kimundi/pattern_api_sketch/blob/master/src/v5.rs

The core changes are:

The Pattern trait now has an input parameter for the slice type. This turns Pattern<'a> into Pattern<&'a str> and allows for Pattern<&'a mut [T]>.
next() got removed, leaving only next_{match/reject}(). This has been done because none of the existing types could make use of the return value of next(), and none of the existing implementations benefited from needing to implement it.
- As part of that change, the Pattern::is_{prefix/suffix}_of() methods are no longer default-implemented. But seeing how the manual implementation for a given pattern-slice combination is usually simple and straight forward, this does not appear to be a problem.
In order to allow for mutable slices and shared code of general iterators like match_indices between slice types,
the return values of next_*() are now associated types of the slice type.
They have an API that allows them to be used as abstract raw pointer types, which means generic code can use them relatively freely to create slices at or between matches, at the cost of requiring unsafe code and needing to follow aliasing rules.

Changes in regard to the open questions:

In addition to the existing interface, the new SearchPtrs trait has gained new ad-hoc named elements which would require additional name audition. Also, Pattern continues to be a confusing name in a language with pattern matching...
The sketched API change is somewhat more conservative by getting rid of the next() methods,
concentrating instead on the pure search loops.
The prior point should also make the performance aspect more validateable.
However, should the existing slice type-specific iterators types like split() be replaced by shared, generic ones there might be some regressions due to optimizations possibly not carrying over. This is optional though.
We should probably specify the Pattern API more closely as only being intended for iterator-like linear search operations.

bluss · 2016-02-18T19:20:57Z

Is the focus of Pattern for slices going to be specific for &[u8] only? I think that's ok, I'm unsure how to really extend "substring search" further to generic &[T]. Technically, the two-way algorithm that we use for str::find(&str) can be made to work on any ordered alphabet, i.e. T: Ord is enough, but I don't know how performant or realistic this is.

Kimundi · 2016-02-18T23:47:45Z

@bluss: I only used &[u8] as a quick proof of concept that the traits work with different slice types. I'm assuming that there is some sensible way to make it work with generic slices.

The &[u8] case seems like a good candidate for a specialization impl though.

DanielKeep · 2016-03-04T14:11:39Z

I'm currently trying out some runtime scanners for scan-rules. Basically: I want users to be able to parse text based on Patterns (e.g. accept anything that matches this pattern, consume input until this pattern is matched, etc.). The current example is being able to do something like (where until accepts a Pattern):

let_scan!("before:after", (let word_0 <| until(":"), ":", let word_1: Everything));
assert_eq!(word_0, "before");
assert_eq!(word_1, "after");

The problem I'm having is that it's really painful to actually do this. The interface as it exists seems to assume that ownership of the pattern can be passed to the method which will use it, and as a result, can only be used exactly once. This doesn't make much sense to me. Searching for a pattern should not (in general) consume the pattern.

What I want is P: Pattern ⇒ for<P: Pattern> &P: Pattern. Currently, it's only true for &str. If I had this, I could take ownership of the pattern from the caller, store it, and loan it to the underlying find calls. If the caller wants to use the pattern in multiple places, and it's expensive to construct, they can instead pass my function a borrow, which will also work.

The more I think about this, the more it comes down to the FnMut implementation. The only closure kind that would allow for the kind of interface I want is Fn... but that excludes closures that test based on accumulated state, though I wonder how often that's even desireable.

I can work around this by requiring Copy patterns, but again, callables are the most useful kind of pattern (until(char::is_whitespace), etc.), so it seems a deep shame to exclude them.

DanielKeep · 2016-03-04T14:38:18Z

Oh, another thing I just realised: there doesn't appear to be any way to find out how long a match is given a pattern and, say, str::starts_with.

bluss · 2016-03-04T14:39:58Z

You can use .next_reject for that.

withoutboats · 2016-05-15T01:07:09Z

Hey, I don't know a lot about this API, but I noticed that StrSearcher is not a double ended pattern, but CharSliceSearcher is. Is this an error? The explanation of why StrSearcher is not double ended seems to apply equally well to CharSliceSearcher:

(&str)::Searcher is not a DoubleEndedSearcher because the pattern "aa" in the haystack "aaa" matches as either "[aa]a" or "a[aa]", depending from which side it is searched.

bluss · 2016-05-15T11:12:13Z

@withoutboats A slice of chars represents a set of possibilities, so it's not like a string; either of the chars can be matched by themselves.

withoutboats · 2016-05-15T11:17:46Z

@bluss That makes sense. The semantics of CharSliceSearcher are not documented; I assumed the char slice was treated as an ordered sequence rather than a set.

Stebalien · 2016-08-13T19:57:28Z

Can we say that, after returning Done once, a Searcher must continue to return Done forever more? That is, can I make this assumption when implementing FusedIterator?

Implement 1581 (FusedIterator) * [ ] Implement on patterns. See #27721 (comment). * [ ] Handle OS Iterators. A bunch of iterators (`Args`, `Env`, etc.) in libstd wrap platform specific iterators. The current ones all appear to be well-behaved but can we assume that future ones will be? * [ ] Does someone want to audit this? On first glance, all of the iterators on which I implemented `FusedIterator` appear to be well-behaved but there are a *lot* of them so a second pair of eyes would be nice. * I haven't touched rustc internal iterators (or the internal rand) because rustc doesn't actually call `fuse()`. * `FusedIterator` can't be implemented on `std::io::{Bytes, Chars}`. Closes: #35602 (Tracking Issue) Implements: rust-lang/rfcs#1581

Implement 1581 (FusedIterator) * [ ] Implement on patterns. See rust-lang/rust#27721 (comment). * [ ] Handle OS Iterators. A bunch of iterators (`Args`, `Env`, etc.) in libstd wrap platform specific iterators. The current ones all appear to be well-behaved but can we assume that future ones will be? * [ ] Does someone want to audit this? On first glance, all of the iterators on which I implemented `FusedIterator` appear to be well-behaved but there are a *lot* of them so a second pair of eyes would be nice. * I haven't touched rustc internal iterators (or the internal rand) because rustc doesn't actually call `fuse()`. * `FusedIterator` can't be implemented on `std::io::{Bytes, Chars}`. Closes: #35602 (Tracking Issue) Implements: rust-lang/rfcs#1581

shahn · 2017-04-12T13:55:15Z

I'm a big fan of extending this to be more generic than just str, needed/wanted it for [u8] quite frequently. Unfortunately, there's an API inconsistency already - str::split takes a Pattern, whereas slice::split takes a predicate function that only looks at a single T in isolation.

TonalidadeHidrica · 2021-11-25T01:28:58Z

Pattern is implemented for FnMut(char) -> bool. Is it allowed that this function returns different values for the same input? Specifically, say that we want to strip at most three a's from a string, then currently we may prepare a counter so that it returns true for input a while the counter is less than 3 or otherwise false. Is it an intended use? Particularly, is it guaranteed that FnMut(char) -> bool function is called for each character in order?

jhpratt · 2021-11-25T01:36:13Z

@TonalidadeHidrica I wouldn't rely on that myself. To me that's not an idiomatic use case; I'd expect the function to be pure.

Luro02 · 2021-11-25T08:30:44Z

@TonalidadeHidrica I wouldn't rely on that myself. To me that's not an idiomatic use case; I'd expect the function to be pure.

Why has this trait been implemented for FnMut? If the function should be pure, then wouldn't it be better to only implement it for Fn?

From the docs of FnMut:

Use FnMut as a bound when you want to accept a parameter of function-like type and need to call it repeatedly, while allowing it to mutate state. If you don’t want the parameter to mutate state, use Fn as a bound; if you don’t need to call it repeatedly, use FnOnce.

https://doc.rust-lang.org/std/ops/trait.FnMut.html

Luro02 · 2021-11-25T08:32:22Z

This seems like something that should be documented in the docs of the Pattern trait.

TonalidadeHidrica · 2021-11-25T08:40:05Z

I agree that this should be documented. I guess why this function is Fn instead of FnMut is that we may sometimes use something like "cache" of a large database of chars, the update of which requires mutations. The issue of use of FnMut is somewhat similar to "Iterator::map accepts FnMut, then can I update an external counter during the iteration of map? No, it's discouraged."

Mingun · 2021-11-25T09:49:54Z

Would it be better to ban the mutation, and if it is needed, use the internal mutation (Cell, etc.)?

jhpratt · 2021-11-25T10:30:42Z

Not possible due to pack-compatibility. Pattern, while unstable, is exposed in some stable APIs.

lizelive · 2022-08-10T03:30:26Z

what is the holdup for making this stable?

ckaran · 2022-09-22T14:33:05Z

I'm a little confused by the API; the API docs for std::str::pattern::Searcher state that:

This trait provides methods for searching for non-overlapping matches of a pattern starting from the front (left) of a string.

I see the following possible interpretations of this, and I want to be sure which is in use to prevent any ambiguity in implementations.

Greedy approach

If you're looking for the string aa in the haystack aaaaa, then you'll always get a sequence like the following if you require a greedy search:

Match(0,2)
Match(2,4)
Reject(4,5)
Done

Starts at the start of the string, but skips some letters because why not?

Greedy is overrated. Let's skip the first letter and match on the rest!

Reject(0,1)
Match(1,3)
Match(3,5)
Done

Getting the most matches is so overrated, how about we skip some?

The API definition doesn't require that the maximal number of matches be returned, so we could just ignore some matching sub-strings.

Reject(0,1)
Reject(1,2)
Match(2,4)
Reject(4,5)
Done

Suggestions for documentation improvements.

I'd like to suggest that matching is always greedy and always maximal. Roughly the following pseudo-code (don't use this in production, it will overflow your stack, and finite state machines are faster anyways):

// `results` is empty when this function is first called.
// `usize` is 0 when first called.
fn string_matcher(pattern: &str, haystack: &str, results: &mut Vec<SearchStep>, index: usize) {
    if haystack.len() < pattern.len() {
        if haystack.len() > 0 {
            results.push(SearchStep::Reject(index, index + haystack.len()));
        }
        results.push(SearchStep::Done);
    } else if pattern == &haystack[0..pattern.len()] {
        results.push(SearchStep::Match(index, index + pattern.len()));
        string_matcher(
            pattern,
            &haystack[pattern.len()..],
            results,
            index + pattern.len(),
        );
    } else {
        results.push(SearchStep::Reject(index, index + 1));
        string_matcher(pattern, &haystack[1..], results, index + 1);
    }
}

playground

ckaran · 2022-09-22T14:41:43Z

Also, what about when you want overlapping matches? I can see cases where I would want all overlapping matches in addition to what the Searcher API currently provides.

BurntSushi · 2022-09-22T15:36:35Z

The docs could certainly be improved. I'm not sure if "greedy" or "maximal" are the right words.

Overlapping matches is a bit of a niche case and I don't think there is a compelling reason for the standard library to support them. Overlapping searches are available in the aho-corasick crate.

ckaran · 2022-09-22T23:31:23Z

@BurntSushi the main reason for the overlapping case is because then you can say that the searcher needs to return all matches, even the overlapping ones. The user is then responsible for deciding which overlapping case is the interesting one(s). If the searcher implements the Iterator trait, then you can use filtering to get the parts you want.

BurntSushi · 2022-09-22T23:34:49Z

I don't think that's worth doing and likely has deep performance implications.

ckaran · 2022-09-23T11:32:29Z

I don't think that's worth doing

I disagree, though I do think that it should be a completely separated from the current API (different function, different trait, whatever is deemed best)

and likely has deep performance implications.

Hah! I agree 110% with you on this! And it's the reason why having it as a separate API is likely the best way to do it.

BurntSushi · 2022-09-23T11:57:06Z

Overlapping searches are way way way too niche to put into std. If you want to convince folks otherwise, I would recommend giving more compelling reasons for why it should be in std.

ckaran · 2022-09-23T13:52:57Z

The best example I can give you off the top of my head is very niche, and likely not applicable to str.

I sometimes have to decode streams of bytes coming in from a receiver that can make errors¹ because of clock skew, mismatched oscillator frequencies, and noise in general. These can show up as bit flips, missing bits, or extra bits. Despite this, I want to know when a legitimate byte stream is starting. The normal (and fast) way is to define some kind of known pattern that signals that a frame is starting, which I'm going to call the start of frame pattern². To make your own life simple, this pattern is going to be chosen to be highly unlikely to occur by accident in your environment, but it's also really, really easy to look for. One example might be just to have a stream of bits like 0101010101 as your start pattern.

Now here is where things get interesting; while you could use some form of forward error correction (FEC) code to encode the start of frame pattern, continuously decoding all incoming bits to look for the pattern is energy intensive, which means battery life goes down. What you want to do is find the probable start of a frame, and then start the computationally (and therefore power) expensive process of decoding bits only when you are pretty sure you've found the start of a frame. So, you don't bother with proper FEC of the frame pattern. Instead, you make your pattern simple, and your pattern matcher will be just as simple. If it sees a pattern that looks like it could be a start of frame, you turn on your full FEC decoder and start decoding bits until you either decide that you made a mistake, or you have a frame (checksums, etc. come later).

The issue is that the noise I mentioned earlier can show up anywhere, including at the head of the start of frame pattern. So instead of looking for the full 0101010101 start of frame pattern, you might just look for 0101 in overlapping substrings, starting a new FEC decode task as soon as you match the pattern³. Which is where you need the overlapping pattern search.

All of that makes good sense in a byte stream, and that is where the windows method can be helpful. Does any of this make sense for a UTF-8 encoded string that is not subject to errors in encoding? Probably not. But, this is the best I could come up with on the spur of the moment for a practical use case.

'Receiver' in this case might be hardware, like a radio receiver. If the receiver is in a noisy environment, then it's constantly receiving bits, including bits that are just noise. ↩
I'm skipping so many details and algorithms that can be used under various conditions, it isn't even funny. If you know that background, just fill in the details in your head, if you don't know them, just ignore them. I'm just trying to give a very simplified example here. ↩
The assumption is that since the noise could be anywhere including in the start of frame section, you might have to start correcting for errors that occurred right at the start of your actual byte string. ↩

BurntSushi · 2022-09-23T14:05:20Z

Yeah I totally grant that there exist use cases for overlapping search. That's not really what I'm looking for, although I appreciate you outlining your use case. What I'm trying to get at here is that they are not frequent enough to be in std. Frequency isn't our only criterion, but I can't see any other reason why std should care at all about overlapping search. If you want it to be in std, you really need to answer the question, "why can't you use a crate for it?" with a specific reason for this particular problem. (i.e., Not general complaints like "I don't want to add dependencies to my project.")

ckaran · 2022-09-23T14:42:39Z

You're right, on all counts. I don't have a good enough reason for why it should be in std and not some crate, so I'm fine with it being dropped.

That said, I would like to see the documentation clarified on which non-overlapping patterns need to be returned. I'm fine with the docs stating that you can return an arbitrary set of non-overlapping matches, I just want it to be 100% clear as to what is expected of implementors.

Fishrock123 · 2022-10-17T22:15:55Z

Could Pattern have a flag that could be set on whether to use eq_ignore_ascii_case for its comparisons?

(note: i do not know how Pattern works so maybe it's not possible idk. but it would be very handy!)

BurntSushi · 2022-10-17T23:03:40Z

@Fishrock123 It is doable, but substring search algorithms are usually not amenable to being adapted straight-forwardly to support case insensitivity. (This isn't true for all of them, but I think is likely true for Two-Way at least, which is the algorithm currently used for substring search.) So in order to support such a flag, you'd probably need to dispatch to an entirely different algorithm.

Another option is the regex crate. It's a bit of a beefy dependency for just a simple ASCII case insensitive search, but it will do it for you. The aho-corasick crate also supports ASCII case insensitivity. While Aho-Corasick is typically used for multi-substring search, it can of course be used with just one substring. aho-corasick is a less beefy dependency than regex.

cehteh · 2022-12-17T13:15:31Z

This issue is open since over 7 years now, me and likely a lot other people would like to use this in stable.

Would it be possible to move the unstable attributes from the root into the API methods instead. Then at least one could export it in a stable way as '&str' already does (in stable). Example (illustration only, i leave the working bits out):

use std::str::pattern::Pattern;

struct MyStr {
    /*...*/
}

impl MyStr {
    fn from(&str) -> Self {...}
    fn as_str(&self) -> &str {...}

    pub fn split_once<'a, P: Pattern<'a>>(&'a self, delimiter: P) -> Option<(MyStr, MyStr)> {
        match self.as_str().split_once(delimiter) {
            Some(a,b) => Some(Self::from(a), Self::from(b)),
            None => None,
        }
    }
}

StragaSevera · 2023-02-26T13:13:59Z

I agree, it is sad to see such an issue being abandoned.

drmason13 · 2023-04-29T12:01:36Z

Before stabilizing the API (seems like this might take some time) one should consider waiting for #44265:
use core::str::pattern::{ReverseSearcher, Searcher};

pub trait Pattern {
    type Searcher<'a>: Searcher<'a>;

Is GAT in its current state suitable for this? I know it has some limitations.

If it is, then I imagine it must be worth considering this API while we are still unstable?

Related to this, I'm writing a function that ultimately searches a &str, the obvious (to me) signature was:

fn until(pattern: impl Pattern);

But I guess it would need some lifetime generic with the current API.

tgross35 · 2023-11-02T06:03:42Z

I think the summary here is that this API needs rework and somebody to champion it. There was some good discussion at #71780, including a rough proposal from @withoutboats in #71780 (comment). This kinda sorta echos @Luro02's outline in #27721 (comment) (it seems like GATs provide us with a more ergonomic solution in any case)

Another thing to keep in mind is that slice patterns were removed (#76901 (comment)) but we may want some way to work with &[u8] byte strings. It is a bit of a pain point that many std APIs require a UTF-8 &str when it isn't always needed, meaning that there is a runtime cost for str::from_utf8 to do things without unsafe when you have mostly-but-maybe-not-completely UTF-8 sequences (e.g., OsStr / Read interaction)

So the next steps forward, probably:

Somebody puts forth a design proposal. I don't think this needs to be a RFC since the concept was already accepted, but it has been so long that I think we just need a from-scratch design with justification and documented limitations. An ACP is probably a good place for this (acps are just an issue template at https://github.com/rust-lang/libs-team, link it here if you post one)
Implement that proposal
Revisit stabilization after it has been around for a while

It is unfortunate that we more or less have to go back to square one with stabilization, but there have been a lot of lessons learned and better ways to do things since the 2014 RFC (a decade!). Really this is probably just in need of somebody to take charge of the redesign and push everything forward.

cehteh · 2023-11-03T21:03:11Z

All I'd really asked for above is to stabilize the existence of the Pattern API, that would already address a lot of problems removing unstable bits from the stable Rust stdlib API.

When the API/implementation behind needs more work, that's Ok. But honestly after that much years and many people relying on patterns, overly big changes would be quite surprising.

tgross35 · 2023-11-03T23:09:51Z

All I'd really asked for above is to stabilize the existence of the Pattern API, that would already address a lot of problems removing unstable bits from the stable Rust stdlib API.

That would of course be nice, but we don’t want to do that until knowing for sure that we won’t need to change generics from what there currently is (a single lifetime). Probably unlikely, but there’s no way of knowing without a concrete proposal.

When the API/implementation behind needs more work, that's Ok. But honestly after that much years and many people relying on patterns, overly big changes would be quite surprising.

I think it’s the opposite: all the discussion here, the very long time with no stabilization, and the proposed replacements I linked in #27721 (comment) seem to indicate that nobody is happy enough with this API as-is. This feature needs a champion who is willing to experiment and push things along.

Phosphorus-M · 2024-03-10T21:54:25Z

Do we have some news about this feature?

Luro02 · 2024-03-11T07:27:37Z

Do we have some news about this feature?

Please stop poluting the issue tracker. If there is an update, someone will link to this issue.

alexcrichton added T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. B-unstable Feature: Implemented in the nightly compiler and unstable. labels Aug 12, 2015

aturon added the I-nominated label Jan 27, 2016

alexcrichton mentioned this issue Feb 18, 2016

Tracking issue for Add a generic string pattern matching API (RFC 528) #22477

Closed

aturon removed the I-nominated label Jun 9, 2016

Stebalien mentioned this issue Aug 13, 2016

Implement 1581 (FusedIterator) #35656

Merged

3 tasks

allaboutevemirolive mentioned this issue Jul 26, 2023

Tracking Issue for string_remove_matches #72826

Open

2 tasks

tomtau mentioned this issue Oct 5, 2023

Create a string library that works on encrypted data using TFHE-rs zama-ai/bounty-and-grant-program#80

Closed

Tracking issue for string patterns #27721

Tracking issue for string patterns #27721

Comments

alexcrichton commented Aug 12, 2015 • edited by bstrie

bluss commented Aug 16, 2015

jneem commented Sep 17, 2015

bluss commented Sep 17, 2015

jneem commented Sep 17, 2015

jneem commented Sep 18, 2015

mkpankov commented Nov 17, 2015

bluss commented Nov 17, 2015

bluss commented Nov 17, 2015

BurntSushi commented Nov 17, 2015

aturon commented Jan 27, 2016

alexcrichton commented Jan 29, 2016

Kimundi commented Feb 17, 2016

bluss commented Feb 18, 2016

Kimundi commented Feb 18, 2016

DanielKeep commented Mar 4, 2016

DanielKeep commented Mar 4, 2016

bluss commented Mar 4, 2016

withoutboats commented May 15, 2016

bluss commented May 15, 2016

withoutboats commented May 15, 2016

Stebalien commented Aug 13, 2016

shahn commented Apr 12, 2017

TonalidadeHidrica commented Nov 25, 2021

jhpratt commented Nov 25, 2021

Luro02 commented Nov 25, 2021

Luro02 commented Nov 25, 2021

TonalidadeHidrica commented Nov 25, 2021

Mingun commented Nov 25, 2021

jhpratt commented Nov 25, 2021

lizelive commented Aug 10, 2022

ckaran commented Sep 22, 2022

Greedy approach

Starts at the start of the string, but skips some letters because why not?

Getting the most matches is so overrated, how about we skip some?

Suggestions for documentation improvements.

ckaran commented Sep 22, 2022

BurntSushi commented Sep 22, 2022 • edited

ckaran commented Sep 22, 2022 • edited

BurntSushi commented Sep 22, 2022

ckaran commented Sep 23, 2022

BurntSushi commented Sep 23, 2022

ckaran commented Sep 23, 2022

Footnotes

BurntSushi commented Sep 23, 2022

ckaran commented Sep 23, 2022

Fishrock123 commented Oct 17, 2022

BurntSushi commented Oct 17, 2022

cehteh commented Dec 17, 2022

StragaSevera commented Feb 26, 2023

drmason13 commented Apr 29, 2023

tgross35 commented Nov 2, 2023 • edited

cehteh commented Nov 3, 2023

tgross35 commented Nov 3, 2023

Phosphorus-M commented Mar 10, 2024

Luro02 commented Mar 11, 2024

alexcrichton commented Aug 12, 2015 •

edited by bstrie

BurntSushi commented Sep 22, 2022 •

edited

ckaran commented Sep 22, 2022 •

edited

tgross35 commented Nov 2, 2023 •

edited