-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement further str
split variants via a builder API
#168
Comments
str
split combinations via a builder APIstr
split variants via a builder API
Adding an extra comment here that it would be nice to also add the ability to include indices of splits, as mentioned in #222. |
We discussed this in the libs meeting today. We don't like the large API surface that this introduces, and think this would be a better fix for an |
I don't really understand how this is a large API surface. My implementation in the str_splitter crate has only 5 functions and 3 structs (or a single struct with const generic parameters). Sure, it's not just a couple functions, but it's also not 10. Any such external crate will have to reimplement It's also weird to send people out of the standard library for |
One of the particular concerns that was raised is that builder APIs are not very ergonomic to work with in practice, except in certain cases where there is a potentially unbounded number of options to apply (e.g. |
I would want to see a concrete design proposal here, but I'm not sure I would agree with. I am personally a fan of builders and think they work quite nicely. But I don't mean to make any universal claims. A non-builder API might work better here. Not sure. |
A builder might make sense if we expect it to generalize along additional dimensions in the future, something that a method can't handle. But such a thing could still first be designed in a crate. |
The builder API proposed here is different from most in that every stage is a totally usable iterator. It doesn't require an explicit To me, it feels quite ergonomic. It works well with editor suggestions.
Such future extensions have already been proposed, such as left/right inclusivity and
I'm not sure if this was just missed, but I have implemented this approach two different ways in the str_splitter crate. Take a look at the docs for Splitter as an example: https://docs.rs/str_splitter/latest/str_splitter/combinators/struct.Splitter.html All of the doctests in that crate passed when I last published it, but due to relying on nightly features it could have changed. (Also it provides as_str instead of remainder as that change was introduced after I published) |
That's great. If there's a lot of demand for it in the future we can reconsider it for inclusion in the library. |
Proposal
Problem statement
There are multiple "options" provided one the
str
type:rsplit...
)split_inclusive
)split_terminator
,rsplit_terminator
)splitn
,rsplitn
)split_once
,rsplit_once
)Currently, various useful combination of these options are missing from the
str
type (non-exhaustive):rsplit_inclusive
)splitn_inclusive
,rsplitn_inclusive
)split_inclusive_once
,rsplit_inclusive_once
)Additionally, there may be demand for before- and after-inclusive versions (
split_inclusive
is after-inclusive).Adding all of the combinations would quickly balloon the API surface of
str
.The combinatorial explosion would only get worse if more options are added.
Motivation, use-cases
A comment on a PR adding more combinations was the inspiration for this proposal:
Example:
rsplit_inclusive
Example:
splitn_inclusive
Example:
split_inclusive_once
Solution sketches
My proposal is to add a builder API to the existing
Split
type. This consists of a handful of functions that modify the splitting behavior:And these modifiers can be combined to produce any of the existing splitting functions:
split(pat)
split(pat)
split_inclusive(pat)
split(pat).to_inclusive()
rsplit(pat)
split(pat).to_reversed()
split_terminator(pat)
split(pat).to_terminated()
rsplit_terminator(pat)
split(pat).to_terminated().to_reversed()
split(pat).to_reversed().to_terminated()
splitn(n, pat)
split(pat).with_limit(n)
rsplitn(n, pat)
split(pat).with_limit(n)
split_once(pat)
split(pat).once()
rsplit_once(pat)
split(pat).to_reversed().once()
Plus more that aren't currently available:
rsplit_inclusive(pat)
split(pat).to_inclusive().to_reversed()
split_inclusive_once(pat)
split(pat).to_inclusive().once()
rsplit_inclusive_once(pat)
split(pat).to_inclusive().to_reversed().once()
splitn_inclusive(n, pat)
split(pat).to_inclusive().with_limit(n)
rsplitn_inclusive(n, pat)
split(pat).to_inclusive().to_reversed().with_limit(n)
All of the above (with the exception of
once
variants) return a type that implementsIterator
,DoubleEndedIterator
, etc as the existing functions do. The difference is that the type returned is not a standalone struct, but instead a combinator of generic structs. For instance,split(pat).to_reversed()
returnsReversed<Split<'a, P>>
,split(pat).to_inclusive().to_reversed()
returnsReversed<Inclusive<Split<'a, P>>>
, etc.A proof of concept is available in the
str_splitter
crate. When desired, I can quickly put forward an implementation PR, since the base code there is directly from the standard library source.This approach has several benefits:
Alternate solutions
One alternate solution put forward in the aforementioned PR comment is adding extra
next
functions on theSplit
struct that will each return the next substring of the given modifier:The downside of this approach is that those would not be usable as iterators in their own right. Additionally, adding more orthogonal modifiers would result in the same combinatorial explosion (though to a lesser degree) of functions on the
Split
type.Links and related work
rsplit_inclusive
split_inclusive
'sDoubleEndedIterator
implementation is unexpected rust#100756split_inclusive
variants for slices rust#91546str_splitter
crate: https://docs.rs/str_splitterWhat happens now?
This issue is part of the libs-api team API change proposal process. Once this issue is filed the libs-api team will review open proposals in its weekly meeting. You should receive feedback within a week or two.
The text was updated successfully, but these errors were encountered: