Documentation

Init.Data.String.Basic

Creates a string that contains the characters in a list, in order.

Examples:
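A small sketch of the conversion. The declaration name is not shown on this page, so `List.asString` is an assumption here, as is the toolchain version this page describes:

```lean
-- Assumption: the declaration documented above is `List.asString`.
#eval ['L', '∃', '∀', 'N'].asString   -- "L∃∀N"
#eval ([] : List Char).asString        -- ""
```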

Equations
@[extern lean_string_dec_lt]
instance String.decidableLT (s₁ s₂ : String) :
Decidable (s₁ < s₂)
Equations
@[reducible, inline, deprecated String.decidableLT (since := "2024-12-13")]
abbrev String.decLt (s₁ s₂ : String) :
Decidable (s₁ < s₂)
Equations
@[reducible]
def String.le (a b : String) :

Non-strict inequality on strings, typically used via the ≤ operator.

a ≤ b is defined to mean ¬ b < a.
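The definition can be checked directly. The following is a minimal sketch, assuming the `LE String` instance unfolds to `String.le` as described above:

```lean
-- `≤` on strings is the negation of the reversed strict comparison.
example (a b : String) : a ≤ b ↔ ¬ b < a := Iff.rfl

#eval decide ("abc" ≤ "abd")   -- true
#eval decide ("abd" ≤ "abc")   -- false
```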

Equations
instance String.decLE (s₁ s₂ : String) :
Decidable (s₁ ≤ s₂)
Equations
@[extern lean_string_length]

Returns the length of a string in Unicode code points.

Examples:
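A short illustration of the difference between length in code points and size in bytes. It assumes the declaration documented here is `String.length` (the name is not shown above) and uses `String.utf8ByteSize` for comparison:

```lean
#eval "abc".length          -- 3
#eval "L∃∀N".length         -- 4 code points
#eval "L∃∀N".utf8ByteSize   -- 8 bytes, since '∃' and '∀' take 3 bytes each
```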

Equations
@[extern lean_string_push]

Adds a character to the end of a string.

The internal implementation uses dynamic arrays and will perform destructive updates if the string is not shared.

Examples:

  • "abc".push 'd' = "abcd"
  • "".push 'a' = "a"
Equations
  • { data := s }.push x✝ = { data := s ++ [x✝] }
@[extern lean_string_append]

Appends two strings. Usually accessed via the ++ operator.

The internal implementation will perform destructive updates if the string is not shared.

Examples:

  • "abc".append "def" = "abcdef"
  • "abc" ++ "def" = "abcdef"
  • "" ++ "" = ""
Equations
  • { data := a }.append { data := b } = { data := a ++ b }

Converts a string to a list of characters.

Even though the logical model of strings is as a structure that wraps a list of characters, this operation takes time and space linear in the length of the string. At runtime, strings are represented as dynamic arrays of bytes.

Examples:
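A brief sketch, assuming the declaration documented here is `String.toList` (the name is not shown above):

```lean
#eval "abc".toList   -- ['a', 'b', 'c']
#eval "".toList      -- []
```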

Equations
@[extern lean_string_is_valid_pos]

Returns true if p is a valid UTF-8 position in the string s.

This means that p ≤ s.endPos and p lies on a UTF-8 character boundary. At runtime, this operation takes constant time.

Examples:

Equations
@[extern lean_string_utf8_get]
def String.get (s : String) (p : Pos) :

Returns the character at position p of a string. If p is not a valid position, returns the fallback value (default : Char), which is 'A', but does not panic.

This function is overridden with an efficient implementation in runtime code. See String.utf8GetAux for the reference implementation.

Examples:

  • "abc".get ⟨1⟩ = 'b'
  • "abc".get ⟨3⟩ = (default : Char) because byte 3 is at the end of the string.
  • "L∃∀N".get ⟨2⟩ = (default : Char) because byte 2 is in the middle of '∃'.
Equations
Equations
@[extern lean_string_utf8_get_opt]

Returns the character at position p of a string. If p is not a valid position, returns none.

This function is overridden with an efficient implementation in runtime code. See String.utf8GetAux? for the reference implementation.

Examples:

  • "abc".get? ⟨1⟩ = some 'b'
  • "abc".get? ⟨3⟩ = none
  • "L∃∀N".get? ⟨1⟩ = some '∃'
  • "L∃∀N".get? ⟨2⟩ = none
Equations
@[extern lean_string_utf8_get_bang]
def String.get! (s : String) (p : Pos) :

Returns the character at position p of a string. Panics if p is not a valid position.

See String.get? for a safer alternative.

This function is overridden with an efficient implementation in runtime code. See String.utf8GetAux for the reference implementation.

Examples:

  • "abc".get! ⟨1⟩ = 'b'
Equations
def String.utf8SetAux (c' : Char) :
List Char → Pos → Pos → List Char
Equations
@[extern lean_string_utf8_set]
def String.set :
String → Pos → Char → String

Replaces the character at a specified position in a string with a new character. If the position is invalid, the string is returned unchanged.

If both the replacement character and the replaced character are 7-bit ASCII characters and the string is not shared, then it is updated in-place and not copied.

Examples:

  • "abc".set ⟨1⟩ 'B' = "aBc"
  • "abc".set ⟨3⟩ 'D' = "abc"
  • "L∃∀N".set ⟨4⟩ 'X' = "L∃XN"
  • "L∃∀N".set ⟨2⟩ 'X' = "L∃∀N" because '∃' is a multi-byte character, so the byte index 2 is an invalid position.
Equations
def String.modify (s : String) (i : Pos) (f : Char → Char) :

Replaces the character at position i in the string s with the result of applying f to that character. If i is an invalid position, the string is returned unchanged.

If both the replacement character and the replaced character are 7-bit ASCII characters and the string is not shared, then it is updated in-place and not copied.

Examples:
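A minimal sketch of the behavior described above, using `Char.toUpper` as the function to apply:

```lean
#eval "abc".modify ⟨1⟩ Char.toUpper   -- "aBc"
#eval "abc".modify ⟨3⟩ Char.toUpper   -- "abc": 3 is the end position, so nothing changes
```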

Equations
@[extern lean_string_utf8_next]
def String.next (s : String) (p : Pos) :

Returns the next position in a string after position p. The result is unspecified if p is not a valid position or if p = s.endPos.

A run-time bounds check is performed to determine whether p is at the end of the string. If a bounds check has already been performed, use String.next' to avoid a repeated check.

Some examples where the result is unspecified:

  • "abc".next ⟨3⟩, since 3 = "abc".endPos
  • "L∃∀N".next ⟨2⟩, since 2 points into the middle of a multi-byte UTF-8 character

Examples:

  • "abc".get ("abc".next 0) = 'b'
  • "L∃∀N".get (0 |> "L∃∀N".next |> "L∃∀N".next) = '∀'
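Because positions are byte indices, stepping with `String.next` skips over every byte of a multi-byte character. The sketch below collects the byte index at which each character starts; `charStarts` is a hypothetical helper introduced only for illustration, and it is declared `partial` to avoid a termination proof:

```lean
/-- Hypothetical helper: the byte index at which each character of `s` starts. -/
partial def charStarts (s : String) (p : String.Pos := 0)
    (acc : Array Nat := #[]) : Array Nat :=
  if s.atEnd p then acc
  else charStarts s (s.next p) (acc.push p.byteIdx)

#eval charStarts "abc"    -- #[0, 1, 2]
#eval charStarts "L∃∀N"   -- #[0, 1, 4, 7]
```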
Equations
Equations
@[extern lean_string_utf8_prev]
def String.prev :
String → Pos → Pos

Returns the position in a string before a specified position, p. If p = ⟨0⟩, returns 0. If p is not a valid position, the result is unspecified.

For example, "L∃∀N".prev ⟨3⟩ is unspecified, since byte 3 occurs in the middle of the multi-byte character '∃'.

Examples:

  • "abc".get ("abc".endPos |> "abc".prev) = 'c'
  • "L∃∀N".get ("L∃∀N".endPos |> "L∃∀N".prev |> "L∃∀N".prev |> "L∃∀N".prev) = '∃'
Equations
@[inline]

Returns the first character in s. If s = "", returns (default : Char).

Examples:
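A brief sketch, assuming the declaration documented here is `String.front` (the name is not shown above):

```lean
#eval "abc".front                    -- 'a'
#eval "".front == (default : Char)   -- true
```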

Equations
@[inline]

Returns the last character in s. If s = "", returns (default : Char).

Examples:

  • "abc".back = 'c'
  • "".back = (default : Char)
Equations
@[extern lean_string_utf8_at_end]

Returns true if a specified byte position is greater than or equal to the position which points to the end of a string. Otherwise, returns false.

Examples:

  • (0 |> "abc".next |> "abc".next |> "abc".atEnd) = false
  • (0 |> "abc".next |> "abc".next |> "abc".next |> "abc".next |> "abc".atEnd) = true
  • (0 |> "L∃∀N".next |> "L∃∀N".next |> "L∃∀N".next |> "L∃∀N".atEnd) = false
  • (0 |> "L∃∀N".next |> "L∃∀N".next |> "L∃∀N".next |> "L∃∀N".next |> "L∃∀N".atEnd) = true
  • "abc".atEnd ⟨4⟩ = true
  • "L∃∀N".atEnd ⟨7⟩ = false
  • "L∃∀N".atEnd ⟨8⟩ = true
Equations
@[extern lean_string_utf8_get_fast]
def String.get' (s : String) (p : Pos) (h : ¬s.atEnd p = true) :

Returns the character at position p of a string. Returns (default : Char), which is 'A', if p is not a valid position.

Requires evidence, h, that p is within bounds instead of performing a run-time bounds check as in String.get.

A typical pattern combines get' with a dependent if-expression to avoid the overhead of an additional bounds check. For example:

def getInBounds? (s : String) (p : String.Pos) : Option Char :=
  if h : s.atEnd p then none else some (s.get' p h)

Even with evidence of ¬ s.atEnd p, p may be invalid if a byte index points into the middle of a multi-byte UTF-8 character. For example, "L∃∀N".get' ⟨2⟩ (by decide) = (default : Char).

Examples:

  • "abc".get' 0 (by decide) = 'a'
  • let lean := "L∃∀N"; lean.get' (0 |> lean.next |> lean.next) (by decide) = '∀'
Equations
@[extern lean_string_utf8_next_fast]
def String.next' (s : String) (p : Pos) (h : ¬s.atEnd p = true) :

Returns the next position in a string after position p. The result is unspecified if p is not a valid position.

Requires evidence, h, that p is within bounds. Unlike String.next, no run-time bounds check is performed.

A typical pattern combines String.next' with a dependent if-expression to avoid the overhead of an additional bounds check. For example:

def next? (s : String) (p : String.Pos) : Option Char :=
  if h : s.atEnd p then none else some (s.get (s.next' p h))

Example:

  • let abc := "abc"; abc.get (abc.next' 0 (by decide)) = 'b'
Equations
@[reducible, inline, deprecated Char.utf8Size_pos (since := "2026-06-04")]
Equations
@[simp]
theorem String.pos_lt_eq (p₁ p₂ : Pos) :
(p₁ < p₂) = (p₁.byteIdx < p₂.byteIdx)
@[simp]
theorem String.pos_add_char (p : Pos) (c : Char) :
theorem String.Pos.ne_zero_of_lt {a b : Pos} :
a < b → b ≠ 0
theorem String.lt_next (s : String) (i : Pos) :
theorem String.utf8PrevAux_lt_of_pos (cs : List Char) (i p : Pos) :
p ≠ 0 → (utf8PrevAux cs i p).byteIdx < p.byteIdx
theorem String.prev_lt_of_pos (s : String) (i : Pos) (h : i ≠ 0) :
@[irreducible]
def String.posOfAux (s : String) (c : Char) (stopPos pos : Pos) :
Equations
@[inline]
def String.posOf (s : String) (c : Char) :

Returns the position of the first occurrence of a character, c, in a string s. If s does not contain c, returns s.endPos.

Examples:

  • "abcba".posOf 'a' = ⟨0⟩
  • "abcba".posOf 'z' = ⟨5⟩
  • "L∃∀N".posOf '∀' = ⟨4⟩
Equations
@[irreducible]
def String.revPosOfAux (s : String) (c : Char) (pos : Pos) :
Equations
@[inline]

Returns the position of the last occurrence of a character, c, in a string s. If s does not contain c, returns none.

Examples:

  • "abcabc".revPosOf 'a' = some ⟨3⟩
  • "abcabc".revPosOf 'z' = none
  • "L∃∀N".revPosOf '∀' = some ⟨4⟩
Equations
@[irreducible]
def String.findAux (s : String) (p : Char → Bool) (stopPos pos : Pos) :
Equations
@[inline]
def String.find (s : String) (p : Char → Bool) :

Finds the position of the first character in a string for which the Boolean predicate p returns true. If there is no such character in the string, then the end position of the string is returned.

Examples:

  • "coffee tea water".find (·.isWhitespace) = ⟨6⟩
  • "tea".find (· == 'X') = ⟨3⟩
  • "".find (· == 'X') = ⟨0⟩
Equations
@[irreducible]
def String.revFindAux (s : String) (p : Char → Bool) (pos : Pos) :
Equations
@[inline]
def String.revFind (s : String) (p : Char → Bool) :

Finds the position of the last character in a string for which the Boolean predicate p returns true. If there is no such character in the string, then none is returned.

Examples:

  • "coffee tea water".revFind (·.isWhitespace) = some ⟨10⟩
  • "tea".revFind (· == 'X') = none
  • "".revFind (· == 'X') = none
Equations
@[reducible, inline]
abbrev String.Pos.min (p₁ p₂ : Pos) :

Returns either p₁ or p₂, whichever has the least byte index.

Equations

Returns the first position where the two strings differ.

If one string is a prefix of the other, then the returned position is the end position of the shorter string. If the strings are identical, then their end position is returned.

Examples:
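A minimal sketch of the three cases described above (a mismatch, identical strings, and a prefix). The result is projected to its byte index so that it prints as a plain number:

```lean
#eval ("tea".firstDiffPos "ten").byteIdx    -- 2
#eval ("tea".firstDiffPos "tea").byteIdx    -- 3
#eval ("tea".firstDiffPos "teas").byteIdx   -- 3
```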

Equations
@[irreducible]
def String.firstDiffPos.loop (a b : String) (stopPos i : Pos) :
Equations
  • One or more equations did not get rendered due to their size.
@[extern lean_string_utf8_extract]

Creates a new string that consists of the region of the input string delimited by the two positions.

The result is "" if the start position is greater than or equal to the end position or if the start position is at the end of the string. If either position is invalid (that is, if either points at the middle of a multi-byte UTF-8 character) then the result is unspecified.

Examples:

  • "red green blue".extract ⟨0⟩ ⟨3⟩ = "red"
  • "red green blue".extract ⟨3⟩ ⟨0⟩ = ""
  • "red green blue".extract ⟨0⟩ ⟨100⟩ = "red green blue"
  • "red green blue".extract ⟨4⟩ ⟨100⟩ = "green blue"
  • "L∃∀N".extract ⟨2⟩ ⟨100⟩ is unspecified, because byte 2 is in the middle of the multi-byte character '∃'.
Equations
@[irreducible, specialize #[]]
def String.splitAux (s : String) (p : Char → Bool) (b i : Pos) (r : List String) :
Equations
  • One or more equations did not get rendered due to their size.
@[specialize #[]]
def String.split (s : String) (p : Char → Bool) :

Splits a string at each character for which p returns true.

The characters that satisfy p are not included in any of the resulting strings. If multiple characters in a row satisfy p, then the resulting list will contain empty strings.

Examples:

  • "coffee tea water".split (·.isWhitespace) = ["coffee", "tea", "water"]
  • "coffee  tea  water".split (·.isWhitespace) = ["coffee", "", "tea", "", "water"]
  • "fun x =>\n x + 1\n".split (· == '\n') = ["fun x =>", " x + 1", ""]
Equations
@[irreducible]
def String.splitOnAux (s sep : String) (b i j : Pos) (r : List String) :

Auxiliary for splitOn. Preconditions:

  • sep is not empty
  • b <= i are indexes into s
  • j is an index into sep, and not at the end

It represents the state where we have currently parsed some split parts into r (in reverse order), b is the beginning of the string / the end of the previous match of sep, and the first j bytes of sep match the bytes i-j .. i of s.

Equations
  • One or more equations did not get rendered due to their size.
@[inline]
def String.splitOn (s : String) (sep : String := " ") :

Splits a string s on occurrences of the separator string sep. The default separator is " ".

When sep is empty, the result is [s]. When sep occurs in overlapping patterns, the first match is taken. There will always be exactly n+1 elements in the returned list if there were n non-overlapping matches of sep in the string. The separators are not included in the returned substrings.

Examples:

  • "here is some text ".splitOn = ["here", "is", "some", "text", ""]
  • "here is some text ".splitOn "some" = ["here is ", " text "]
  • "here is some text ".splitOn "" = ["here is some text "]
  • "ababacabac".splitOn "aba" = ["", "bac", "c"]
Equations
@[inline]
def String.pushn (s : String) (c : Char) (n : Nat) :

Adds multiple repetitions of a character to the end of a string.

Returns s, with n repetitions of c at the end. Internally, the implementation repeatedly calls String.push, so the string is modified in-place if there is a unique reference to it.

Examples:

  • "indeed".pushn '!' 2 = "indeed!!"
  • "indeed".pushn '!' 0 = "indeed"
  • "".pushn ' ' 4 = "    "
Equations
@[inline]

Checks whether a string is empty.

Empty strings are equal to "" and have length and end position 0.

Examples:
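A brief sketch, assuming the declaration documented here is `String.isEmpty` (the name is not shown above):

```lean
#eval "".isEmpty        -- true
#eval "empty".isEmpty   -- false
#eval " ".isEmpty       -- false: a single space is still a character
```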

Equations
@[inline]

Appends all the strings in a list of strings, in order.

Use String.intercalate to place a separator string between the strings in a list.

Examples:
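A brief sketch, assuming the declaration documented here is `String.join` (the name is not shown above):

```lean
#eval String.join ["gr", "ee", "n"]   -- "green"
#eval String.join []                  -- ""
```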

Equations
@[inline]

Returns a new string that contains only the character c.

Because strings are encoded in UTF-8, the resulting string may take multiple bytes.

Examples:
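A brief sketch, assuming the declaration documented here is `String.singleton` (the name is not shown above). The second line illustrates that a one-character string can occupy several bytes:

```lean
#eval String.singleton 'L'                  -- "L"
#eval (String.singleton '∃').utf8ByteSize   -- 3
```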

Equations

Appends the strings in a list of strings, placing the separator s between each pair.

Examples:

  • ", ".intercalate ["red", "green", "blue"] = "red, green, blue"
  • " and ".intercalate ["tea", "coffee"] = "tea and coffee"
  • " | ".intercalate ["M", "", "N"] = "M |  | N"
Equations

An iterator over the characters (Unicode code points) in a String. Typically created by String.iter.

String iterators pair a string with a valid byte index. This allows efficient character-by-character processing of strings while avoiding the need to manually ensure that byte indices are used with the correct strings.

An iterator is valid if the position i is valid for the string s, meaning 0 ≤ i ≤ s.endPos and i lies on a UTF-8 character boundary. If i = s.endPos, the iterator is at the end of the string.

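Most operations on iterators return unspecified values if the iterator is not valid. The functions in the String.Iterator API rule out the creation of invalid iterators, with two exceptions: moving an iterator forward by one character when it is already at the end of the string, and moving it forward by more characters than remain in the string. The structure has the following fields: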

  • s : String

    The string being iterated over.

  • i : Pos

    The current UTF-8 byte position in the string s.

    This position is not guaranteed to be valid for the string. If the position is not valid, then the current character is (default : Char), similar to String.get on an invalid position.
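Character-by-character processing with an iterator might look like the following. This is only a sketch: `countChars` is a hypothetical helper, declared `partial` to avoid a termination proof, and it relies on `String.iter`, `String.Iterator.hasNext`, and `String.Iterator.next`:

```lean
/-- Hypothetical helper: counts characters by walking an iterator to the end. -/
partial def countChars (it : String.Iterator) (n : Nat := 0) : Nat :=
  if it.hasNext then countChars it.next (n + 1) else n

#eval countChars ("L∃∀N".iter)   -- 4
#eval countChars ("".iter)       -- 0
```

The iterator is only advanced while `hasNext` is true, so it never becomes invalid.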

Instances For
@[inline]

Creates an iterator at the beginning of the string.

Equations
@[reducible, inline]

Creates an iterator at the beginning of the string.

Equations

The size of a string iterator is the number of bytes remaining.

Recursive functions that iterate towards the end of a string will typically decrease this measure.

Equations
@[inline]

The string being iterated over.

Equations
@[inline]

The number of UTF-8 bytes remaining in the iterator.

Equations
@[inline]

The current UTF-8 byte position in the string s.

This position is not guaranteed to be valid for the string. If the position is not valid, then the current character is (default : Char), similar to String.get on an invalid position.

Equations
@[inline]

Gets the character at the iterator's current position.

A run-time bounds check is performed. Use String.Iterator.curr' to avoid redundant bounds checks.

If the position is invalid, returns (default : Char).

Equations
@[inline]

Moves the iterator's position forward by one character, unconditionally.

It is only valid to call this function if the iterator is not at the end of the string (i.e. if Iterator.atEnd is false); otherwise, the resulting iterator will be invalid.

Equations
  • { s := s, i := i }.next = { s := s, i := s.next i }
@[inline]

Moves the iterator's position backward by one character, unconditionally.

The position is not changed if the iterator is at the beginning of the string.

Equations
  • { s := s, i := i }.prev = { s := s, i := s.prev i }
@[inline]

Checks whether the iterator is past its string's last character.

Equations
@[inline]

Checks whether the iterator is at or before the string's last character.

Equations
@[inline]

Checks whether the iterator is after the beginning of the string.

Equations
@[inline]

Gets the character at the iterator's current position.

The proof of it.hasNext ensures that there is, in fact, a character at the current position. This function is faster than String.Iterator.curr due to avoiding a run-time bounds check.
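The proof is usually obtained from a dependent if-expression, in the same way as with String.get'. A minimal sketch, in which `curr?` is a hypothetical helper:

```lean
/-- Hypothetical helper: the current character, or `none` at the end of the string. -/
def curr? (it : String.Iterator) : Option Char :=
  if h : it.hasNext then some (it.curr' h) else none

#eval curr? ("abc".iter)         -- some 'a'
#eval curr? ("abc".iter.toEnd)   -- none
```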

Equations
@[inline]

Moves the iterator's position forward by one character, unconditionally.

The proof of it.hasNext ensures that there is, in fact, a position that's one character forwards. This function is faster than String.Iterator.next due to avoiding a run-time bounds check.

Equations
  • { s := s, i := i }.next' h_2 = { s := s, i := s.next' i }
@[inline]

Replaces the current character in the string.

Does nothing if the iterator is at the end of the string. If both the replacement character and the replaced character are 7-bit ASCII characters and the string is not shared, then it is updated in-place and not copied.

Equations
  • { s := s, i := i }.setCurr x✝ = { s := s.set i x✝, i := i }
@[inline]

Moves the iterator's position to the end of the string, just past the last character.

Equations
@[inline]

Extracts the substring between the positions of two iterators. The first iterator's position is the start of the substring, and the second iterator's position is the end.

Returns the empty string if the iterators are for different strings, or if the position of the first iterator is past the position of the second iterator.
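A short sketch of both cases, assuming the declaration documented here is `String.Iterator.extract` and using `String.Iterator.forward` to build the second iterator:

```lean
#eval
  let it := "red green blue".iter
  it.extract (it.forward 3)     -- "red"

#eval
  let it := "red green blue".iter
  (it.forward 3).extract it     -- "": the first iterator is past the second
```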

Equations

Moves the iterator's position forward by the specified number of characters.

The resulting iterator is only valid if the number of characters to skip is less than or equal to the number of characters left in the iterator.

Equations
@[inline]

The remaining characters in an iterator, as a string.

Equations

Moves the iterator's position forward by the specified number of characters.

The resulting iterator is only valid if the number of characters to skip is less than or equal to the number of characters left in the iterator.

Equations

Moves the iterator's position back by the specified number of characters, stopping at the beginning of the string.

Equations
@[irreducible]
def String.offsetOfPosAux (s : String) (pos i : Pos) (offset : Nat) :
Equations
@[inline]
def String.offsetOfPos (s : String) (pos : Pos) :

Returns the character index that corresponds to the provided position (i.e. UTF-8 byte index) in a string.

If the position is at the end of the string, then the string's length in characters is returned. If the position is invalid due to pointing at the middle of a UTF-8 byte sequence, then the character index of the next character after the position is returned.

Examples:
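A minimal sketch of the three cases described above: valid positions, a position inside a multi-byte character, and the end position.

```lean
#eval "L∃∀N".offsetOfPos ⟨0⟩   -- 0
#eval "L∃∀N".offsetOfPos ⟨1⟩   -- 1
#eval "L∃∀N".offsetOfPos ⟨2⟩   -- 2: byte 2 is inside '∃', so the next character's index is returned
#eval "L∃∀N".offsetOfPos ⟨4⟩   -- 2
#eval "L∃∀N".offsetOfPos ⟨8⟩   -- 4: the end position maps to the length in characters
```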

Equations
@[irreducible, specialize #[]]
def String.foldlAux {α : Type u} (f : α → Char → α) (s : String) (stopPos i : Pos) (a : α) :
α
Equations
@[inline]
def String.foldl {α : Type u} (f : α → Char → α) (init : α) (s : String) :
α

Folds a function over a string from the left, accumulating a value starting with init. The accumulated value is combined with each character in order, using f.

Examples:

  • "coffee tea water".foldl (fun n c => if c.isWhitespace then n + 1 else n) 0 = 2
  • "coffee tea and water".foldl (fun n c => if c.isWhitespace then n + 1 else n) 0 = 3
  • "coffee tea water".foldl (·.push ·) "" = "coffee tea water"
Equations
@[irreducible, specialize #[]]
def String.foldrAux {α : Type u} (f : Char → α → α) (a : α) (s : String) (i begPos : Pos) :
α
Equations
@[inline]
def String.foldr {α : Type u} (f : Char → α → α) (init : α) (s : String) :
α

Folds a function over a string from the right, accumulating a value starting with init. The accumulated value is combined with each character in reverse order, using f.

Examples:

  • "coffee tea water".foldr (fun c n => if c.isWhitespace then n + 1 else n) 0 = 2
  • "coffee tea and water".foldr (fun c n => if c.isWhitespace then n + 1 else n) 0 = 3
  • "coffee tea water".foldr (fun c s => s.push c) "" = "retaw aet eeffoc"
Equations
@[irreducible, specialize #[]]
def String.anyAux (s : String) (stopPos : Pos) (p : Char → Bool) (i : Pos) :
Equations
@[inline]
def String.any (s : String) (p : Char → Bool) :

Checks whether there is a character in a string for which the Boolean predicate p returns true.

Short-circuits at the first character for which p returns true.

Examples:

  • "brown".any (·.isLetter) = true
  • "brown".any (·.isWhitespace) = false
  • "brown and orange".any (·.isLetter) = true
  • "".any (fun _ => false) = false
Equations
@[inline]
def String.all (s : String) (p : Char → Bool) :

Checks whether the Boolean predicate p returns true for every character in a string.

Short-circuits at the first character for which p returns false.

Examples:

  • "brown".all (·.isLetter) = true
  • "brown and orange".all (·.isLetter) = false
  • "".all (fun _ => false) = true
Equations
@[inline]
def String.contains (s : String) (c : Char) :

Checks whether a string contains the specified character.

Examples:
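A brief sketch of typical calls:

```lean
#eval "green".contains 'e'   -- true
#eval "green".contains 'x'   -- false
#eval "".contains 'x'        -- false
```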

Equations
theorem String.utf8SetAux_of_gt (c' : Char) (cs : List Char) {i p : Pos} :
i > p → utf8SetAux c' cs i p = cs
theorem String.set_next_add (s : String) (i : Pos) (c : Char) (b₁ b₂ : Nat) (h : (s.next i).byteIdx + b₁ = s.endPos.byteIdx + b₂) :
((s.set i c).next i).byteIdx + b₁ = (s.set i c).endPos.byteIdx + b₂
theorem String.set_next_add.foo (i : Pos) (c : Char) (cs : List Char) (a : Pos) (b₁ b₂ : Nat) :
(utf8GetAux cs a i).utf8Size + b₁ = utf8ByteSize.go cs + b₂ → (utf8GetAux (utf8SetAux c cs a i) a i).utf8Size + b₁ = utf8ByteSize.go (utf8SetAux c cs a i) + b₂
theorem String.mapAux_lemma (s : String) (i : Pos) (c : Char) (h : ¬s.atEnd i = true) :
@[irreducible, specialize #[]]
def String.mapAux (f : Char → Char) (i : Pos) (s : String) :
Equations
@[inline]
def String.map (f : Char → Char) (s : String) :

Applies the function f to every character in a string, returning a string that contains the resulting characters.

Examples:
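A brief sketch, using `Char.toUpper` as the mapped function:

```lean
#eval "abc123".map Char.toUpper   -- "ABC123"
#eval "".map Char.toUpper         -- ""
```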

Equations
@[inline]

Checks whether the string can be interpreted as the decimal representation of a natural number.

A string can be interpreted as a decimal natural number if it is not empty and all the characters in it are digits.

Use String.toNat? or String.toNat! to convert such a string to a natural number.

Examples:
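A brief sketch that follows the rule stated above (non-empty, digits only). It assumes the declaration documented here is `String.isNat`, which the text refers to by name elsewhere on this page:

```lean
#eval "587".isNat    -- true
#eval "05".isNat     -- true: leading zeros are still digits
#eval "".isNat       -- false
#eval "-587".isNat   -- false: '-' is not a digit
```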

Equations

Interprets a string as the decimal representation of a natural number, returning it. Returns none if the string does not contain a decimal natural number.

A string can be interpreted as a decimal natural number if it is not empty and all the characters in it are digits.

Use String.isNat to check whether String.toNat? would return some. String.toNat! is an alternative that panics instead of returning none when the string is not a natural number.

Examples:
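A brief sketch of the conversion under the same rule (non-empty, digits only):

```lean
#eval "587".toNat?    -- some 587
#eval "".toNat?       -- none
#eval "-587".toNat?   -- none
```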

Equations
def String.substrEq (s1 : String) (pos1 : Pos) (s2 : String) (pos2 : Pos) (sz : Nat) :

Checks whether substrings of two strings are equal. Substrings are indicated by their starting positions and a size in UTF-8 bytes. Returns false if the indicated substring does not exist in either string.
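A minimal sketch of a call, comparing the last five bytes of one string with the whole of another. The strings are arbitrary examples:

```lean
#eval "hello world".substrEq ⟨6⟩ "world" 0 5   -- true
#eval "hello world".substrEq ⟨6⟩ "world" 0 6   -- false: neither string has 6 bytes available
```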

Equations
  • One or more equations did not get rendered due to their size.
@[irreducible]
def String.substrEq.loop (s1 s2 : String) (off1 off2 stop1 : Pos) :
Equations
  • One or more equations did not get rendered due to their size.

Checks whether the first string (p) is a prefix of the second (s).

String.startsWith is a version that takes the potential prefix after the string.

Examples:
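A brief sketch, assuming the declaration documented here is `String.isPrefixOf` (the name is not shown above):

```lean
#eval "red".isPrefixOf "red green blue"     -- true
#eval "green".isPrefixOf "red green blue"   -- false
#eval "".isPrefixOf "red green blue"        -- true
```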

Equations
def String.replace (s pattern replacement : String) :

In the string s, replaces all occurrences of pattern with replacement.

Examples:

  • "red green blue".replace "e" "" = "rd grn blu"
  • "red green blue".replace "ee" "E" = "red grEn blue"
  • "red green blue".replace "e" "E" = "rEd grEEn bluE"
Equations
@[irreducible]
def String.replace.loop (s pattern replacement : String) (hPatt : 0 < pattern.endPos.byteIdx) (acc : String) (accStop pos : Pos) :
Equations
  • One or more equations did not get rendered due to their size.
def String.findLineStart (s : String) (pos : Pos) :

Returns the position of the beginning of the line that contains the position pos.

Lines are ended by '\n', and the returned position is either 0 : String.Pos or immediately after a '\n' character.
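A minimal sketch with a two-line string, in which the '\n' character sits at byte 8. The results are projected to byte indices so that they print as plain numbers:

```lean
#eval ("line one\nline two".findLineStart ⟨12⟩).byteIdx   -- 9: just after the '\n'
#eval ("line one\nline two".findLineStart ⟨3⟩).byteIdx    -- 0: the first line starts at 0
```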

Equations
@[inline]

Checks whether a substring is empty.

A substring is empty if its start and end positions are the same.

Equations
@[inline]

Copies the region of the underlying string pointed to by a substring into a fresh string.

Equations
@[inline]

Returns an iterator into the underlying string, at the substring's starting position. The ending position is discarded, so the iterator alone cannot be used to determine whether its current position is within the original substring.

Equations
  • { str := s, startPos := b, stopPos := e }.toIterator = { s := s, i := b }
@[inline]

Returns the character at the given position in the substring.

The position is relative to the substring, rather than the underlying string, and no bounds checking is performed with respect to the substring's end position. If the relative position is not a valid position in the underlying string, the fallback value (default : Char), which is 'A', is returned. Does not panic.

Equations
  • { str := s, startPos := b, stopPos := stopPos }.get x✝ = s.get (b + x✝)
@[inline]

Returns the next position in a substring after the given position. If the position is at the end of the substring, it is returned unmodified.

Both the input position and the returned position are interpreted relative to the substring's start position, not the underlying string.

Equations
@[inline]

Returns the previous position in a substring, just prior to the given position. If the position is at the beginning of the substring, it is returned unmodified.

Both the input position and the returned position are interpreted relative to the substring's start position, not the underlying string.

Equations

Returns the position that's the specified number of characters forward from the given position in a substring. If the end position of the substring is reached, it is returned.

Both the input position and the returned position are interpreted relative to the substring's start position, not the underlying string.

Equations

Returns the position that's the specified number of characters prior to the given position in a substring. If the start position of the substring is reached, it is returned.

Both the input position and the returned position are interpreted relative to the substring's start position, not the underlying string.

Equations
@[inline]

Returns the first character in the substring.

If the substring is empty, but the substring's start position is a valid position in the underlying string, then the character at the start position is returned. If the substring's start position is not a valid position in the string, the fallback value (default : Char), which is 'A', is returned. Does not panic.

Equations
@[inline]

Returns the substring-relative position of the first occurrence of c in s, or s.bsize if c doesn't occur.

Equations
@[inline]

Removes the specified number of characters (Unicode code points) from the beginning of a substring by advancing its start position.

If the substring's end position is reached, the start position is not advanced past it.

Equations
  • { str := s, startPos := b, stopPos := e }.drop x✝ = { str := s, startPos := b + { str := s, startPos := b, stopPos := e }.nextn x✝ 0, stopPos := e }
@[inline]

Removes the specified number of characters (Unicode code points) from the end of a substring by moving its end position towards its start position.

If the substring's start position is reached, the end position is not retracted past it.

Equations
  • One or more equations did not get rendered due to their size.
@[inline]

Retains only the specified number of characters (Unicode code points) at the beginning of a substring, by moving its end position towards its start position.

If the substring's start position is reached, the end position is not retracted past it.

Equations
  • { str := s, startPos := b, stopPos := e }.take x✝ = { str := s, startPos := b, stopPos := b + { str := s, startPos := b, stopPos := e }.nextn x✝ 0 }
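Dropping and taking only move a substring's positions, so they can be chained cheaply before copying the result out. A brief sketch, assuming the declarations documented here are `Substring.drop` and `Substring.take`, and using `String.toSubstring` and `Substring.toString` for the conversions:

```lean
#eval ("red green blue".toSubstring.drop 4).toString            -- "green blue"
#eval (("red green blue".toSubstring.drop 4).take 5).toString   -- "green"
```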
@[inline]

Retains only the specified number of characters (Unicode code points) at the end of a substring, by moving its start position towards its end position.

If the substring's end position is reached, the start position is not advanced past it.

Equations
  • One or more equations did not get rendered due to their size.
@[inline]

Checks whether a position in a substring is precisely equal to its ending position.

The position is understood relative to the substring's starting position, rather than the underlying string's starting position.

Equations
  • { str := s, startPos := b, stopPos := stopPos }.atEnd x✝ = (b + x✝ == stopPos)
@[inline]

Returns the region of the substring delimited by the provided start and stop positions, as a substring. The positions are interpreted with respect to the substring's start position, rather than the underlying string.

If the resulting substring is empty, then the resulting substring is a substring of the empty string "". Otherwise, the underlying string is that of the input substring with the beginning and end positions adjusted.

Equations
  • { str := s, startPos := b, stopPos := e }.extract x✝¹ x✝ = if x✝¹ ≥ x✝ then { str := "", startPos := 0, stopPos := 0 } else { str := s, startPos := e.min (b + x✝¹), stopPos := e.min (b + x✝) }
def Substring.splitOn (s : Substring) (sep : String := " ") :

Splits a substring s on occurrences of the separator string sep. The default separator is " ".

When sep is empty, the result is [s]. When sep occurs in overlapping patterns, the first match is taken. There will always be exactly n+1 elements in the returned list if there were n non-overlapping matches of sep in the string. The separators are not included in the returned substrings, which are all substrings of s's string.

Equations
@[irreducible]
def Substring.splitOn.loop (s : Substring) (sep : String := " ") (b i j : String.Pos) (r : List Substring) :
Equations
  • One or more equations did not get rendered due to their size.
@[inline]
def Substring.foldl {α : Type u} (f : α → Char → α) (init : α) (s : Substring) :
α

Folds a function over a substring from the left, accumulating a value starting with init. The accumulated value is combined with each character in order, using f.

Equations
@[inline]
def Substring.foldr {α : Type u} (f : Char → α → α) (init : α) (s : Substring) :
α

Folds a function over a substring from the right, accumulating a value starting with init. The accumulated value is combined with each character in reverse order, using f.

Equations
@[inline]
def Substring.any (s : Substring) (p : Char → Bool) :

Checks whether the Boolean predicate p returns true for any character in a substring.

Short-circuits at the first character for which p returns true.

Equations
  • { str := s_1, startPos := b, stopPos := e }.any p = s_1.anyAux e p b
@[inline]
def Substring.all (s : Substring) (p : Char → Bool) :

Checks whether the Boolean predicate p returns true for every character in a substring.

Short-circuits at the first character for which p returns false.
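
A short sketch of `Substring.any` and `Substring.all`, assuming the API shown on this page. The predicates `Char.isDigit` and `Char.isAlpha` are used purely as examples.

```lean
#eval "abc123".toSubstring.any Char.isDigit  -- true
#eval "abc123".toSubstring.all Char.isAlpha  -- false

-- Only the characters inside the substring are examined.
#eval ("abc123".toSubstring.drop 3).all Char.isDigit  -- true

-- On an empty substring, `any` is false and `all` is true.
#eval "".toSubstring.any Char.isDigit  -- false
#eval "".toSubstring.all Char.isDigit  -- true
```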

Equations
@[inline]

Checks whether a substring contains the specified character.

Equations
@[irreducible, specialize #[]]
def Substring.takeWhileAux (s : String) (stopPos : String.Pos) (p : Char → Bool) (i : String.Pos) :
Equations
@[inline]

Retains only the longest prefix of a substring in which a Boolean predicate returns true for all characters by moving the substring's end position towards its start position.

Equations
@[inline]

Removes the longest prefix of a substring in which a Boolean predicate returns true for all characters by moving the substring's start position. The start position is moved to the position of the first character for which the predicate returns false, or to the substring's end position if the predicate always returns true.

Equations
@[irreducible, specialize #[]]
Equations
  • One or more equations did not get rendered due to their size.
@[inline]

Retains only the longest suffix of a substring in which a Boolean predicate returns true for all characters by moving the substring's start position towards its end position.

Equations
@[inline]

Removes the longest suffix of a substring in which a Boolean predicate returns true for all characters by moving the substring's end position. The end position is moved just after the position of the last character for which the predicate returns false, or to the substring's start position if the predicate always returns true.
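
The four predicate-based operations described above (`takeWhile`, `dropWhile`, `takeRightWhile`, `dropRightWhile` on `Substring`) can be compared side by side. This is a sketch on an illustrative input, assuming the names match the toolchain in use.

```lean
#eval ("123abc".toSubstring.takeWhile Char.isDigit).toString       -- "123"
#eval ("123abc".toSubstring.dropWhile Char.isDigit).toString       -- "abc"
#eval ("123abc".toSubstring.takeRightWhile Char.isAlpha).toString  -- "abc"
#eval ("123abc".toSubstring.dropRightWhile Char.isAlpha).toString  -- "123"

-- Only the positions move; the underlying string is unchanged.
#eval ("123abc".toSubstring.dropWhile Char.isDigit).startPos.byteIdx  -- 3
#eval ("123abc".toSubstring.dropWhile Char.isDigit).str               -- "123abc"
```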

Equations
@[inline]

Removes leading whitespace from a substring by moving its start position to the first non-whitespace character, or to its end position if there is no non-whitespace character.

“Whitespace” is defined as characters for which Char.isWhitespace returns true.

Equations
@[inline]

Removes trailing whitespace from a substring by moving its end position to the last non-whitespace character, or to its start position if there is no non-whitespace character.

“Whitespace” is defined as characters for which Char.isWhitespace returns true.

Equations
@[inline]

Removes leading and trailing whitespace from a substring by first moving its start position to the first non-whitespace character, and then moving its end position to the last non-whitespace character.

If the substring consists only of whitespace, then the resulting substring's start position is moved to its end position.

“Whitespace” is defined as characters for which Char.isWhitespace returns true.

Examples:

  • " red green blue ".toSubstring.trim.toString = "red green blue"
  • " red green blue ".toSubstring.trim.startPos = ⟨1⟩
  • " red green blue ".toSubstring.trim.stopPos = ⟨15⟩
  • "     ".toSubstring.trim.startPos = ⟨5⟩
Equations
  • One or more equations did not get rendered due to their size.
@[inline]

Checks whether the substring can be interpreted as the decimal representation of a natural number.

A substring can be interpreted as a decimal natural number if it is not empty and all the characters in it are digits.

Use Substring.toNat? to convert such a substring to a natural number.

Equations

Checks whether the substring can be interpreted as the decimal representation of a natural number, returning the number if it can.

A substring can be interpreted as a decimal natural number if it is not empty and all the characters in it are digits.

Use Substring.isNat to check whether the substring is such a substring.
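
A brief sketch of `Substring.isNat` and `Substring.toNat?` under the rule stated above (non-empty, digits only). The inputs are illustrative, and the behaviour on other digit-like syntax may differ between Lean releases.

```lean
#eval "123".toSubstring.isNat   -- true
#eval "12a".toSubstring.isNat   -- false
#eval "".toSubstring.isNat      -- false

#eval "123".toSubstring.toNat?  -- some 123
#eval "12a".toSubstring.toNat?  -- none

-- Parsing a region of a larger string without copying it first.
#eval ("x=42".toSubstring.drop 2).toNat?  -- some 42
```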

Equations
def Substring.beq (ss1 ss2 : Substring) :

Checks whether two substrings represent equal strings. Usually accessed via the == operator.

Two substrings do not need to have the same underlying string or the same start and end positions; instead, they are equal if they contain the same sequence of characters.

Equations

Checks whether two substrings have the same position and content.

The two substrings do not need to have the same underlying string for this check to succeed.
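
The difference between `==` (content only) and `Substring.sameAs` (position and content) can be seen in a small sketch. It assumes `sameAs` compares start positions and contents as described above; the strings are made up for the example.

```lean
-- Same characters, different underlying strings and positions: equal under ==.
#eval ("abc".toSubstring == "xabc".toSubstring.drop 1)  -- true

-- sameAs also requires matching positions (start 0 versus start 1 here).
#eval "abc".toSubstring.sameAs ("xabc".toSubstring.drop 1)  -- false

-- Different underlying strings are fine as long as position and content agree.
#eval "abc".toSubstring.sameAs ("abcd".toSubstring.dropRight 1)  -- true
```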

Equations

Returns the longest common prefix of two substrings.

The returned substring uses the same underlying string as s.

Equations
@[irreducible]

Returns the ending position of the common prefix, working up from spos, tpos.

Equations
  • One or more equations did not get rendered due to their size.

Returns the longest common suffix of two substrings.

The returned substring uses the same underlying string as s.
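
A hedged sketch of `Substring.commonPrefix` and `Substring.commonSuffix` on illustrative inputs, assuming the API shown on this page:

```lean
#eval ("red green".toSubstring.commonPrefix "red blue".toSubstring).toString    -- "red "
#eval ("light blue".toSubstring.commonSuffix "dark blue".toSubstring).toString  -- " blue"

-- The result is a view into the first argument's underlying string.
#eval ("light blue".toSubstring.commonSuffix "dark blue".toSubstring).str  -- "light blue"
```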

Equations
@[irreducible]

Returns the starting position of the common suffix, working down from spos, tpos.

Equations
  • One or more equations did not get rendered due to their size.

If pre is a prefix of s, returns the remainder. Returns none otherwise.

The substring pre is a prefix of s if there exists a t : Substring such that s.toString = pre.toString ++ t.toString. If so, the result is the substring of s without the prefix.

Equations

If suff is a suffix of s, returns the remainder. Returns none otherwise.

The substring suff is a suffix of s if there exists a t : Substring such that s.toString = t.toString ++ suff.toString. If so, the result is the substring of s without the suffix.
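
The prefix and suffix removal described above (`Substring.dropPrefix?` and `Substring.dropSuffix?`) might be exercised as follows. This is a sketch with made-up inputs; the results are mapped to `String` only to make them easy to print.

```lean
#eval ("red green".toSubstring.dropPrefix? "red ".toSubstring).map (·.toString)    -- some "green"
#eval ("red green".toSubstring.dropPrefix? "blue".toSubstring).map (·.toString)    -- none
#eval ("red green".toSubstring.dropSuffix? " green".toSubstring).map (·.toString)  -- some "red"
```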

Equations
@[inline]
def String.drop (s : String) (n : Nat) :

Removes the specified number of characters (Unicode code points) from the start of the string.

If n is greater than s.length, returns "".

Examples:

  • "red green blue".drop 4 = "green blue"
  • "red green blue".drop 10 = "blue"
  • "red green blue".drop 50 = ""
Equations
@[inline]

Removes the specified number of characters (Unicode code points) from the end of the string.

If n is greater than s.length, returns "".

Examples:
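
The example list is missing here. As an illustrative sketch (not the original examples), assuming the function is `String.dropRight` as in current Lean core:

```lean
#eval "red green blue".dropRight 5   -- "red green"
#eval "red green blue".dropRight 11  -- "red"
#eval "red green blue".dropRight 50  -- ""
```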

Equations
@[inline]
def String.take (s : String) (n : Nat) :

Creates a new string that contains the first n characters (Unicode code points) of s.

If n is greater than s.length, returns s.

Examples:

  • "red green blue".take 3 = "red"
  • "red green blue".take 1 = "r"
  • "red green blue".take 0 = ""
  • "red green blue".take 100 = "red green blue"
Equations
@[inline]

Creates a new string that contains the last n characters (Unicode code points) of s.

If n is greater than s.length, returns s.

Examples:
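
The example list is missing here. As an illustrative sketch (not the original examples), assuming the function is `String.takeRight` as in current Lean core:

```lean
#eval "red green blue".takeRight 4    -- "blue"
#eval "red green blue".takeRight 10   -- "green blue"
#eval "red green blue".takeRight 100  -- "red green blue"
```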

Equations
@[inline]
def String.takeWhile (s : String) (p : Char → Bool) :

Creates a new string that contains the longest prefix of s in which p returns true for all characters.

Examples:

  • "red green blue".takeWhile (·.isLetter) = "red"
  • "red green blue".takeWhile (· == 'r') = "r"
  • "red green blue".takeWhile (· != 'n') = "red gree"
  • "red green blue".takeWhile (fun _ => true) = "red green blue"
Equations
@[inline]
def String.dropWhile (s : String) (p : Char → Bool) :

Creates a new string by removing the longest prefix from s in which p returns true for all characters.

Examples:

  • "red green blue".dropWhile (·.isLetter) = " green blue"
  • "red green blue".dropWhile (· == 'r') = "ed green blue"
  • "red green blue".dropWhile (· != 'n') = "n blue"
  • "red green blue".dropWhile (fun _ => true) = ""
Equations
@[inline]

Creates a new string that contains the longest suffix of s in which p returns true for all characters.

Examples:

Equations
@[inline]

Creates a new string by removing the longest suffix from s in which p returns true for all characters.

Examples:
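
The example lists for the two suffix-based operations are missing. The sketch below is illustrative only and assumes the functions are `String.takeRightWhile` and `String.dropRightWhile`, mirroring `takeWhile` and `dropWhile` above.

```lean
#eval "red green blue".takeRightWhile Char.isAlpha    -- "blue"
#eval "red green blue".takeRightWhile (· != ' ')      -- "blue"

#eval "red green blue".dropRightWhile Char.isAlpha    -- "red green "
#eval "red green blue".dropRightWhile (fun _ => true) -- ""
```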

Equations
@[inline]

Checks whether the first string (s) begins with the second (pre).

String.isPrefixOf is a version that takes the potential prefix before the string.

Examples:

Equations
@[inline]
def String.endsWith (s post : String) :

Checks whether the first string (s) ends with the second (post).

Examples:
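
The example lists for the prefix and suffix checks are missing. A small illustrative sketch, assuming `String.startsWith`, `String.endsWith` and `String.isPrefixOf` as in current Lean core:

```lean
#eval "red green blue".startsWith "red"    -- true
#eval "red green blue".startsWith "green"  -- false
#eval "red green blue".endsWith "blue"     -- true

-- Same check as startsWith, with the potential prefix first.
#eval "red".isPrefixOf "red green blue"    -- true
```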

Equations
@[inline]

Removes trailing whitespace from a string.

“Whitespace” is defined as characters for which Char.isWhitespace returns true.

Examples:

Equations
@[inline]

Removes leading whitespace from a string.

“Whitespace” is defined as characters for which Char.isWhitespace returns true.

Examples:
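
The example lists for one-sided trimming are missing. The following sketch is illustrative and assumes the functions are `String.trimRight` and `String.trimLeft`:

```lean
#eval "  abc  ".trimRight  -- "  abc"
#eval "abc\n".trimRight    -- "abc"

#eval "  abc  ".trimLeft   -- "abc  "
#eval "   ".trimLeft       -- ""
```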

Equations
@[inline]

Removes leading and trailing whitespace from a string.

“Whitespace” is defined as characters for which Char.isWhitespace returns true.

Examples:

  • "abc".trim = "abc"
  • " abc".trim = "abc"
  • "abc \t ".trim = "abc"
  • " abc ".trim = "abc"
  • "abc\ndef\n".trim = "abc\ndef"
Equations
@[inline]
def String.nextWhile (s : String) (p : Char → Bool) (i : Pos) :

Repeatedly increments a position in a string, as if by String.next, while the predicate p returns true for the character at the position. Stops incrementing at the end of the string or when p returns false for the current character.

Examples:

Equations
@[inline]
def String.nextUntil (s : String) (p : Char → Bool) (i : Pos) :

Repeatedly increments a position in a string, as if by String.next, while the predicate p returns false for the character at the position. Stops incrementing at the end of the string or when p returns true for the current character.

Examples:
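
The example lists for `String.nextWhile` and `String.nextUntil` are missing. As a hedged sketch on made-up inputs, the resulting positions can be inspected through `byteIdx` (positions are byte offsets, so multi-byte characters advance by more than one):

```lean
#eval ("123abc".nextWhile Char.isDigit 0).byteIdx        -- 3
#eval ("123".nextWhile Char.isDigit 0).byteIdx           -- 3 (stops at the end)
#eval ("key=value".nextUntil (· == '=') 0).byteIdx       -- 3

-- 'λ' occupies two bytes in UTF-8, so two of them advance the position by 4.
#eval ("λλ1".nextWhile (· == 'λ') 0).byteIdx             -- 4
```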

Equations
@[inline]

Replaces each character in s with the result of applying Char.toUpper to it.

Char.toUpper has no effect on characters outside of the range 'a'–'z'.

Examples:

Equations
@[inline]

Replaces each character in s with the result of applying Char.toLower to it.

Char.toLower has no effect on characters outside of the range 'A'–'Z'.

Examples:

Equations
@[inline]

Replaces the first character in s with the result of applying Char.toUpper to it. Returns the empty string if the string is empty.

Char.toUpper has no effect on characters outside of the range 'a'–'z'.

Examples:

Equations
@[inline]

Replaces the first character in s with the result of applying Char.toLower to it. Returns the empty string if the string is empty.

Char.toLower has no effect on characters outside of the range 'A'–'Z'.

Examples:
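
The example lists for the case-conversion functions are missing. The sketch below is illustrative and assumes the functions are `String.toUpper`, `String.toLower`, `String.capitalize` and `String.decapitalize`:

```lean
#eval "Lean 4".toUpper      -- "LEAN 4"
#eval "Lean 4".toLower      -- "lean 4"

-- Characters outside the ASCII letter ranges are left alone.
#eval "λx".toUpper          -- "λX"

#eval "lean".capitalize     -- "Lean"
#eval "Lean".decapitalize   -- "lean"
#eval "".capitalize         -- ""
```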

Equations

If pre is a prefix of s, returns the remainder. Returns none otherwise.

The string pre is a prefix of s if there exists a t : String such that s = pre ++ t. If so, the result is some t.

Use String.stripPrefix to return the string unchanged when pre is not a prefix.

Examples:

Equations

If suff is a suffix of s, returns the remainder. Returns none otherwise.

The string suff is a suffix of s if there exists a t : String such that s = t ++ suff. If so, the result is some t.

Use String.stripSuffix to return the string unchanged when suff is not a suffix.

Examples:

Equations

If pre is a prefix of s, returns the remainder. Returns s unmodified otherwise.

The string pre is a prefix of s if there exists a t : String such that s = pre ++ t. If so, the result is t. Otherwise, it is s.

Use String.dropPrefix? to return none when pre is not a prefix.

Examples:

Equations

If suff is a suffix of s, returns the remainder. Returns s unmodified otherwise.

The string suff is a suffix of s if there exists a t : String such that s = t ++ suff. If so, the result is t. Otherwise, it is s.

Use String.dropSuffix? to return none when suff is not a suffix.

Examples:
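
The example lists for the four prefix and suffix removal functions are missing. This illustrative sketch assumes the names `String.dropPrefix?`, `String.dropSuffix?`, `String.stripPrefix` and `String.stripSuffix`. The optional results are passed through `toString` so that the printed output does not depend on their exact element type.

```lean
#eval ("red green".dropPrefix? "red ").map toString    -- some "green"
#eval ("red green".dropPrefix? "blue").map toString    -- none
#eval ("red green".dropSuffix? " green").map toString  -- some "red"

-- The strip variants return the input unchanged when nothing matches.
#eval "red green".stripPrefix "red "    -- "green"
#eval "red green".stripPrefix "blue"    -- "red green"
#eval "red green".stripSuffix " green"  -- "red"
```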

Equations
@[inline]

Constructs a singleton string that contains only the provided character.

Examples:

Equations
@[simp]
theorem String.ext {s₁ s₂ : String} (h : s₁.data = s₂.data) :
s₁ = s₂
theorem String.ext_iff {s₁ s₂ : String} :
s₁ = s₂ ↔ s₁.data = s₂.data
@[simp]
@[simp]
theorem String.length_mk (s : List Char) :
{ data := s }.length = s.length
@[simp]
@[simp]
@[simp]
theorem String.length_push {s : String} (c : Char) :
(s.push c).length = s.length + 1
@[simp]
theorem String.length_pushn {s : String} (c : Char) (n : Nat) :
(s.pushn c n).length = s.length + n
@[simp]
theorem String.length_append (s t : String) :
(s ++ t).length = s.length + t.length
@[simp]
theorem String.data_push (s : String) (c : Char) :
(s.push c).data = s.data ++ [c]
@[simp]
theorem String.data_append (s t : String) :
(s ++ t).data = s.data ++ t.data
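
As a sketch of how the length and data lemmas above can be used in a proof, the examples below rewrite with them by name. They assume the lemma names and statements listed on this page; later Lean releases may rename or restate them.

```lean
-- Rewriting with length_push and then length_append leaves a trivial goal.
example (s t : String) (c : Char) :
    ((s ++ t).push c).length = s.length + t.length + 1 := by
  rw [String.length_push, String.length_append]

-- Strings with equal character lists are equal.
example (s₁ s₂ : String) (h : s₁.data = s₂.data) : s₁ = s₂ :=
  String.ext h
```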
theorem String.lt_iff {s t : String} :
s < t ↔ s.data < t.data
theorem String.Pos.byteIdx_mk (n : Nat) :
{ byteIdx := n }.byteIdx = n
@[simp]
theorem String.Pos.mk_zero :
{ } = 0
@[simp]
theorem String.Pos.mk_byteIdx (p : Pos) :
{ byteIdx := p.byteIdx } = p
theorem String.Pos.ext {i₁ i₂ : Pos} (h : i₁.byteIdx = i₂.byteIdx) :
i₁ = i₂
theorem String.Pos.ext_iff {i₁ i₂ : Pos} :
i₁ = i₂ ↔ i₁.byteIdx = i₂.byteIdx
@[simp]
theorem String.Pos.add_byteIdx (p₁ p₂ : Pos) :
(p₁ + p₂).byteIdx = p₁.byteIdx + p₂.byteIdx
theorem String.Pos.add_eq (p₁ p₂ : Pos) :
p₁ + p₂ = { byteIdx := p₁.byteIdx + p₂.byteIdx }
@[simp]
theorem String.Pos.sub_byteIdx (p₁ p₂ : Pos) :
(p₁ - p₂).byteIdx = p₁.byteIdx - p₂.byteIdx
theorem String.Pos.sub_eq (p₁ p₂ : Pos) :
p₁ - p₂ = { byteIdx := p₁.byteIdx - p₂.byteIdx }
@[simp]
theorem String.Pos.addChar_byteIdx (p : Pos) (c : Char) :
(p + c).byteIdx = p.byteIdx + c.utf8Size
theorem String.Pos.addChar_eq (p : Pos) (c : Char) :
p + c = { byteIdx := p.byteIdx + c.utf8Size }
theorem String.Pos.zero_addChar_eq (c : Char) :
0 + c = { byteIdx := c.utf8Size }
theorem String.Pos.addChar_right_comm (p : Pos) (c₁ c₂ : Char) :
p + c₁ + c₂ = p + c₂ + c₁
theorem String.Pos.ne_of_lt {i₁ i₂ : Pos} (h : i₁ < i₂) :
i₁ ≠ i₂
theorem String.Pos.ne_of_gt {i₁ i₂ : Pos} (h : i₁ < i₂) :
i₂ ≠ i₁
@[reducible, inline, deprecated String.Pos.byteIdx_addString (since := "2025-03-18")]
Equations
theorem String.Pos.addString_eq (p : Pos) (s : String) :
p + s = { byteIdx := p.byteIdx + s.utf8ByteSize }
@[reducible, inline, deprecated String.Pos.byteIdx_zero_addString (since := "2025-03-18")]
Equations
theorem String.Pos.zero_addString_eq (s : String) :
0 + s = { byteIdx := s.utf8ByteSize }
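
The `addChar` and `addString` lemmas say that adding a character or string to a position advances it by the UTF-8 size in bytes. A minimal sketch, assuming the `+` instances on `String.Pos` that these lemmas describe:

```lean
#eval ((0 : String.Pos) + 'a').byteIdx   -- 1
#eval ((0 : String.Pos) + 'λ').byteIdx   -- 2 ('λ' is two bytes in UTF-8)
#eval ((0 : String.Pos) + "aλ").byteIdx  -- 3

-- The order in which two characters are added does not matter.
example (p : String.Pos) (c₁ c₂ : Char) : p + c₁ + c₂ = p + c₂ + c₁ :=
  String.Pos.addChar_right_comm p c₁ c₂
```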
theorem String.Pos.le_iff {i₁ i₂ : Pos} :
i₁ ≤ i₂ ↔ i₁.byteIdx ≤ i₂.byteIdx
@[simp]
theorem String.Pos.mk_le_mk {i₁ i₂ : Nat} :
{ byteIdx := i₁ } ≤ { byteIdx := i₂ } ↔ i₁ ≤ i₂
theorem String.Pos.lt_iff {i₁ i₂ : Pos} :
i₁ < i₂ ↔ i₁.byteIdx < i₂.byteIdx
@[simp]
theorem String.Pos.mk_lt_mk {i₁ i₂ : Nat} :
{ byteIdx := i₁ } < { byteIdx := i₂ } ↔ i₁ < i₂
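
The ordering lemmas reduce comparisons of positions to comparisons of their byte indices. The sketch below applies them directly, assuming the lemma names listed on this page:

```lean
-- Comparing concrete positions reduces to comparing natural numbers.
example : (⟨3⟩ : String.Pos) < (⟨5⟩ : String.Pos) :=
  String.Pos.mk_lt_mk.mpr (by decide)

example : (⟨3⟩ : String.Pos) ≤ (⟨3⟩ : String.Pos) :=
  String.Pos.mk_le_mk.mpr (Nat.le_refl 3)

-- A strict inequality rules out equality.
example (p q : String.Pos) (h : p < q) : p ≠ q :=
  String.Pos.ne_of_lt h
```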
@[simp]
theorem String.get!_eq_get (s : String) (p : Pos) :
s.get! p = s.get p
theorem String.lt_next' (s : String) (p : Pos) :
p < s.next p
@[simp]
theorem String.prev_zero (s : String) :
s.prev 0 = 0
@[simp]
theorem String.get'_eq (s : String) (p : Pos) (h : ¬s.atEnd p = true) :
s.get' p h = s.get p
@[simp]
theorem String.next'_eq (s : String) (p : Pos) (h : ¬s.atEnd p = true) :
s.next' p h = s.next p
theorem String.singleton_eq (c : Char) :
singleton c = { data := [c] }
@[simp]
@[simp]
theorem String.append_empty (s : String) :
s ++ "" = s
@[simp]
theorem String.empty_append (s : String) :
"" ++ s = s
theorem String.append_assoc (s₁ s₂ s₃ : String) :
s₁ ++ s₂ ++ s₃ = s₁ ++ (s₂ ++ s₃)
@[simp]
theorem Substring.prev_zero (s : Substring) :
s.prev 0 = 0
@[simp]
theorem Substring.prevn_zero (s : Substring) (n : Nat) :
s.prevn n 0 = 0