class String

Overview

A String represents an immutable sequence of UTF-8 characters.

A String is typically created with a string literal, enclosing UTF-8 characters in double quotes:

"hello world"

A backslash can be used to denote some characters inside the string:

"\"" # double quote
"\\" # backslash
"\e" # escape
"\f" # form feed
"\n" # newline
"\r" # carriage return
"\t" # tab
"\v" # vertical tab

You can use a backslash followed by a u and four hexadecimal characters to denote a Unicode codepoint:

"\u0041" # == "A"

Or you can use curly braces and specify up to six hexadecimal digits (0 to 10FFFF):

"\u{41}" # == "A"

A string can span multiple lines:

"hello
      world" # same as "hello\n      world"

Note that in the above example trailing and leading spaces, as well as newlines, end up in the resulting string. To avoid this, you can split a string into multiple lines by joining multiple literals with a backslash:

"hello " \
"world, " \
"no newlines" # same as "hello world, no newlines"

Alternatively, a backslash followed by a newline can be inserted inside the string literal:

"hello \
     world, \
     no newlines" # same as "hello world, no newlines"

In this case, leading whitespace is not included in the resulting string.

If you need to write a string that has many double quotes, parentheses, or similar characters, you can use alternative literals:

# Supports double quotes and nested parentheses
%(hello ("world")) # same as "hello (\"world\")"

# Supports double quotes and nested brackets
%[hello ["world"]] # same as "hello [\"world\"]"

# Supports double quotes and nested curlies
%{hello {"world"}} # same as "hello {\"world\"}"

# Supports double quotes and nested angles
%<hello <"world">> # same as "hello <\"world\">"

To create a String with embedded expressions, you can use string interpolation:

a = 1
b = 2
"sum = #{a + b}" # "sum = 3"

This ends up invoking Object#to_s(IO) on each expression enclosed by #{...}.
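
As a minimal sketch of how interpolation relies on to_s(IO), the example below defines a type and interpolates it. Point is a hypothetical struct introduced only for this illustration; it is not part of the standard library.

```crystal
# Point is a hypothetical type used only for this illustration.
struct Point
  def initialize(@x : Int32, @y : Int32)
  end

  # Interpolation appends the value to the resulting string through this method.
  def to_s(io : IO) : Nil
    io << "(" << @x << ", " << @y << ")"
  end
end

point = Point.new(1, 2)
message = "point = #{point}"
puts message # point = (1, 2)
```

Overriding to_s(io) rather than to_s avoids allocating an intermediate string for each interpolated value.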

If you need to dynamically build a string, use String#build or IO::Memory.
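
The two approaches can be sketched side by side as follows. The variable names (built, buffer) are illustrative only.

```crystal
# Build a string piece by piece with String.build.
built = String.build do |io|
  io << "sum = "
  io << (1 + 2)
end
puts built # sum = 3

# The same result using an IO::Memory buffer.
buffer = IO::Memory.new
buffer << "sum = "
buffer << (1 + 2)
puts buffer.to_s # sum = 3
```

String.build is usually the shorter choice when only the final String is needed, while IO::Memory is handy when the buffer also has to be passed around as an IO.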

Non UTF-8 valid strings

A String might end up being composed of bytes that form an invalid byte sequence according to UTF-8. This can happen if the string is created via one of the constructors that accept bytes, or when getting a string from String.build or IO::Memory. No exception will be raised, but invalid byte sequences, when read as chars, are replaced by the Unicode replacement character (value 0xFFFD). For example:

# here 255 is not a valid byte value in the UTF-8 encoding
string = String.new(Bytes[255, 97])
string.valid_encoding? # => false

# The first char here is the unicode replacement char
string.chars # => ['�', 'a']

One can also create strings with specific byte values in them by using octal and hexadecimal escape sequences:

# Octal escape sequences
"\101" # => "A"
"\12"  # => "\n"
"\1"   # string with one character with code point 1
"\377" # string with one byte with value 255

# Hexadecimal escape sequences
"\x41" # => "A"
"\xFF" # string with one byte with value 255

The reason for allowing strings that don't have a valid UTF-8 sequence is that the world is full of content that isn't properly encoded, and having a program raise an exception or stop because of this is not good. It's better if programs are more resilient, but show a replacement character when there's an error in incoming data.
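
A short sketch of this resilience in practice, assuming the incoming data is the same two-byte example used above. It combines #valid_encoding? with #scrub to turn invalid input into a valid string; the variable names are illustrative.

```crystal
# 255 is never valid in UTF-8; 97 is the letter "a".
incoming = String.new(Bytes[255, 97])
puts incoming.valid_encoding? # false

# No exception is raised, and the raw bytes are still available.
puts incoming.bytesize # 2

# scrub replaces each invalid sequence with the replacement character.
cleaned = incoming.scrub
puts cleaned.valid_encoding? # true
puts cleaned == "\uFFFDa"    # true
```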

Included Modules

Comparable(String)

Defined in:

big/big_decimal.cr
big/big_float.cr
big/big_int.cr
json/to_json.cr
string.cr
string/utf16.cr
yaml/to_yaml.cr

Constructors

.build(capacity = 64, &) : self
Builds a String by creating a String::Builder with the given initial capacity, yielding it to the block and finally getting a String out of it.
.from_utf16(slice : Slice(UInt16)) : String
Decodes the given UTF-16 slice into a String.
.new(pull : JSON::PullParser)
.new(capacity : Int, &)
Creates a new String by allocating a buffer (Pointer(UInt8)) with the given capacity, then yielding that buffer.
.new(slice : Bytes)
Creates a String from the given slice.
.new(chars : Pointer(UInt8))
Creates a String from a pointer.
.new(ctx : YAML::ParseContext, node : YAML::Nodes::Node)
.new(chars : Pointer(UInt8), bytesize, size = 0)
Creates a new String from a pointer, indicating its bytesize count and, optionally, the UTF-8 codepoints count (size).
.new(bytes : Bytes, encoding : String, invalid : Symbol? = nil) : String
Creates a new String from the given bytes, which are encoded in the given encoding.
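
A few of the constructors above can be exercised as in the following sketch. The byte values and variable names are chosen only for illustration.

```crystal
# Create a string from raw UTF-8 bytes (104 = 'h', 105 = 'i').
from_bytes = String.new(Bytes[104, 105])
puts from_bytes # hi

# Round-trip through UTF-16 using #to_utf16 and .from_utf16.
utf16 = "hi".to_utf16
from_utf16 = String.from_utf16(utf16)
puts from_utf16 # hi

# Build with an explicit initial capacity instead of the default.
built = String.build(16) do |io|
  io << "h" << "i"
end
puts built # hi
```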

Class Method Summary

.from_json_object_key?(key : String)
.from_utf16(pointer : Pointer(UInt16)) : Tuple(String, Pointer(UInt16))
Decodes the zero-terminated UTF-16 sequence at the given pointer into a String and returns the pointer after reading.
.interpolation(*values : *T) forall T
Implementation of string interpolation of multiple, possibly non-string values.
.interpolation(*values : String)
Implementation of string interpolation of multiple string values.
.interpolation(value)
Implementation of string interpolation of a single non-string value.
.interpolation(value : String)
Implementation of string interpolation of a single string.
.interpolation(char : Char, value : String)
Implementation of string interpolation of a char and a string.
.interpolation(value : String, char : Char)
Implementation of string interpolation of a string and a char.

Instance Method Summary

#%(other)
Interpolates other into the string using Kernel#sprintf.
#*(times : Int)
Makes a new String by adding str to itself times times.
#+(other : self)
Concatenates str and other.
#+(char : Char)
Concatenates str and char.
#<=>(other : self)
The comparison operator.
#==(other : self) : Bool
Returns true if this string is equal to other.
#=~(regex : Regex)
Tests whether str matches regex.
#=~(other)
Tests whether str matches regex.
#[](start : Int, count : Int)
Returns a substring starting from the start character of size count.
#[](str : String | Char)
#[](range : Range)
Returns a substring by using a Range's begin and end as character indices.
#[](index : Int)
Returns the Char at the given index.
#[](regex : Regex, group)
#[](regex : Regex)
#[]?(regex : Regex)
#[]?(str : String | Char)
#[]?(start : Int, count : Int)
Like #[Int, Int] but returns nil if the start index is out of bounds.
#[]?(regex : Regex, group)
#[]?(index : Int)
#[]?(range : Range)
Like #[Range], but returns nil if the range's start is out of bounds.
#ascii_only?
Returns true if this String consists entirely of ASCII characters.
#blank?
Returns true if this string consists exclusively of unicode whitespace.
#byte_at(index) : UInt8
Returns the byte at the given index.
#byte_at(index, &)
Returns the byte at the given index, or yields if out of bounds.
#byte_at?(index) : UInt8?
Returns the byte at the given index, or nil if out of bounds.
#byte_index(byte : Int, offset = 0) : Int32?
Returns the index of the first occurrence of byte in the string, or nil if not present.
#byte_index(search : String, offset = 0) : Int32?
Returns the byte index of search in the string, or nil if the string is not present.
#byte_index_to_char_index(index)
Returns the char index of a byte index, or nil if out of bounds.
#byte_slice(start : Int, count : Int) : String
Returns a new string built from count bytes starting at start byte.
#byte_slice(start : Int) : String
Returns a substring starting from the start byte.
#byte_slice?(start : Int, count : Int) : String?
Like #byte_slice(Int, Int) but returns Nil if the start index is out of bounds.
#bytes
Returns this string's bytes as an Array(UInt8).
#bytesize : Int32
Returns the number of bytes in this string.
#camelcase(io : IO, options : Unicode::CaseOptions = Unicode::CaseOptions::None, *, lower : Bool = false) : Nil
Writes a camelcased version of self to the given io.
#camelcase(options : Unicode::CaseOptions = Unicode::CaseOptions::None, *, lower : Bool = false) : String
Converts underscores to camelcase boundaries.
#capitalize(io : IO, options : Unicode::CaseOptions = :none) : Nil
Writes a capitalized version of self to the given io.
#capitalize(options : Unicode::CaseOptions = :none) : String
Returns a new String with the first letter converted to uppercase and every subsequent letter converted to lowercase.
#center(io : IO, len : Int, char : Char = ' ') : Nil
Adds instances of char to left and right of the string until it is at least size of len, then appends the result to the given IO.
#center(len : Int, io : IO) : Nil
Adds spaces to left and right of the string until it is at least size of len, then appends the result to the given IO.
DEPRECATED Use #center(io : IO, len : Int, char : Char = ' ') instead
#center(len : Int, char : Char = ' ')
Adds instances of char to left and right of the string until it is at least size of len.
#center(len : Int, char : Char, io : IO) : Nil
Adds instances of char to left and right of the string until it is at least size of len, then appends the result to the given IO.
DEPRECATED Use #center(io : IO, len : Int, char : Char = ' ') instead
#char_at(index : Int) : Char
Returns the Char at the given index.
#char_at(index : Int, &)
Returns the Char at the given index, or result of running the given block if out of bounds.
#char_index_to_byte_index(index)
Returns the byte index of a char index, or nil if out of bounds.
#chars
Returns an Array of all characters in the string.
#check_no_null_byte(name = nil)
Raises an ArgumentError if self has null bytes.
#chomp
Returns a new String with the last carriage return removed (that is, it will remove \n, \r, and \r\n).
#chomp(suffix : String)
Returns a new String with suffix removed from the end of the string.
#chomp(suffix : Char)
Returns a new String with suffix removed from the end of the string.
#clone : String
Returns self.
#codepoint_at(index) : Int32
Returns the codepoint of the character at the given index.
#codepoints
Returns an Array of the codepoints that make up the string.
#compare(other : String, case_insensitive = false, options = Unicode::CaseOptions::None)
Compares this string with other, returning -1, 0 or 1 depending on whether this string is less, equal or greater than other, optionally in a case_insensitive manner.
#count(*sets)
Sets should be a list of strings following the rules described at Char#in_set?.
#count(&)
Yields each char in this string to the block, returns the number of times the block returned a truthy value.
#count(other : Char)
Counts the occurrences of other char in this string.
#delete(&)
Yields each char in this string to the block.
#delete(char : Char)
Returns a new String with all occurrences of char removed.
#delete(*sets)
Sets should be a list of strings following the rules described at Char#in_set?.
#downcase(options : Unicode::CaseOptions = :none) : String
Returns a new String with each uppercase letter replaced with its lowercase counterpart.
#downcase(io : IO, options : Unicode::CaseOptions = :none) : Nil
Writes a downcased version of self to the given io.
#dump : String
Returns a representation of self using character escapes for special characters and non-ASCII characters (unicode codepoints > 127), wrapped in quotes.
#dump(io : IO) : Nil
Appends self to the given IO object using character escapes for special characters and non-ASCII characters (unicode codepoints > 127), wrapped in quotes.
#dump_unquoted : String
Returns a representation of self using character escapes for special characters and non-ASCII characters (unicode codepoints > 127), but not wrapped in quotes.
#dump_unquoted(io : IO) : Nil
Appends self to the given IO object using character escapes for special characters and non-ASCII characters (unicode codepoints > 127), but not wrapped in quotes.
#dup : String
Returns self.
#each_byte(&)
Yields each byte in the string to the block.
#each_byte
Returns an Iterator over each byte in the string.
#each_char
Returns an Iterator over each character in the string.
#each_char(&) : Nil
Yields each character in the string to the block.
#each_char_with_index(offset = 0, &)
Yields each character and its index in the string to the block.
#each_codepoint(&)
Yields each codepoint to the block.
#each_codepoint
Returns an Iterator for each codepoint.
#each_line(chomp = true, &block : String -> ) : Nil
Splits the string after each newline and yields each line to a block.
#each_line(chomp = true)
Returns an Iterator which yields each line of this string (see String#each_line).
#empty?
Returns true if this is the empty string, "".
#encode(encoding : String, invalid : Symbol? = nil) : Bytes
Returns a slice of bytes containing this string encoded in the given encoding.
#ends_with?(str : String) : Bool
Returns true if this string ends with the given str.
#ends_with?(re : Regex) : Bool
Returns true if the regular expression re matches at the end of this string.
#ends_with?(char : Char) : Bool
Returns true if this string ends with the given char.
#gsub(pattern : Regex, hash : Hash(String, _) | NamedTuple)
Returns a String where all occurrences of the given pattern are replaced with a hash of replacements.
#gsub(string : String, &)
Returns a String where all occurrences of the given string are replaced with the block's value.
#gsub(&block : Char -> _)
Returns a String where each character yielded to the given block is replaced by the block's return value.
#gsub(char : Char, replacement)
Returns a String where all occurrences of the given char are replaced with the given replacement.
#gsub(pattern : Regex, replacement, backreferences = true)
Returns a String where all occurrences of the given pattern are replaced with the given replacement.
#gsub(string : String, replacement)
Returns a String where all occurrences of the given string are replaced with the given replacement.
#gsub(hash : Hash(Char, _))
Returns a String where all chars in the given hash are replaced by the corresponding hash values.
#gsub(tuple : NamedTuple)
Returns a String where all chars in the given named tuple are replaced by the corresponding tuple values.
#gsub(pattern : Regex, &)
Returns a String where all occurrences of the given pattern are replaced by the block's return value.
#has_back_references?
This returns true if this string has '\\' in it.
#hash(hasher)
See Object#hash(hasher)
#hexbytes : Bytes
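Interprets this string as containing a sequence of hexadecimal values and decodes it as a slice of bytes, raising ArgumentError if the string is not valid hexadecimal.
#hexbytes? : Bytes?
Interprets this string as containing a sequence of hexadecimal values and decodes it as a slice of bytes, or returns nil if the string is not valid hexadecimal.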
#includes?(search : Char | String)
Returns true if the string contains search.
#index(search : String, offset = 0)
Returns the index of the first occurrence of search in the string, or nil if not present.
#index(search : Regex, offset = 0)
Returns the index of the first occurrence of search in the string, or nil if not present.
#index(search : Char, offset = 0)
Returns the index of the first occurrence of search in the string, or nil if not present.
#insert(index : Int, other : String)
Returns a new String that results from inserting other in self at index.
#insert(index : Int, other : Char)
Returns a new String that results from inserting other in self at index.
#inspect(io : IO) : Nil
Appends self to the given IO object using character escapes for special characters and wrapped in double quotes.
#inspect : String
Returns a representation of self using character escapes for special characters and wrapped in quotes.
#inspect_unquoted : String
Returns a representation of self using character escapes for special characters but not wrapped in quotes.
#inspect_unquoted(io : IO) : Nil
Appends self to the given IO object using character escapes for special characters but not wrapped in quotes.
#lchop : String
Returns a new String with the first char removed from it.
#lchop(prefix : Char | String) : String
Returns a new String with prefix removed from the beginning of the string.
#lchop?(prefix : Char | String) : String?
Returns a new String with prefix removed from the beginning of the string if possible, else returns nil.
#lchop? : String?
Returns a new String with the first char removed from it if possible, else returns nil.
#lines(chomp = true)
#ljust(io : IO, len : Int, char : Char = ' ') : Nil
Adds instances of char to right of the string until it is at least size of len, and then appends the result to the given IO.
#ljust(len : Int, io : IO) : Nil
Adds spaces to right of the string until it is at least size of len, and then appends the result to the given IO.
DEPRECATED Use #ljust(io : IO, len : Int, char : Char = ' ') instead
#ljust(len : Int, char : Char = ' ')
Adds instances of char to right of the string until it is at least size of len.
#ljust(len : Int, char : Char, io : IO) : Nil
Adds instances of char to right of the string until it is at least size of len, and then appends the result to the given IO.
DEPRECATED Use #ljust(io : IO, len : Int, char : Char = ' ') instead
#lstrip(char : Char)
Returns a new string with leading occurrences of char removed.
#lstrip(chars : String)
Returns a new string where leading occurrences of any char in chars are removed.
#lstrip(&block : Char -> _)
Returns a new string where leading characters for which the block returns a truthy value are removed.
#lstrip
Returns a new String with leading whitespace removed.
#match(regex : Regex, pos = 0) : Regex::MatchData?
Finds match of regex, starting at pos.
#matches?(regex : Regex, pos = 0) : Bool
Like #match, but returns a Bool indicating whether regex matches instead of the match data.
#partition(search : Char | String) : Tuple(String, String, String)
Searches separator or pattern (Regex) in the string, and returns a Tuple with the part before it, the match, and the part after it.
#partition(search : Regex) : Tuple(String, String, String)
Searches separator or pattern (Regex) in the string, and returns a Tuple with the part before it, the match, and the part after it.
#presence : self?
Returns self unless #blank? is true in which case it returns nil.
#pretty_print(pp : PrettyPrint) : Nil
Pretty prints self into the given printer.
#rchop : String
Returns a new String with the last character removed.
#rchop(suffix : Char | String) : String
Returns a new String with suffix removed from the end of the string.
#rchop? : String?
Returns a new String with the last character removed if possible, else returns nil.
#rchop?(suffix : Char | String) : String?
Returns a new String with suffix removed from the end of the string if possible, else returns nil.
#reverse
Reverses the order of characters in the string.
#rindex(search : Regex, offset = size - 1)
Returns the index of the last appearance of search in the string. If offset is present, it defines the position to end the search (characters beyond this point are ignored).
#rindex(search : String, offset = size - search.size)
Returns the index of the last appearance of search in the string. If offset is present, it defines the position to end the search (characters beyond this point are ignored).
#rindex(search : Char, offset = size - 1)
Returns the index of the last appearance of search in the string. If offset is present, it defines the position to end the search (characters beyond this point are ignored).
#rjust(len : Int, char : Char = ' ')
Adds instances of char to left of the string until it is at least size of len.
#rjust(len : Int, char : Char, io : IO) : Nil
Adds instances of char to left of the string until it is at least size of len, and then appends the result to the given IO.
DEPRECATED Use #rjust(io : IO, len : Int, char : Char = ' ') instead
#rjust(len : Int, io : IO) : Nil
Adds spaces to left of the string until it is at least size of len, and then appends the result to the given IO.
DEPRECATED Use #rjust(io : IO, len : Int, char : Char = ' ') instead
#rjust(io : IO, len : Int, char : Char = ' ') : Nil
Adds instances of char to left of the string until it is at least size of len, and then appends the result to the given IO.
#rpartition(search : Regex) : Tuple(String, String, String)
Searches separator or pattern (Regex) in the string from the end of the string, and returns a Tuple with the part before it, the match, and the part after it.
#rpartition(search : Char | String) : Tuple(String, String, String)
Searches separator or pattern (Regex) in the string from the end of the string, and returns a Tuple with the part before it, the match, and the part after it.
#rstrip(char : Char)
Returns a new string with trailing occurrences of char removed.
#rstrip(chars : String)
Returns a new string where trailing occurrences of any char in chars are removed.
#rstrip
Returns a new String with trailing whitespace removed.
#rstrip(&block : Char -> _)
Returns a new string where trailing characters for which the block returns a truthy value are removed.
#scan(pattern : String)
Searches the string for instances of pattern, returning an array of the matched string for each match.
#scan(pattern : Regex)
Searches the string for instances of pattern, returning an Array of Regex::MatchData for each match.
#scan(pattern : Regex, &)
Searches the string for instances of pattern, yielding a Regex::MatchData for each match.
#scan(pattern : String, &)
Searches the string for instances of pattern, yielding the matched string for each match.
#scrub(replacement = Char::REPLACEMENT) : String
Returns a String where bytes that are invalid in the UTF-8 encoding are replaced with replacement.
#size
Returns the number of unicode codepoints in this string.
#split(separator : Regex, limit = nil, *, remove_empty = false, &block : String -> _)
Splits the string after each regex separator and yields each part to a block.
#split(separator : Regex, limit = nil, *, remove_empty = false)
Makes an Array by splitting the string on separator (and removing instances of separator).
#split(separator : String, limit = nil, *, remove_empty = false, &block : String -> _)
Splits the string after each string separator and yields each part to a block.
#split(separator : String, limit = nil, *, remove_empty = false)
Makes an Array by splitting the string on separator (and removing instances of separator).
#split(separator : Char, limit = nil, *, remove_empty = false, &block : String -> _)
Splits the string after each character separator and yields each part to a block.
#split(separator : Char, limit = nil, *, remove_empty = false)
Makes an Array by splitting the string on the given character separator (and removing that character).
#split(limit : Int32? = nil, &block : String -> _)
Splits the string after any amount of ASCII whitespace characters and yields each non-whitespace part to a block.
#split(limit : Int32? = nil)
Makes an array by splitting the string on any amount of ASCII whitespace characters (and removing that whitespace).
#squeeze
Returns a new String in which every character that is the same as the one before it has been removed.
#squeeze(*sets : String)
Sets should be a list of strings following the rules described at Char#in_set?.
#squeeze(&)
Yields each char in this string to the block.
#squeeze(char : Char)
Returns a new String, with all runs of char replaced by one instance.
#starts_with?(str : String) : Bool
Returns true if this string starts with the given str.
#starts_with?(char : Char) : Bool
Returns true if this string starts with the given char.
#starts_with?(re : Regex) : Bool
Returns true if the regular expression re matches at the start of this string.
#strip(char : Char)
Returns a new string where leading and trailing occurrences of char are removed.
#strip(&block : Char -> _)
Returns a new string where leading and trailing characters for which the block returns a truthy value are removed.
#strip
Returns a new String with leading and trailing whitespace removed.
#strip(chars : String)
Returns a new string where leading and trailing occurrences of any char in chars are removed.
#sub(char : Char, replacement)
Returns a String where the first occurrence of char is replaced by replacement.
#sub(pattern : Regex, hash : Hash(String, _) | NamedTuple)
Returns a String where the first occurrence of the given pattern is replaced with the matching entry from the hash of replacements.
#sub(&block : Char -> _)
Returns a new String where the first character is yielded to the given block and replaced by its return value.
#sub(string : String, &)
Returns a String where the first occurrence of the given string is replaced with the block's value.
#sub(pattern : Regex, &)
Returns a String where the first occurrence of pattern is replaced by the block's return value.
#sub(pattern : Regex, replacement, backreferences = true)
Returns a String where the first occurrence of pattern is replaced by replacement.
#sub(string : String, replacement)
Returns a String where the first occurrence of the given string is replaced with the given replacement.
#sub(index : Int, replacement : Char)
Returns a new String with the character at the given index replaced by replacement.
#sub(index : Int, replacement : String)
Returns a new String with the character at the given index replaced by replacement.
#sub(range : Range, replacement : String)
Returns a new String with characters at the given range replaced by replacement.
#sub(hash : Hash(Char, _))
Returns a String where the first char in the string matching a key in the given hash is replaced by the corresponding hash value.
#sub(range : Range, replacement : Char)
Returns a new String with characters at the given range replaced by replacement.
#succ
Returns the successor of the string.
#titleize(options : Unicode::CaseOptions = :none) : String
Returns a new String with the first letter after any space converted to uppercase and every other letter converted to lowercase.
#titleize(io : IO, options : Unicode::CaseOptions = :none) : Nil
Writes a titleized version of self to the given io.
#to_big_d
Converts self to BigDecimal.
#to_big_f
Converts self to a BigFloat.
#to_big_i(base = 10) : BigInt
Returns a BigInt from this string, in the given base.
#to_f(whitespace : Bool = true, strict : Bool = true)
Returns the result of interpreting characters in this string as a floating point number (Float64).
#to_f32(whitespace : Bool = true, strict : Bool = true)
Same as #to_f but returns a Float32.
#to_f32?(whitespace : Bool = true, strict : Bool = true)
Same as #to_f? but returns a Float32.
#to_f64(whitespace : Bool = true, strict : Bool = true)
Same as #to_f.
#to_f64?(whitespace : Bool = true, strict : Bool = true)
Same as #to_f?.
#to_f?(whitespace : Bool = true, strict : Bool = true)
Returns the result of interpreting characters in this string as a floating point number (Float64).
#to_i(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false, &)
Same as #to_i, but returns the block's value if there is not a valid number at the start of this string, or if the resulting integer doesn't fit an Int32.
#to_i(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false)
Returns the result of interpreting leading characters in this string as an integer in the given base (between 2 and 36).
#to_i16(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : Int16
Same as #to_i but returns an Int16.
#to_i16(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false, &)
Same as #to_i but returns an Int16 or the block's value.
#to_i16?(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : Int16?
Same as #to_i but returns an Int16 or nil.
#to_i32(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : Int32
Same as #to_i.
#to_i32(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false, &)
Same as #to_i.
#to_i32?(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : Int32?
Same as #to_i.
#to_i64(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : Int64
Same as #to_i but returns an Int64.
#to_i64(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false, &)
Same as #to_i but returns an Int64 or the block's value.
#to_i64?(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : Int64?
Same as #to_i but returns an Int64 or nil.
#to_i8(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false, &)
Same as #to_i but returns an Int8 or the block's value.
#to_i8(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : Int8
Same as #to_i but returns an Int8.
#to_i8?(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : Int8?
Same as #to_i but returns an Int8 or nil.
#to_i?(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false)
Same as #to_i, but returns nil if there is not a valid number at the start of this string, or if the resulting integer doesn't fit an Int32.
#to_json(json : JSON::Builder)
#to_json_object_key
#to_s : String
Returns self.
#to_s(io : IO) : Nil
Appends self to io.
#to_slice : Bytes
Returns the underlying bytes of this String.
#to_u16(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false, &)
Same as #to_i but returns an UInt16 or the block's value.
#to_u16(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : UInt16
Same as #to_i but returns an UInt16.
#to_u16?(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : UInt16?
Same as #to_i but returns an UInt16 or nil.
#to_u32(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : UInt32
Same as #to_i but returns an UInt32.
#to_u32(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false, &)
Same as #to_i but returns an UInt32 or the block's value.
#to_u32?(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : UInt32?
Same as #to_i but returns an UInt32 or nil.
#to_u64(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : UInt64
Same as #to_i but returns an UInt64.
#to_u64(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false, &)
Same as #to_i but returns an UInt64 or the block's value.
#to_u64?(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : UInt64?
Same as #to_i but returns a UInt64 or nil.
#to_u8(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false, &)
Same as #to_i but returns a UInt8 or the block's value.
#to_u8(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : UInt8
Same as #to_i but returns a UInt8.
#to_u8?(base : Int = 10, whitespace : Bool = true, underscore : Bool = false, prefix : Bool = false, strict : Bool = true, leading_zero_is_octal : Bool = false) : UInt8?
Same as #to_i but returns a UInt8 or nil.
#to_unsafe : Pointer(UInt8)
Returns a pointer to the underlying bytes of this String.
#to_utf16 : Slice(UInt16)
Returns the UTF-16 encoding of this String.
#to_yaml(yaml : YAML::Nodes::Builder)
#tr(from : String, to : String)
Returns a new string translating characters using from and to as a map.
#underscore(io : IO, options : Unicode::CaseOptions = :none) : Nil
Writes an underscored version of self to the given io.
#underscore(options : Unicode::CaseOptions = :none) : String
Converts camelcase boundaries to underscores.
#unsafe_byte_at(index : Int) : UInt8
Returns the byte at the given index without bounds checking.
#unsafe_byte_slice(byte_offset) : Slice
Returns the underlying bytes of this String starting at the given byte_offset.
#unsafe_byte_slice(byte_offset, count) : Slice
Returns count underlying bytes of this String starting at the given byte_offset.
#upcase(io : IO, options : Unicode::CaseOptions = :none) : Nil
Writes an upcased version of self to the given io.
#upcase(options : Unicode::CaseOptions = :none) : String
Returns a new String with each lowercase letter replaced with its uppercase counterpart.
#valid_encoding?
Returns true if this String is encoded correctly according to the UTF-8 encoding.
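
The three flavors of each unsigned conversion summarized above (raising, block, and nil-returning) can be compared side by side. This is a minimal sketch using only the signatures listed in this summary; the input strings are arbitrary illustrations:

```crystal
# Plain form: returns the parsed value, raises ArgumentError on failure.
"255".to_u8 # => 255_u8

# Question-mark form: returns nil when the value is invalid or out of range.
"256".to_u8? # => nil
"-1".to_u64? # => nil

# Block form: returns the block's value on failure.
"256".to_u8 { 0_u8 } # => 0_u8

# The base and underscore arguments behave as in #to_i.
"ff".to_u16(16)                  # => 255_u16
"1_000".to_u32(underscore: true) # => 1000_u32
```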

Instance methods inherited from module `Comparable(String)`


Instance methods inherited from class `Reference`


Constructor methods inherited from class `Reference`

Instance methods inherited from class `Object`


Class methods inherited from class `Object`

Constructor Detail

def self.build(capacity = 64, &) : self

Builds a String by creating a String::Builder with the given initial capacity, yielding it to the block and finally getting a String out of it. The String::Builder automatically resizes as needed.

str = String.build do |str|
  str << "hello "
  str << 1
end
str # => "hello 1"

def self.from_utf16(slice : Slice(UInt16)) : String

Decodes the UTF-16 sequence in the given slice into a String.

Invalid values are replaced with the Unicode replacement character (codepoint 0xfffd).

slice = Slice[104_u16, 105_u16, 32_u16, 55296_u16, 56485_u16]
String.from_utf16(slice) # => "hi 𐂥"
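
As a further illustration, decoding can be paired with #to_utf16 to round-trip a string, and a lone surrogate shows the replacement behavior described above. This is a sketch; the variable names are illustrative only:

```crystal
original = "hi 𐂥"

# Encode to UTF-16: the last character needs a surrogate pair,
# so four characters become five 16-bit units.
utf16 = original.to_utf16
utf16.size # => 5

# Decoding the slice gives back an equal String.
String.from_utf16(utf16) == original # => true

# A high surrogate with no low surrogate after it is invalid
# and decodes to the replacement character.
String.from_utf16(Slice[0xD800_u16]) # => "\uFFFD"
```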

def self.new(pull : JSON::PullParser)

def self.new(capacity : Int, &)

Creates a new String by allocating a buffer (Pointer(UInt8)) with the given capacity, then yielding that buffer. The block must return a tuple with the bytesize and size (UTF-8 codepoints count) of the String. If the returned size is zero, the UTF-8 codepoints count will be lazily computed.

The bytesize returned by the block must be less than or equal to the capacity given to this String, otherwise ArgumentError is raised.

If you need to build a String where the maximum capacity is unknown, use String.build.

str = String.new(4) do |buffer|
  buffer[0] = 'a'.ord.to_u8
  buffer[1] = 'b'.ord.to_u8
  {2, 2}
end
str # => "ab"
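
When the bytes are not all ASCII, bytesize and size differ. The sketch below returns zero for size so that the codepoint count is computed lazily, as described above; the source bytes and variable names are illustrative only:

```crystal
str = String.new(8) do |buffer|
  # "héllo" occupies 6 bytes in UTF-8 because "é" takes two bytes.
  bytes = "héllo".to_slice
  bytes.copy_to(buffer, bytes.size)
  # Report the byte count; a size of 0 defers the codepoint count.
  {bytes.size, 0}
end

str          # => "héllo"
str.bytesize # => 6
str.size     # => 5
```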

def self.new(slice : Bytes)

Creates a String from the given slice. Bytes will be copied from the slice.

This method is always safe to call, and the resulting string will have the contents and size of the slice.

slice = Slice.new(4) { |i| ('a'.ord + i).to_u8 }
String.new(slice) # => "abcd"

def self.new(chars : Pointer(UInt8))

Creates a String from a pointer. Bytes will be copied from the pointer.

This method is unsafe: the pointer must point to data that eventually contains a zero byte that indicates the end of the string. Otherwise, the result of this method is undefined and might cause a segmentation fault.

This method is typically used in C bindings, where you get a char* from a library and the library guarantees that this pointer eventually has an ending zero byte.

ptr = Pointer.malloc(5) { |i| i == 4 ? 0_u8 : ('a'.ord + i).to_u8 }
String.new(ptr) # => "abcd"

def self.new(ctx : YAML::ParseContext, node : YAML::Nodes::Node)

def self.new(chars : Pointer(UInt8), bytesize, size = 0)

Creates a new String from a pointer, given its bytesize and, optionally, the UTF-8 codepoint count (size). Bytes will be copied from the pointer.

If the given size is zero, the number of UTF-8 codepoints will be computed lazily when needed.

ptr = Pointer.malloc(4) { |i| ('a'.ord + i).to_u8 }
String.new(ptr, 2) # => "ab"

def self.new(bytes : Bytes, encoding : String, invalid : Symbol? = nil) : String

Creates a new String from the given bytes, which are encoded in the given encoding.

The invalid argument can be:

nil: an exception is raised on invalid byte sequences
:skip: invalid byte sequences are ignored

slice = Slice.new(2, 0_u8)
slice[0] = 186_u8
slice[1] = 195_u8
String.new(slice, "GB2312") # => "好"
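
For comparison, here is a sketch of the same constructor with a single-byte encoding, contrasted with the plain String.new(slice) overload, which copies the bytes without converting them. It assumes the platform's encoding support recognizes the name "ISO-8859-1"; the variable name is illustrative:

```crystal
# "café" in ISO-8859-1: the final byte 0xE9 is "é".
latin1 = Bytes[0x63, 0x61, 0x66, 0xE9]

# Converting from the declared encoding yields a valid UTF-8 String.
String.new(latin1, "ISO-8859-1") # => "café"

# Without an encoding the bytes are copied as-is, and 0xE9 on its own
# is not a complete UTF-8 sequence.
String.new(latin1).valid_encoding? # => false
```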

Class Method Detail

def self.from_json_object_key?(key : String)

def self.from_utf16(pointer : Pointer(UInt16)) : Tuple(String, Pointer(UInt16))

Decodes the zero-terminated UTF-16 sequence at the given pointer into a String and returns the string together with the pointer after reading. The string ends when a zero value is found.

slice = Slice[104_u16, 105_u16, 0_u16, 55296_u16, 56485_u16, 0_u16]
String.from_utf16(slice) # => "hi\u0000𐂥\u0000"
pointer = slice.to_unsafe
string, pointer = String.from_utf16(pointer) # => "hi"
string, pointer = String.from_utf16(pointer) # => "𐂥"

Invalid values are replaced with the Unicode replacement character (codepoint 0xfffd).

def self.interpolation(values : T) forall T

Implementation of string interpolation of multiple, possibly non-string values.

For example, this code will end up invoking this method:

value1 = "hello"
value2 = 123
"#{value1} #{value2}!" # same as String.interpolation(value1, " ", value2, "!")

In this case the implementation will call String.build with the given values.
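
The effect can be approximated by hand with String.build. This is a sketch of an equivalent result, not the library's actual implementation; the names interpolated and built are illustrative:

```crystal
value1 = "hello"
value2 = 123

# What the interpolation literal produces.
interpolated = "#{value1} #{value2}!"

# Appending each piece to a String::Builder gives the same String.
built = String.build do |io|
  io << value1 << " " << value2 << "!"
end

interpolated          # => "hello 123!"
interpolated == built # => true
```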