Class StringView
java.lang.Object
net.sf.saxon.str.UnicodeString
net.sf.saxon.str.StringView
- All Implemented Interfaces:
Comparable<UnicodeString>, AtomicMatchKey
An implementation of the CodePoints interface that wraps an ordinary Java string.
-
Method Summary
Modifier and TypeMethodDescriptionintcodePointAt(long index) Get the code point at a given position in the stringGet an iterator over the code points present in the string.intcompareTo(UnicodeString other) Compare this string to another using codepoint comparisonconcat(UnicodeString other) Concatenate with another string, returning a new string(package private) voidcopy16bit(char[] target, int offset) Copy this string, as a sequence of 16-bit characters, to a specified array(package private) voidcopy24bit(byte[] target, int offset) Copy this string, as a sequence of 24-bit characters, to a specified array(package private) voidcopy8bit(byte[] target, int offset) Copy this string, as a sequence of 8-bit characters, to a specified arraybooleanlongGet the estimated length of the string, suitable for space allocation.intgetWidth()Get the number of bits needed to hold all the characters in this stringinthashCode()Compute a hashCode.longindexOf(int codePoint, long from) Get the position of the first occurrence of the specified codepoint, starting the search at a given position in the stringbooleanisEmpty()Ask whether the string is emptylonglength()Get the length of the stringstatic UnicodeStringsubstring(long start, long end) Get a substring of this codepoint sequence, with a given start and end positiontidy()Ensure that the implementation is capable of counting codepoints in the string.static UnicodeStringtoString()static UnicodeStringMethods inherited from class UnicodeString
asAtomic, checkSubstringBounds, hasSubstring, indexOf, indexOf, indexWhere, length32, prefix, requireInt, substring, verifyCharacters
-
Method Details
-
of
-
wrap
-
tidy
-
tidy
Description copied from class:UnicodeStringEnsure that the implementation is capable of counting codepoints in the string. This is normally a null operation, but it may cause internal reorganisation.- Overrides:
tidyin classUnicodeString- Returns:
- this
UnicodeString, or another that represents the same sequence of characters.
-
isEmpty
public boolean isEmpty()Description copied from class:UnicodeStringAsk whether the string is empty- Overrides:
isEmptyin classUnicodeString- Returns:
- true if the length of the string is zero
-
estimatedLength
public long estimatedLength()Description copied from class:UnicodeStringGet the estimated length of the string, suitable for space allocation.- Overrides:
estimatedLengthin classUnicodeString- Returns:
- for a
UnicodeString, the actual length of the string in codepoints
-
length
public long length()Get the length of the string- Specified by:
lengthin classUnicodeString- Returns:
- the number of code points in the string
-
getWidth
public int getWidth()Get the number of bits needed to hold all the characters in this string- Specified by:
getWidthin classUnicodeString- Returns:
- 7 for ascii characters, 8 for latin-1, 16 for BMP, 24 for general Unicode.
-
indexOf
public long indexOf(int codePoint, long from) Get the position of the first occurrence of the specified codepoint, starting the search at a given position in the string- Specified by:
indexOfin classUnicodeString- Parameters:
codePoint- the sought codepointfrom- the position from which the search should start (0-based). There is no restriction on the value. Negative values are treated as zero; values greater than or equal to length() return -1.- Returns:
- the position (zero-based) of the first occurrence found, or -1 if not found
-
codePointAt
public int codePointAt(long index) Get the code point at a given position in the string- Specified by:
codePointAtin classUnicodeString- Parameters:
index- the given position (0-based)- Returns:
- the code point at the given position
- Throws:
IndexOutOfBoundsException- if the index is out of range
-
substring
Get a substring of this codepoint sequence, with a given start and end position- Specified by:
substringin classUnicodeString- Parameters:
start- the start position (0-based): that is, the position of the first code point to be includedend- the end position (0-based): specifically, the position of the first code point not to be included- Returns:
- the requested substring
- Throws:
UnsupportedOperationException- if theUnicodeStringhas not been prepared for codePoint access
-
codePoints
Description copied from class:UnicodeStringGet an iterator over the code points present in the string.- Specified by:
codePointsin classUnicodeString- Returns:
- an iterator that delivers the individual code points
-
concat
Description copied from class:UnicodeStringConcatenate with another string, returning a new string- Overrides:
concatin classUnicodeString- Parameters:
other- the string to be appended- Returns:
- the result of concatenating this string followed by the other
-
compareTo
Description copied from class:UnicodeStringCompare this string to another using codepoint comparison- Specified by:
compareToin interfaceComparable<UnicodeString>- Overrides:
compareToin classUnicodeString- Parameters:
other- the other string- Returns:
- -1 if this string comes first, 0 if they are equal, +1 if the other string comes first
-
equals
- Overrides:
equalsin classUnicodeString
-
hashCode
public int hashCode()Description copied from class:UnicodeStringCompute a hashCode. All implementations ofUnicodeStringuse compatible hash codes and the hashing algorithm is therefore identical to that forjava.lang.String. This means that for strings containing Astral characters, the hash code needs to be computed by decomposing an Astral character into a surrogate pair.- Overrides:
hashCodein classUnicodeString- Returns:
- the hash code
-
economize
- Overrides:
economizein classUnicodeString
-
toString
-
copy8bit
void copy8bit(byte[] target, int offset) Description copied from class:UnicodeStringCopy this string, as a sequence of 8-bit characters, to a specified array- Overrides:
copy8bitin classUnicodeString- Parameters:
target- the target array: the caller must ensure there is sufficient capacityoffset- the position in the target array
-
copy16bit
void copy16bit(char[] target, int offset) Description copied from class:UnicodeStringCopy this string, as a sequence of 16-bit characters, to a specified array- Overrides:
copy16bitin classUnicodeString- Parameters:
target- the target array: the caller must ensure there is sufficient capacityoffset- the position in the target array
-
copy24bit
void copy24bit(byte[] target, int offset) Description copied from class:UnicodeStringCopy this string, as a sequence of 24-bit characters, to a specified array- Overrides:
copy24bitin classUnicodeString- Parameters:
target- the target array: the caller must ensure there is sufficient capacityoffset- the position in the target array as a byte offset (that is, the character offset times 3)
-