Class ByteString
String. Concatenation is
likewise supported without copying (long strings) by building a tree of
pieces in RopeByteString.
Like String, the contents of a ByteString can never be
observed to change, not even in the presence of a data race or incorrect
API usage in the client code.
- Author:
- crazybob@google.com Bob Lee, kenton@google.com Kenton Varda, carlanton@google.com Carl Haverl, martinrb@google.com Martin Buchholz
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic interfaceThis interface extendsIterator<Byte>, so that we can return an unboxedbyte.static final classOutputs to aByteStringinstance. -
Field Summary
Fields -
Method Summary
Modifier and TypeMethodDescriptionabstract ByteBufferConstructs a read-onlyjava.nio.ByteBufferwhose content is equal to the contents of this byte string.abstract List<ByteBuffer>Constructs a list of read-onlyjava.nio.ByteBufferobjects such that the concatenation of their contents is equal to the contents of this byte string.abstract bytebyteAt(int index) Gets the byte at the given index.concat(ByteString other) Concatenate the givenByteStringto this one.static ByteStringcopyFrom(byte[] bytes) Copies the given bytes into aByteString.static ByteStringcopyFrom(byte[] bytes, int offset, int size) Copies the given bytes into aByteString.static ByteStringcopyFrom(Iterable<ByteString> byteStrings) Concatenates all byte strings in the iterable and returns the result.static ByteStringEncodestextinto a sequence of bytes using the named charset and returns the result as aByteString.static ByteStringcopyFrom(ByteBuffer bytes) Copies the remaining bytes from ajava.nio.ByteBufferinto aByteString.static ByteStringcopyFrom(ByteBuffer bytes, int size) Copies the nextsizebytes from ajava.nio.ByteBufferinto aByteString.static ByteStringcopyFromUtf8(String text) Encodestextinto a sequence of UTF-8 bytes and returns the result as aByteString.voidcopyTo(byte[] target, int offset) Copies bytes into a buffer at the given offset.voidcopyTo(byte[] target, int sourceOffset, int targetOffset, int numberToCopy) Copies bytes into a buffer.abstract voidcopyTo(ByteBuffer target) Copies bytes into a ByteBuffer.protected abstract voidcopyToInternal(byte[] target, int sourceOffset, int targetOffset, int numberToCopy) Internal (package private) implementation ofbooleanendsWith(ByteString suffix) Tests if this bytestring ends with the specified suffix.abstract booleanprotected abstract intReturn the depth of the tree representing thisByteString, if any, whose root is this node.abstract inthashCode()Return a non-zero hashCode depending only on the sequence of bytes in this ByteString.protected abstract booleanReturntrueif this ByteString is literal (a leaf node) or a flat-enough tree in the sense ofRopeByteString.booleanisEmpty()Returnstrueif the size is0,falseotherwise.abstract booleanTells whether thisByteStringrepresents a well-formed UTF-8 byte sequence, such that the original bytes can be converted to a String object and then round tripped back to bytes without loss.abstract ByteString.ByteIteratoriterator()Return aByteString.ByteIteratorover the bytes in the ByteString.abstract CodedInputStreamCreates aCodedInputStreamwhich can be used to read the bytes.abstract InputStreamnewInput()Creates anInputStreamwhich can be used to read the bytes.static ByteString.OutputCreates a newByteString.Output.static ByteString.OutputnewOutput(int initialCapacity) Creates a newByteString.Outputwith the given initial capacity.protected abstract intpartialHash(int h, int offset, int length) Compute the hash across the value bytes starting with the given hash, and return the result.protected abstract intpartialIsValidUtf8(int state, int offset, int length) Tells whether the given byte sequence is a well-formed, malformed, or incomplete UTF-8 byte sequence.protected abstract intReturn the cached hash code if available.static ByteStringreadFrom(InputStream streamToDrain) Completely reads the given stream's bytes into aByteString, blocking if necessary until all bytes are read through to the end of the stream.static ByteStringreadFrom(InputStream streamToDrain, int chunkSize) Completely reads the given stream's bytes into aByteString, blocking if necessary until all bytes are read through to the end of the stream.static ByteStringreadFrom(InputStream streamToDrain, int minChunkSize, int maxChunkSize) abstract intsize()Gets the number of bytes.booleanstartsWith(ByteString prefix) Tests if this bytestring starts with the specified prefix.substring(int beginIndex) Return the substring frombeginIndex, inclusive, to the end of the string.abstract ByteStringsubstring(int beginIndex, int endIndex) Return the substring frombeginIndex, inclusive, toendIndex, exclusive.byte[]Copies bytes to abyte[].toString()abstract StringConstructs a newStringby decoding the bytes using the specified charset.Constructs a newStringby decoding the bytes as UTF-8.abstract voidwriteTo(OutputStream out) Writes the complete contents of this byte string to the specified output stream argument.Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, waitMethods inherited from interface java.lang.Iterable
forEach, spliterator
-
Field Details
-
EMPTY
EmptyByteString.
-
-
Method Details
-
byteAt
public abstract byte byteAt(int index) Gets the byte at the given index. This method should be used only for random access to individual bytes. To access bytes sequentially, use theByteString.ByteIteratorreturned byiterator(), and callsubstring(int, int)first if necessary.- Parameters:
index- index of byte- Returns:
- the value
- Throws:
ArrayIndexOutOfBoundsException-indexis < 0 or >= size
-
iterator
Return aByteString.ByteIteratorover the bytes in the ByteString. To avoid auto-boxing, you may get the iterator manually and callByteString.ByteIterator.nextByte(). -
size
public abstract int size()Gets the number of bytes.- Returns:
- size in bytes
-
isEmpty
public boolean isEmpty()Returnstrueif the size is0,falseotherwise.- Returns:
- true if this is zero bytes long
-
substring
Return the substring frombeginIndex, inclusive, to the end of the string.- Parameters:
beginIndex- start at this index- Returns:
- substring sharing underlying data
- Throws:
IndexOutOfBoundsException- ifbeginIndex < 0orbeginIndex > size().
-
substring
Return the substring frombeginIndex, inclusive, toendIndex, exclusive.- Parameters:
beginIndex- start at this indexendIndex- the last character is the one before this index- Returns:
- substring sharing underlying data
- Throws:
IndexOutOfBoundsException- ifbeginIndex < 0,endIndex > size(), orbeginIndex > endIndex.
-
startsWith
Tests if this bytestring starts with the specified prefix. Similar toString.startsWith(String)- Parameters:
prefix- the prefix.- Returns:
trueif the byte sequence represented by the argument is a prefix of the byte sequence represented by this string;falseotherwise.
-
endsWith
Tests if this bytestring ends with the specified suffix. Similar toString.endsWith(String)- Parameters:
suffix- the suffix.- Returns:
trueif the byte sequence represented by the argument is a suffix of the byte sequence represented by this string;falseotherwise.
-
copyFrom
Copies the given bytes into aByteString.- Parameters:
bytes- source arrayoffset- offset in source arraysize- number of bytes to copy- Returns:
- new
ByteString
-
copyFrom
Copies the given bytes into aByteString.- Parameters:
bytes- to copy- Returns:
- new
ByteString
-
copyFrom
Copies the nextsizebytes from ajava.nio.ByteBufferinto aByteString.- Parameters:
bytes- source buffersize- number of bytes to copy- Returns:
- new
ByteString
-
copyFrom
Copies the remaining bytes from ajava.nio.ByteBufferinto aByteString.- Parameters:
bytes- sourceBuffer- Returns:
- new
ByteString
-
copyFrom
public static ByteString copyFrom(String text, String charsetName) throws UnsupportedEncodingException Encodestextinto a sequence of bytes using the named charset and returns the result as aByteString.- Parameters:
text- source stringcharsetName- encoding to use- Returns:
- new
ByteString - Throws:
UnsupportedEncodingException- if the encoding isn't found
-
copyFromUtf8
Encodestextinto a sequence of UTF-8 bytes and returns the result as aByteString.- Parameters:
text- source string- Returns:
- new
ByteString
-
readFrom
Completely reads the given stream's bytes into aByteString, blocking if necessary until all bytes are read through to the end of the stream. Performance notes: The returnedByteStringis an immutable tree of byte arrays ("chunks") of the stream data. The first chunk is small, with subsequent chunks each being double the size, up to 8K. If the caller knows the precise length of the stream and wishes to avoid all unnecessary copies and allocations, consider using the two-argument version of this method, below.- Parameters:
streamToDrain- The source stream, which is read completely but not closed.- Returns:
- A new
ByteStringwhich is made up of chunks of various sizes, depending on the behavior of the underlying stream. - Throws:
IOException- IOException is thrown if there is a problem reading the underlying stream.
-
readFrom
Completely reads the given stream's bytes into aByteString, blocking if necessary until all bytes are read through to the end of the stream. Performance notes: The returnedByteStringis an immutable tree of byte arrays ("chunks") of the stream data. The chunkSize parameter sets the size of these byte arrays. In particular, if the chunkSize is precisely the same as the length of the stream, unnecessary allocations and copies will be avoided. Otherwise, the chunks will be of the given size, except for the last chunk, which will be resized (via a reallocation and copy) to contain the remainder of the stream.- Parameters:
streamToDrain- The source stream, which is read completely but not closed.chunkSize- The size of the chunks in which to read the stream.- Returns:
- A new
ByteStringwhich is made up of chunks of the given size. - Throws:
IOException- IOException is thrown if there is a problem reading the underlying stream.
-
readFrom
public static ByteString readFrom(InputStream streamToDrain, int minChunkSize, int maxChunkSize) throws IOException - Throws:
IOException
-
concat
Concatenate the givenByteStringto this one. Short concatenations, of total size smaller thanCONCATENATE_BY_COPY_SIZE, are produced by copying the underlying bytes (as per Rope.java, BAP95 . In general, the concatenate involves no copying.- Parameters:
other- string to concatenate- Returns:
- a new
ByteStringinstance
-
copyFrom
Concatenates all byte strings in the iterable and returns the result. This is designed to run in O(list size), not O(total bytes).The returned
ByteStringis not necessarily a unique object. If the list is empty, the returned object is the singleton emptyByteString. If the list has only one element, thatByteStringwill be returned without copying.- Parameters:
byteStrings- strings to be concatenated- Returns:
- new
ByteString
-
copyTo
public void copyTo(byte[] target, int offset) Copies bytes into a buffer at the given offset.- Parameters:
target- buffer to copy intooffset- in the target buffer- Throws:
IndexOutOfBoundsException- if the offset is negative or too large
-
copyTo
public void copyTo(byte[] target, int sourceOffset, int targetOffset, int numberToCopy) Copies bytes into a buffer.- Parameters:
target- buffer to copy intosourceOffset- offset within these bytestargetOffset- offset within the target buffernumberToCopy- number of bytes to copy- Throws:
IndexOutOfBoundsException- if an offset or size is negative or too large
-
copyToInternal
protected abstract void copyToInternal(byte[] target, int sourceOffset, int targetOffset, int numberToCopy) Internal (package private) implementation of -
copyTo
Copies bytes into a ByteBuffer.- Parameters:
target- ByteBuffer to copy into.- Throws:
ReadOnlyBufferException- if thetargetis read-onlyBufferOverflowException- if thetarget's remaining() space is not large enough to hold the data.
-
toByteArray
public byte[] toByteArray()Copies bytes to abyte[].- Returns:
- copied bytes
-
writeTo
Writes the complete contents of this byte string to the specified output stream argument.- Parameters:
out- the output stream to which to write the data.- Throws:
IOException- if an I/O error occurs.
-
asReadOnlyByteBuffer
Constructs a read-onlyjava.nio.ByteBufferwhose content is equal to the contents of this byte string. The result uses the same backing array as the byte string, if possible.- Returns:
- wrapped bytes
-
asReadOnlyByteBufferList
Constructs a list of read-onlyjava.nio.ByteBufferobjects such that the concatenation of their contents is equal to the contents of this byte string. The result uses the same backing arrays as the byte string.By returning a list, implementations of this method may be able to avoid copying even when there are multiple backing arrays.
- Returns:
- a list of wrapped bytes
-
toString
Constructs a newStringby decoding the bytes using the specified charset.- Parameters:
charsetName- encode using this charset- Returns:
- new string
- Throws:
UnsupportedEncodingException- if charset isn't recognized
-
toStringUtf8
Constructs a newStringby decoding the bytes as UTF-8.- Returns:
- new string using UTF-8 encoding
-
isValidUtf8
public abstract boolean isValidUtf8()Tells whether thisByteStringrepresents a well-formed UTF-8 byte sequence, such that the original bytes can be converted to a String object and then round tripped back to bytes without loss.More precisely, returns
truewhenever:Arrays.equals(byteString.toByteArray(), new String(byteString.toByteArray(), "UTF-8").getBytes("UTF-8"))This method returns
falsefor "overlong" byte sequences, as well as for 3-byte sequences that would map to a surrogate character, in accordance with the restricted definition of UTF-8 introduced in Unicode 3.1. Note that the UTF-8 decoder included in Oracle's JDK has been modified to also reject "overlong" byte sequences, but (as of 2011) still accepts 3-byte surrogate character byte sequences.See the Unicode Standard, Table 3-6. UTF-8 Bit Distribution, Table 3-7. Well Formed UTF-8 Byte Sequences.
- Returns:
- whether the bytes in this
ByteStringare a well-formed UTF-8 byte sequence
-
partialIsValidUtf8
protected abstract int partialIsValidUtf8(int state, int offset, int length) Tells whether the given byte sequence is a well-formed, malformed, or incomplete UTF-8 byte sequence. This method accepts and returns a partial state result, allowing the bytes for a complete UTF-8 byte sequence to be composed from multipleByteStringsegments.- Parameters:
state- either0(if this is the initial decoding operation) or the value returned from a call to a partial decoding method for the previous bytesoffset- offset of the first byte to checklength- number of bytes to check- Returns:
-1if the partial byte sequence is definitely malformed,0if it is well-formed (no additional input needed), or, if the byte sequence is "incomplete", i.e. apparently terminated in the middle of a character, an opaque integer "state" value containing enough information to decode the character when passed to a subsequent invocation of a partial decoding method.
-
equals
-
hashCode
public abstract int hashCode()Return a non-zero hashCode depending only on the sequence of bytes in this ByteString. -
newInput
Creates anInputStreamwhich can be used to read the bytes.The
InputStreamreturned by this method is guaranteed to be completely non-blocking. The methodInputStream.available()returns the number of bytes remaining in the stream. The methods {@link InputStream#read(byte[]),InputStream.read(byte[],int,int)andInputStream.skip(long)will read/skip as many bytes as are available.The methods in the returned
InputStreammight not be thread safe.- Returns:
- an input stream that returns the bytes of this byte string.
-
newCodedInput
Creates aCodedInputStreamwhich can be used to read the bytes. Using this is often more efficient than creating aCodedInputStreamthat wraps the result ofnewInput().- Returns:
- stream based on wrapped data
-
newOutput
Creates a newByteString.Outputwith the given initial capacity. CallByteString.Output.toByteString()to create theByteStringinstance.A
ByteString.Outputoffers the same functionality as aByteArrayOutputStream, except that it returns aByteStringrather than abytearray.- Parameters:
initialCapacity- estimate of number of bytes to be written- Returns:
OutputStreamfor building aByteString
-
newOutput
Creates a newByteString.Output. CallByteString.Output.toByteString()to create theByteStringinstance.A
ByteString.Outputoffers the same functionality as aByteArrayOutputStream, except that it returns aByteStringrather than abyte array.- Returns:
OutputStreamfor building aByteString
-
getTreeDepth
protected abstract int getTreeDepth()Return the depth of the tree representing thisByteString, if any, whose root is this node. If this is a leaf node, return 0.- Returns:
- tree depth or zero
-
isBalanced
protected abstract boolean isBalanced()Returntrueif this ByteString is literal (a leaf node) or a flat-enough tree in the sense ofRopeByteString.- Returns:
- true if the tree is flat enough
-
peekCachedHashCode
protected abstract int peekCachedHashCode()Return the cached hash code if available.- Returns:
- value of cached hash code or 0 if not computed yet
-
partialHash
protected abstract int partialHash(int h, int offset, int length) Compute the hash across the value bytes starting with the given hash, and return the result. This is used to compute the hash across strings represented as a set of pieces by allowing the hash computation to be continued from piece to piece.- Parameters:
h- starting hash valueoffset- offset into this value to start looking at data valueslength- number of data values to include in the hash computation- Returns:
- ending hash value
-
toString
-