Class ColGroupUncompressedArray
- java.lang.Object
-
- org.apache.sysds.runtime.compress.colgroup.AColGroup
-
- org.apache.sysds.runtime.compress.colgroup.ColGroupUncompressedArray
-
- All Implemented Interfaces:
Serializable
public class ColGroupUncompressedArray extends AColGroup
Special sideways compressed column group not supposed to be used outside of the compressed transform encode.- See Also:
- Serialized Form
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from class org.apache.sysds.runtime.compress.colgroup.AColGroup
AColGroup.CompressionType
-
-
Constructor Summary
Constructors Constructor Description ColGroupUncompressedArray(Array<?> data, int id, IColIndex colIndexes)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description AColGroupappend(AColGroup g)Append the other column group to this column group.AColGroupbinaryRowOpLeft(BinaryOperator op, double[] v, boolean isRowSafe)Perform a binary row operation.AColGroupbinaryRowOpRight(BinaryOperator op, double[] v, boolean isRowSafe)Perform a binary row operation.CM_COV_ObjectcentralMoment(CMOperator op, int nRows)Central Moment instruction executed on a column group.voidcomputeColSums(double[] c, int nRows)Compute the column sumbooleancontainsValue(double pattern)Detect if the column group contains a specific value.AColGroupcopyAndSet(IColIndex colIndexes)Copy the content of the column group with pointers to the previous content but with new column given Note this method does not verify if the colIndexes specified are valid and correct dimensions for the underlying column groups.voiddecompressToDenseBlock(DenseBlock db, int rl, int ru, int offR, int offC)Decompress into the DenseBlock.voiddecompressToDenseBlockTransposed(DenseBlock db, int rl, int ru)Decompress a range of rows into a dense transposed block.voiddecompressToSparseBlock(SparseBlock sb, int rl, int ru, int offR, int offC)Decompress into the SparseBlock.voiddecompressToSparseBlockTransposed(SparseBlockMCSR sb, int nColOut)Decompress the column group to the sparse transposed block.longestimateInMemorySize()Get the upper bound estimate of in memory allocation for the column group.CompressedSizeInfoColGroupgetCompressionInfo(int nRow)Get the compression info for this column group.ICLASchemegetCompressionScheme()Get the compression scheme for this column group to enable compression of other data.AColGroup.CompressionTypegetCompType()Obtain the compression type.doublegetCost(ComputationCostEstimator e, int nRows)Get the computation cost associated with this column group.doublegetIdx(int r, int colIdx)Get the value at a colGroup specific row/column index position.doublegetMax()Short hand method for getting maximum value contained in this column group.doublegetMin()Short hand method for getting minimum value contained in this column group.longgetNumberNonZeros(int nRows)Get the number of nonZeros contained in this column group.intgetNumValues()Obtain number of distinct tuples in contained sets of values associated with this column group.doublegetSparsity()Get an approximate sparsity of this column groupdoublegetSum(int nRows)Short hand method for getting the sum of this column groupbooleanisEmpty()Get if the group is only containing zerovoidleftMultByAColGroup(AColGroup lhs, MatrixBlock result, int nRows)Left side matrix multiplication with a column group that is transposed.voidleftMultByMatrixNoPreAgg(MatrixBlock matrix, MatrixBlock result, int rl, int ru, int cl, int cu)Left multiply with this column group.AColGrouprecompress()Recompress this column group into a new column group.AColGroupreduceCols()Perform row sum on the internal dictionaries, and return the same index structure.AColGroupreplace(double pattern, double replace)Make a copy of the column group values, and replace all values that match pattern with replacement value.AColGrouprexpandCols(int max, boolean ignore, boolean cast, int nRows)Expand the column group to multiple columns.AColGrouprightMultByMatrix(MatrixBlock right, IColIndex allCols, int k)Right matrix multiplication with this column group.AColGroupscalarOperation(ScalarOperator op)Perform the specified scalar operation directly on the compressed column group, without decompressing individual cells if possible.AColGroupsliceRows(int rl, int ru)Slice range of rows out of the column group and return a new column group only containing the row segment.AColGroup[]splitReshape(int multiplier, int nRow, int nColOrg)This method returns a list of column groups that are naive splits of this column group as if it is reshaped.StringtoString()voidtsmm(MatrixBlock ret, int nRows)Do a transposed self matrix multiplication on the left side t(x) %*% x.voidtsmmAColGroup(AColGroup other, MatrixBlock result)Matrix multiply with this other column group, but: 1.voidunaryAggregateOperations(AggregateUnaryOperator op, double[] c, int nRows, int rl, int ru)Unary Aggregate operator, since aggregate operators require new object output, the output becomes an uncompressed matrix.AColGroupunaryOperation(UnaryOperator op)Perform unary operation on the column group and return a new column group-
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.AColGroup
addVector, appendN, clear, colSum, combine, combineWithSameIndex, combineWithSameIndex, decompressToDenseBlock, decompressToSparseBlock, get, getColIndices, getEncoding, getExactSizeOnDisk, getNumCols, morph, rightDecompressingMult, rightMultByMatrix, sameIndexStructure, selectionMultiply, shiftColIndices, sliceColumn, sliceColumns, sortColumnIndexes, splitReshapePushDown
-
-
-
-
Field Detail
-
array
public final Array<?> array
-
id
public final int id
-
-
Method Detail
-
getNumValues
public int getNumValues()
Description copied from class:AColGroupObtain number of distinct tuples in contained sets of values associated with this column group. If the column group is uncompressed the number or rows is returned.- Specified by:
getNumValuesin classAColGroup- Returns:
- the number of distinct sets of values associated with the bitmaps in this column group
-
estimateInMemorySize
public long estimateInMemorySize()
Description copied from class:AColGroupGet the upper bound estimate of in memory allocation for the column group.- Overrides:
estimateInMemorySizein classAColGroup- Returns:
- an upper bound on the number of bytes used to store this ColGroup in memory.
-
copyAndSet
public AColGroup copyAndSet(IColIndex colIndexes)
Description copied from class:AColGroupCopy the content of the column group with pointers to the previous content but with new column given Note this method does not verify if the colIndexes specified are valid and correct dimensions for the underlying column groups.- Specified by:
copyAndSetin classAColGroup- Parameters:
colIndexes- the new indexes to use in the copy- Returns:
- a new object with pointers to underlying data.
-
decompressToDenseBlockTransposed
public void decompressToDenseBlockTransposed(DenseBlock db, int rl, int ru)
Description copied from class:AColGroupDecompress a range of rows into a dense transposed block.- Specified by:
decompressToDenseBlockTransposedin classAColGroup- Parameters:
db- Dense target blockrl- Row in this column group to start at.ru- Row in this column group to end at.
-
decompressToSparseBlockTransposed
public void decompressToSparseBlockTransposed(SparseBlockMCSR sb, int nColOut)
Description copied from class:AColGroupDecompress the column group to the sparse transposed block. Note that the column groups would only need to decompress into specific sub rows of the Sparse block- Specified by:
decompressToSparseBlockTransposedin classAColGroup- Parameters:
sb- Sparse target blocknColOut- The number of columns in the sb.
-
getIdx
public double getIdx(int r, int colIdx)Description copied from class:AColGroupGet the value at a colGroup specific row/column index position.
-
getCompType
public AColGroup.CompressionType getCompType()
Description copied from class:AColGroupObtain the compression type.- Specified by:
getCompTypein classAColGroup- Returns:
- How the elements of the column group are compressed.
-
decompressToDenseBlock
public void decompressToDenseBlock(DenseBlock db, int rl, int ru, int offR, int offC)
Description copied from class:AColGroupDecompress into the DenseBlock. (no NNZ handling)- Specified by:
decompressToDenseBlockin classAColGroup- Parameters:
db- Target DenseBlockrl- Row to start decompression fromru- Row to end decompression at (not inclusive)offR- Row offset into the target to decompressoffC- Column offset into the target to decompress
-
decompressToSparseBlock
public void decompressToSparseBlock(SparseBlock sb, int rl, int ru, int offR, int offC)
Description copied from class:AColGroupDecompress into the SparseBlock. (no NNZ handling) Note this method is allowing to calls to append since it is assumed that the sparse column indexes are sorted afterwards- Specified by:
decompressToSparseBlockin classAColGroup- Parameters:
sb- Target SparseBlockrl- Row to start decompression fromru- Row to end decompression at (not inclusive)offR- Row offset into the target to decompressoffC- Column offset into the target to decompress
-
rightMultByMatrix
public AColGroup rightMultByMatrix(MatrixBlock right, IColIndex allCols, int k)
Description copied from class:AColGroupRight matrix multiplication with this column group. This method can return null, meaning that the output overlapping group would have been empty.- Specified by:
rightMultByMatrixin classAColGroup- Parameters:
right- The MatrixBlock on the right of this matrix multiplicationallCols- A pre-materialized list of all col indexes, that can be shared across all column groups if use full, can be set to null.k- The parallelization degree allowed internally in this operation.- Returns:
- The new Column Group or null that is the result of the matrix multiplication.
-
tsmm
public void tsmm(MatrixBlock ret, int nRows)
Description copied from class:AColGroupDo a transposed self matrix multiplication on the left side t(x) %*% x. but only with this column group. This gives better performance since there is no need to iterate through all the rows of the matrix, but the execution can be limited to its number of distinct values. Note it only calculate the upper triangle
-
leftMultByMatrixNoPreAgg
public void leftMultByMatrixNoPreAgg(MatrixBlock matrix, MatrixBlock result, int rl, int ru, int cl, int cu)
Description copied from class:AColGroupLeft multiply with this column group.- Specified by:
leftMultByMatrixNoPreAggin classAColGroup- Parameters:
matrix- The matrix to multiply with on the leftresult- The result to output the values into, always dense for the purpose of the column groups parallelizingrl- The row to begin the multiplication from on the lhs matrixru- The row to end the multiplication at on the lhs matrixcl- The column to begin the multiplication from on the lhs matrixcu- The column to end the multiplication at on the lhs matrix
-
leftMultByAColGroup
public void leftMultByAColGroup(AColGroup lhs, MatrixBlock result, int nRows)
Description copied from class:AColGroupLeft side matrix multiplication with a column group that is transposed.- Specified by:
leftMultByAColGroupin classAColGroup- Parameters:
lhs- The left hand side Column group to multiply with, the left hand side should be considered transposed. Also it should be guaranteed that this column group is not empty.result- The result matrix to insert the result of the multiplication intonRows- Number of rows in the lhs colGroup
-
tsmmAColGroup
public void tsmmAColGroup(AColGroup other, MatrixBlock result)
Description copied from class:AColGroupMatrix multiply with this other column group, but: 1. Only output upper triangle values. 2. Multiply both ways with "this" being on the left and on the right. It should be guaranteed that the input is not the same as the caller of the method. The second step is achievable by treating the initial multiplied matrix, and adding its values to the correct locations in the output.- Specified by:
tsmmAColGroupin classAColGroup- Parameters:
other- The other Column group to multiply withresult- The result matrix to put the results into
-
scalarOperation
public AColGroup scalarOperation(ScalarOperator op)
Description copied from class:AColGroupPerform the specified scalar operation directly on the compressed column group, without decompressing individual cells if possible.- Specified by:
scalarOperationin classAColGroup- Parameters:
op- operation to perform- Returns:
- version of this column group with the operation applied
-
binaryRowOpLeft
public AColGroup binaryRowOpLeft(BinaryOperator op, double[] v, boolean isRowSafe)
Description copied from class:AColGroupPerform a binary row operation.- Specified by:
binaryRowOpLeftin classAColGroup- Parameters:
op- The operation to executev- The vector of values to apply the values contained should be at least the length of the highest value in the column indexisRowSafe- True if the binary op is applied to an entire zero row and all results are zero- Returns:
- A updated column group with the new values.
-
binaryRowOpRight
public AColGroup binaryRowOpRight(BinaryOperator op, double[] v, boolean isRowSafe)
Description copied from class:AColGroupPerform a binary row operation.- Specified by:
binaryRowOpRightin classAColGroup- Parameters:
op- The operation to executev- The vector of values to apply the values contained should be at least the length of the highest value in the column indexisRowSafe- True if the binary op is applied to an entire zero row and all results are zero- Returns:
- A updated column group with the new values.
-
unaryAggregateOperations
public void unaryAggregateOperations(AggregateUnaryOperator op, double[] c, int nRows, int rl, int ru)
Description copied from class:AColGroupUnary Aggregate operator, since aggregate operators require new object output, the output becomes an uncompressed matrix. The range of rl to ru only applies to row aggregates. (ReduceCol)- Specified by:
unaryAggregateOperationsin classAColGroup- Parameters:
op- The operator usedc- The output matrix blocknRows- The total number of rows in the Column Grouprl- The starting row to do aggregation fromru- The last row to do aggregation to (not included)
-
sliceRows
public AColGroup sliceRows(int rl, int ru)
Description copied from class:AColGroupSlice range of rows out of the column group and return a new column group only containing the row segment. Note that this slice should maintain pointers back to the original dictionaries and only modify index structures.
-
getMin
public double getMin()
Description copied from class:AColGroupShort hand method for getting minimum value contained in this column group.
-
getMax
public double getMax()
Description copied from class:AColGroupShort hand method for getting maximum value contained in this column group.
-
getSum
public double getSum(int nRows)
Description copied from class:AColGroupShort hand method for getting the sum of this column group
-
containsValue
public boolean containsValue(double pattern)
Description copied from class:AColGroupDetect if the column group contains a specific value.- Specified by:
containsValuein classAColGroup- Parameters:
pattern- The value to look for.- Returns:
- boolean saying true if the value is contained.
-
getNumberNonZeros
public long getNumberNonZeros(int nRows)
Description copied from class:AColGroupGet the number of nonZeros contained in this column group.- Specified by:
getNumberNonZerosin classAColGroup- Parameters:
nRows- The number of rows in the column group, this is used for groups that does not contain information about how many rows they have.- Returns:
- The nnz.
-
replace
public AColGroup replace(double pattern, double replace)
Description copied from class:AColGroupMake a copy of the column group values, and replace all values that match pattern with replacement value.
-
computeColSums
public void computeColSums(double[] c, int nRows)Description copied from class:AColGroupCompute the column sum- Specified by:
computeColSumsin classAColGroup- Parameters:
c- The array to add the column sum to.nRows- The number of rows in the column group.
-
centralMoment
public CM_COV_Object centralMoment(CMOperator op, int nRows)
Description copied from class:AColGroupCentral Moment instruction executed on a column group.- Specified by:
centralMomentin classAColGroup- Parameters:
op- The Operator to use.nRows- The number of rows contained in the ColumnGroup.- Returns:
- A Central Moment object.
-
rexpandCols
public AColGroup rexpandCols(int max, boolean ignore, boolean cast, int nRows)
Description copied from class:AColGroupExpand the column group to multiple columns. (one hot encode the column group)- Specified by:
rexpandColsin classAColGroup- Parameters:
max- The number of columns to expand to and cutoff values at.ignore- If zero and negative values should be ignored.cast- If the double values contained should be cast to whole numbers.nRows- The number of rows in the column group.- Returns:
- A new column group containing max number of columns.
-
getCost
public double getCost(ComputationCostEstimator e, int nRows)
Description copied from class:AColGroupGet the computation cost associated with this column group.
-
unaryOperation
public AColGroup unaryOperation(UnaryOperator op)
Description copied from class:AColGroupPerform unary operation on the column group and return a new column group- Specified by:
unaryOperationin classAColGroup- Parameters:
op- The operation to perform- Returns:
- The new column group
-
isEmpty
public boolean isEmpty()
Description copied from class:AColGroupGet if the group is only containing zero
-
append
public AColGroup append(AColGroup g)
Description copied from class:AColGroupAppend the other column group to this column group. This method tries to combine them to return a new column group containing both. In some cases it is possible in reasonable time, in others it is not. The result is first this column group followed by the other column group in higher row values. If it is not possible or very inefficient null is returned.
-
getCompressionScheme
public ICLAScheme getCompressionScheme()
Description copied from class:AColGroupGet the compression scheme for this column group to enable compression of other data.- Specified by:
getCompressionSchemein classAColGroup- Returns:
- The compression scheme of this column group
-
recompress
public AColGroup recompress()
Description copied from class:AColGroupRecompress this column group into a new column group.- Specified by:
recompressin classAColGroup- Returns:
- A new or the same column group depending on optimization goal.
-
getCompressionInfo
public CompressedSizeInfoColGroup getCompressionInfo(int nRow)
Description copied from class:AColGroupGet the compression info for this column group.- Specified by:
getCompressionInfoin classAColGroup- Parameters:
nRow- The number of rows in this column group.- Returns:
- The compression info for this group.
-
reduceCols
public AColGroup reduceCols()
Description copied from class:AColGroupPerform row sum on the internal dictionaries, and return the same index structure. This method returns null on empty column groups. Note this method does not guarantee correct behavior if the given group is AMorphingGroup, instead it should be morphed to a valid columngroup via extractCommon first.- Specified by:
reduceColsin classAColGroup- Returns:
- The reduced colgroup.
-
getSparsity
public double getSparsity()
Description copied from class:AColGroupGet an approximate sparsity of this column group- Specified by:
getSparsityin classAColGroup- Returns:
- the approximate sparsity of this columngroup
-
splitReshape
public AColGroup[] splitReshape(int multiplier, int nRow, int nColOrg)
Description copied from class:AColGroupThis method returns a list of column groups that are naive splits of this column group as if it is reshaped. This means the column groups rows are split into x number of other column groups where x is the multiplier. The indexes are assigned round robbin to each of the output groups, meaning the first index is assigned. If for instance the 4. column group is split by a 2 multiplier and there was 5 columns in total originally. The output becomes 2 column groups at column index 4 and one at 9. If possible the split column groups should reuse pointers back to the original dictionaries!- Specified by:
splitReshapein classAColGroup- Parameters:
multiplier- The number of column groups to split intonRow- The number of rows in this column group in case the underlying column group does not knownColOrg- The number of overall columns in the host CompressedMatrixBlock.- Returns:
- a list of split column groups
-
-