DispatchCreation

`-iree-dispatch-creation-annotate-data-tiling-hints`link

Adds data-tiling hint attribute to linalg operations.

The pass does nothing, if any operation already has the data-tiling hint attribute. Otherwise, it aggressively filters linalg ops and adds the data-tiling hint attribute to the operations.

`-iree-dispatch-creation-bitcast-unsupported-element-types`link

Bitcasts tensor element types unsupported by the HAL

`-iree-dispatch-creation-bubble-up-expand-shapes`link

Propagate expand_shapes up the program (and collapse_shapes down).

Optionslink

-enable-reshape-movement-across-reductions : Enables propagation of 'expand_shape's and 'collapse_shape's through 'linalg.generic's with reductions

`-iree-dispatch-creation-clone-producers-into-dispatch-regions`link

Clone producers into dispatch regions to be isolated above.

Pass to clone into dispatch regions producers of values used in the dispatch regions but defined in the above. This prepares the dispatch regions for converting to dispatch workgroups with explicit captures.

Optionslink

-aggressive : Include operations that are cloned only under aggressive fusion mode

`-iree-dispatch-creation-collapse-dimensions`link

Collapse dimensions of Linalg Ops on tensor ops.

Collapse dimensions of Linalg Ops on tensor ops inside dispatch.region ops and hoist the reshaping operations out of the dispatch.

Optionslink

-max-iterations : Maximum number of iterations to wait for collapse dimensions to converge

`-iree-dispatch-creation-convert-dispatch-regions-to-workgroups`link

Convert dispatch regions to dispatch workgroups.

Pass to convert dispatch regions to dispatch workgroups. This pass is intended to be used after dispatch regions have been formed.

Statisticslink

num-dispatches : Number of dispatches created

`-iree-dispatch-creation-convert-encoding-to-flow`link

Convert top-level Encoding ops to Flow ops.

Pass to convert top-level Encoding ops to Flow ops, which only converts the Encoding ops outside flow.dispatch.region to Flow.

`-iree-dispatch-creation-convert-tensor-to-flow`link

Convert tensor operations to flow

Pass to convert tensor operations to flow.tensor.* operations.

Statisticslink

num-slow-copy-dispatches : Number of slow copy dispatches (for handling slices) created

`-iree-dispatch-creation-elementwise-op-fusion`link

Fuse elementwise operations.

Optionslink

-intra-dispatch       : Fuse operations within a dispatch only (default is to fuse only operations outside of a dispatch)
-fuse-multi-reduction : Fuse ops that have multiple reduction iterators
-fuse-truncate-ops    : Fuse producer truncate-like operations with consumers
-fuse-broadcast-ops   : Fuse broadcast-like ops with producers

`-iree-dispatch-creation-fold-reshapes-into-tensor-barriers`link

Fold reshape operations into tensor barriers.

Moves tensor.expand_shape and tensor.collapse_shape operations through iree_tensor_ext.compute_barrier.start and iree_tensor_ext.compute_barrier.end operations. This blocks reshapes on the edge of the program from being propagated.

`-iree-dispatch-creation-fold-unit-extent-dims`link

Fold unit extent dimension of operations.

Imports upstream patterns to fold unit extent dims but with IREE control.

`-iree-dispatch-creation-fold-unit-extent-dims-for-func`link

Fold unit extent dimension of operations on a function.

Imports upstream patterns to fold unit extent dims but with IREE control.

`-iree-dispatch-creation-form-dispatch-regions`link

Form Dispatch Region Ops from Linalg operations on tensors to form dispatch.regions.

Pass to form dispatch.region ops from Linalg on tensor ops. A dispatch region is created for each tiled loop nest. This pass only moves the root compute op into the dispatch region, allowing producers to be outside.

Optionslink

-aggressive-fusion       : Aggressive mode enabling fusions not ready for all backends
-fuse-pad-with-consumers : Enable fusing pad with consumer
-fuse-pad-with-producers : Enable fusion of pad with producers

`-iree-dispatch-creation-form-scalar-dispatches`link

Form Dispatch Regions for scalar computations.

`-iree-dispatch-creation-form-split-reduction-dispatches`link

Partially tile reduction operations and place into dispatches

Optionslink

-split-size      : Tile sizes for split reduction (innermost first)
-enable-fuse-pad : Fuse pad into scf.forall

`-iree-dispatch-creation-fuse-encoding-ops-into-dispatch-regions-pass`link

Fuses set_encoding ops into producer dispatch regions, or forms new dispatches around them.

Optionslink

-enable-aggressive-fusion : Enable encoding op fusion if the producer has more than one use

`-iree-dispatch-creation-fuse-horizontal-contractions`link

Fuses horizontal contraction ops

For cases where multiple contractions - that dont have a direct dependence - that have the same LHS operand - all the N dimensions of the RHS operands used are the same Such contractions can be executed as a single contraction, i.e.

A = matmul(lhs, rhs0); B = matmul(lhs, rhs1); C = matmul(lhs, rhs2);

can be combined into result = matmul(lhs, concat_along_N(rhs0, rhs1, rhs2)); A = slice0(result) B = slice1(result) C = slice2(result)

Instead of doing an actual concat of the RHS operands, and extracting slices of the result, the pass generates a single operation with - the lhs operands - all the rhs operands - multiple results representing the individual matmuls

Optionslink

-fusion-limit : Maximum number of contractions fused into one

Statisticslink

num-fusion-groups : Number of fusion groups found
num-size-2-groups : Number of fusion groups of size 2
num-size-3-groups : Number of fusion groups of size 3

`-iree-dispatch-creation-fuse-multi-use-elementwise-producer`link

Fuse elementwise linalg operations on tensors when producers have multiple uses.

Optionslink

-intra-dispatch : Fuse operations within a dispatch only (default is to fuse only operations outside of a dispatch)
-num-iterations : Number of iterations to fuse multiuse ops

`-iree-dispatch-creation-fusion-preprocessing`link

Run useful preprocessing patterns that help with fusion.

`-iree-dispatch-creation-hoist-encoding-ops`link

Hoists tensor encoding ops out of flow dispatch regions.

`-iree-dispatch-creation-hoist-uniform-scalar-compute`link

Hoists scalar (computation) out of dispatch regions.

`-iree-dispatch-creation-insert-tensor-barriers`link

Insert tensor barrier markers around computation regions.

Walks from function boundaries and inserts iree_tensor_ext.compute_barrier.start and iree_tensor_ext.compute_barrier.end operations at the boundaries of computation regions (tensor/linalg/linalgext ops). These barriers allow controlling where certain transformations like reshape propagation can occur.

The pass identifies compute operations and wraps tensor values flowing into and out of the compute region with the barrier operations.

`-iree-dispatch-creation-materialize-default-workgroup-count-region`link

Canonicalize dispatch workgroups ops.

Apply dispatch workgroups canonicalization patterns.

`-iree-dispatch-creation-propagate-encodings`link

Propagate encodings across other operations.

`-iree-dispatch-creation-remove-tensor-barriers`link

Remove tensor barrier markers from the program.

Removes iree_tensor_ext.compute_barrier.start and iree_tensor_ext.compute_barrier.end operations from the program. These are identity operations that simply pass through their operands, so they can be safely removed by replacing all uses of their results with their operands.

`-iree-dispatch-creation-set-encoding`link

Introduces tensor encoding for flow dispatch regions.

Sets the encoding for compute operations to allow execution of the operations in tiled/padded layouts.

Optionslink

-encoding-option : Select the type of encoding options to add.

`-iree-dispatch-creation-set-split-reduction-sizes`link

Set 'iree_linalg_ext.split_reduction' attribute on ops

Optionslink

-split-reduction-target-size : Target tile size for split reduction. Inner reduction dimensions are tiled first, with the tile size rounded up until it evenly divides the iteration domain.

`-iree-dispatch-creation-sink-reshapes`link

Sink reshapes to allow for compute op -> consumer fusion.

`-iree-dispatch-creation-split-reduction-ops`link

Split reduction dimension to increase parallelism.

`-iree-dispatch-creation-tensor-pad-to-tensor-insert-slice`link

Convert tensor.pad into linalg.fill + tensor.insert_slice.

Optionslink

-skip-one-linalg-use-case : Skip the op that has only one use which is usedby a Linalg op

`-iree-dispatch-creation-transpose-generic-ops`link

Transpose generic op loops.

DispatchCreation

-iree-dispatch-creation-annotate-data-tiling-hintslink

-iree-dispatch-creation-bitcast-unsupported-element-typeslink

-iree-dispatch-creation-bubble-up-expand-shapeslink

Optionslink

-iree-dispatch-creation-clone-producers-into-dispatch-regionslink

Optionslink

-iree-dispatch-creation-collapse-dimensionslink

Optionslink

-iree-dispatch-creation-convert-dispatch-regions-to-workgroupslink

Statisticslink

-iree-dispatch-creation-convert-encoding-to-flowlink

-iree-dispatch-creation-convert-tensor-to-flowlink

Statisticslink

-iree-dispatch-creation-elementwise-op-fusionlink

Optionslink

-iree-dispatch-creation-fold-reshapes-into-tensor-barrierslink

-iree-dispatch-creation-fold-unit-extent-dimslink

-iree-dispatch-creation-fold-unit-extent-dims-for-funclink

-iree-dispatch-creation-form-dispatch-regionslink

Optionslink

-iree-dispatch-creation-form-scalar-dispatcheslink

-iree-dispatch-creation-form-split-reduction-dispatcheslink

Optionslink

-iree-dispatch-creation-fuse-encoding-ops-into-dispatch-regions-passlink

Optionslink

-iree-dispatch-creation-fuse-horizontal-contractionslink

Optionslink

Statisticslink

-iree-dispatch-creation-fuse-multi-use-elementwise-producerlink

Optionslink

-iree-dispatch-creation-fusion-preprocessinglink

-iree-dispatch-creation-hoist-encoding-opslink

-iree-dispatch-creation-hoist-uniform-scalar-computelink

-iree-dispatch-creation-insert-tensor-barrierslink

-iree-dispatch-creation-materialize-default-workgroup-count-regionlink

-iree-dispatch-creation-propagate-encodingslink

-iree-dispatch-creation-remove-tensor-barrierslink

-iree-dispatch-creation-set-encodinglink

Optionslink

-iree-dispatch-creation-set-split-reduction-sizeslink

Optionslink

-iree-dispatch-creation-sink-reshapeslink

-iree-dispatch-creation-split-reduction-opslink

-iree-dispatch-creation-tensor-pad-to-tensor-insert-slicelink

Optionslink

-iree-dispatch-creation-transpose-generic-opslink

`-iree-dispatch-creation-annotate-data-tiling-hints`link

`-iree-dispatch-creation-bitcast-unsupported-element-types`link

`-iree-dispatch-creation-bubble-up-expand-shapes`link

`-iree-dispatch-creation-clone-producers-into-dispatch-regions`link

`-iree-dispatch-creation-collapse-dimensions`link

`-iree-dispatch-creation-convert-dispatch-regions-to-workgroups`link

`-iree-dispatch-creation-convert-encoding-to-flow`link

`-iree-dispatch-creation-convert-tensor-to-flow`link

`-iree-dispatch-creation-elementwise-op-fusion`link

`-iree-dispatch-creation-fold-reshapes-into-tensor-barriers`link

`-iree-dispatch-creation-fold-unit-extent-dims`link

`-iree-dispatch-creation-fold-unit-extent-dims-for-func`link

`-iree-dispatch-creation-form-dispatch-regions`link

`-iree-dispatch-creation-form-scalar-dispatches`link

`-iree-dispatch-creation-form-split-reduction-dispatches`link

`-iree-dispatch-creation-fuse-encoding-ops-into-dispatch-regions-pass`link

`-iree-dispatch-creation-fuse-horizontal-contractions`link

`-iree-dispatch-creation-fuse-multi-use-elementwise-producer`link

`-iree-dispatch-creation-fusion-preprocessing`link

`-iree-dispatch-creation-hoist-encoding-ops`link

`-iree-dispatch-creation-hoist-uniform-scalar-compute`link

`-iree-dispatch-creation-insert-tensor-barriers`link

`-iree-dispatch-creation-materialize-default-workgroup-count-region`link

`-iree-dispatch-creation-propagate-encodings`link

`-iree-dispatch-creation-remove-tensor-barriers`link

`-iree-dispatch-creation-set-encoding`link

`-iree-dispatch-creation-set-split-reduction-sizes`link

`-iree-dispatch-creation-sink-reshapes`link

`-iree-dispatch-creation-split-reduction-ops`link

`-iree-dispatch-creation-tensor-pad-to-tensor-insert-slice`link

`-iree-dispatch-creation-transpose-generic-ops`link