Optimization options
This page documents various supported flags for optimizing IREE programs. Each is presented with its English name, flag to enable/disable, and default state.
These flags can be passed to the:
- iree-compile command line tool
- extra_args=["--flag"] argument to iree.compiler.tools Python wrappers
- In-process Python compiler API iree.compiler.transforms.iree-compile.CompilerOptions("--flag", "--flag2") constructor
- ireeCompilerOptionsSetFlags() compiler C API function
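As a rough illustration of the first two routes, the sketch below assembles one flag list and reuses it for an iree-compile command line and for an extra_args list. The helper name build_compile_command, the file names, and the use of -o for the output path are assumptions made for illustration. Nothing is executed, so IREE does not need to be installed to run it.

```python
import shlex


def build_compile_command(input_path, output_path, flags):
    """Return an argv list for an iree-compile invocation (hypothetical helper)."""
    # Assumption: the output path is passed with -o; file names are placeholders.
    return ["iree-compile", input_path, *flags, "-o", output_path]


# Flags taken from this page.
flags = [
    "--iree-opt-level=O0",
    "--iree-dispatch-creation-enable-aggressive-fusion=true",
]

argv = build_compile_command("model.mlir", "model.vmfb", flags)
print(shlex.join(argv))  # shell-quoted command line, ready to copy and paste

# The same list could be handed to the Python wrappers as extra_args.
extra_args = list(flags)
```

Keeping the flags in one list makes it easier to use identical settings across the command line and the Python wrappers.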
Optimization level
As in other compilers like clang and gcc, IREE provides a high level optimization level flag (iree-opt-level) that enables different sets of underlying options.
iree-opt-level specifies the optimization level for the entire compilation flow. Lower optimization levels prioritize debuggability and stability, while higher levels focus on maximizing performance. By default, iree-opt-level is set to O0 (minimal or no optimizations).
Note
Not all flags that control performance are nested under iree-opt-level. See High level program optimizations below for flags that the optimization level does not cover.
This flag takes the following values:
Optimization Level | Pros | Cons |
---|---|---|
O0 (Default, Minimal Optimizations) | | |
O1 (Basic Optimizations) | | |
O2 (Optimizations without full backend support) | | |
O3 (Aggressive Optimization) | | |
Although iree-opt-level sets the default for each subflag, each subflag can still be explicitly set on or off independently.
For example:
# Apply the default optimizations of `O2` but don't remove assertions.
iree-compile --iree-opt-level=O2 --iree-opt-strip-assertions=false
# Minimize optimizations, but still perform aggressive fusion.
iree-compile --iree-opt-level=O0 --iree-dispatch-creation-enable-aggressive-fusion=true
Pipeline-level control
In addition to iree-opt-level, IREE provides optimization controls at the pipeline level. These flags allow fine-grained tuning of specific compilation stages while still respecting the topmost optimization level unless explicitly overridden.
Dispatch Creation (iree-dispatch-creation-opt-level)
- iree-dispatch-creation-enable-aggressive-fusion (enabled at O2)
  Enables more aggressive fusion opportunities not yet supported by all backends.
Global Optimization (iree-global-optimization-opt-level)
- iree-opt-strip-assertions (enabled at O1)
  Strips all std.assert ops in the input program after useful information for optimization analysis has been extracted. Assertions provide useful user-visible error messages but can prevent critical optimizations. Assertions are not, however, a substitute for control flow, and frontends that want to check errors in optimized release builds should do so via actual code - similar to when one would write if (foo) return false; vs. assert(foo); in a normal program.
- iree-opt-outer-dim-concat (enabled at O1)
  Transposes concat operations so that they occur along the outermost dimension. The resulting concat is then contiguous, and the inserted transposes can possibly be fused with surrounding ops.
- iree-opt-aggressively-propagate-transposes (enabled at O3)
  Enables more transpose propagation by allowing transposes to be propagated to linalg named ops even when the resulting op will be a linalg.generic.
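The interaction between the three layers of control can be sketched as a small lookup model. This is not IREE's implementation. It assumes that an explicitly set subflag wins over a pipeline-level opt level, which in turn wins over iree-opt-level, and that a subflag enabled at one level stays enabled at every higher level. The function name resolve and the dictionary layout are hypothetical. The flag names and "enabled at" levels are the ones listed above.

```python
LEVELS = ["O0", "O1", "O2", "O3"]

# Level at which each subflag becomes enabled, grouped by the
# pipeline-level flag that governs it (values taken from this page).
PIPELINES = {
    "iree-dispatch-creation-opt-level": {
        "iree-dispatch-creation-enable-aggressive-fusion": "O2",
    },
    "iree-global-optimization-opt-level": {
        "iree-opt-strip-assertions": "O1",
        "iree-opt-outer-dim-concat": "O1",
        "iree-opt-aggressively-propagate-transposes": "O3",
    },
}


def resolve(opt_level="O0", pipeline_levels=None, overrides=None):
    """Model which subflags end up enabled (illustrative, not IREE code)."""
    pipeline_levels = pipeline_levels or {}
    overrides = overrides or {}
    resolved = {}
    for pipeline_flag, subflags in PIPELINES.items():
        # A pipeline-level setting replaces the topmost level for its stage.
        level = pipeline_levels.get(pipeline_flag, opt_level)
        for subflag, enabled_at in subflags.items():
            # Assumption: enabled at `enabled_at` and every level above it.
            default = LEVELS.index(level) >= LEVELS.index(enabled_at)
            # An explicit subflag setting always wins.
            resolved[subflag] = overrides.get(subflag, default)
    return resolved


# O2 defaults, but keep assertions.
settings = resolve("O2", overrides={"iree-opt-strip-assertions": False})

# O0 overall, but run dispatch creation as if at O2.
staged = resolve("O0", pipeline_levels={"iree-dispatch-creation-opt-level": "O2"})
```

In this model, settings keeps aggressive fusion and outer-dim concat enabled while leaving assertions in place, and staged enables only aggressive fusion.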
High level program optimizations
Constant evaluation (--iree-opt-const-eval (on))
Performs compile-time evaluation of any global initializers which produce the initial values for global constants, storing the global directly in the program as constant data. This extracts such constant program fragments and recursively compiles them, using the runtime to evaluate the results.
Note that this only has any effect on computations in module initializer functions, not free-standing operations in the program which may produce constant-derived results. See --iree-opt-const-expr-hoisting for options to optimize these.
Constant expression hoisting (--iree-opt-const-expr-hoisting (off))
Identifies all trees of constant expressions in the program and uses a heuristic to determine which would be profitable to hoist into global initializers for evaluation at module load. Together with --iree-opt-const-eval, this will convert eligible trees of expressions to purely static data embedded in the module.
The heuristic is currently relatively primitive, using static information to disable hoisting of leaf operations which are metadata only (e.g. broadcasts) or are expected to fold away as part of operator fusion. Notably, the current heuristic is likely to pessimize module size in the case of complicated programs with trees of large constant tensors.
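To make the relationship between hoisting and constant evaluation concrete, here is a toy model on a tiny expression tree. It is only a sketch of the idea, not IREE's pass or its heuristic. The tuple-based expression format, the helper names, and the rule "hoist every maximal constant subtree, leave leaves in place" are all assumptions chosen for brevity.

```python
import operator

OPS = {"add": operator.add, "mul": operator.mul}


def is_constant(expr):
    """True when the subtree depends on no program input."""
    kind = expr[0]
    if kind == "const":
        return True
    if kind == "input":
        return False
    return all(is_constant(arg) for arg in expr[1:])


def evaluate(expr):
    """Evaluate a constant-only subtree."""
    if expr[0] == "const":
        return expr[1]
    return OPS[expr[0]](*(evaluate(arg) for arg in expr[1:]))


def hoist(expr, initializers):
    """Replace maximal constant subtrees with references to globals."""
    kind = expr[0]
    if kind in ("const", "input"):
        return expr  # leaves stay where they are
    if is_constant(expr):
        initializers.append(expr)  # moved into a global initializer
        return ("global", len(initializers) - 1)
    return (kind, *(hoist(arg, initializers) for arg in expr[1:]))


# x + 3 * (1 + 2): only the right-hand operand is constant.
program = ("add", ("input", "x"),
           ("mul", ("const", 3), ("add", ("const", 1), ("const", 2))))

initializers = []
rewritten = hoist(program, initializers)

# Constant evaluation: run each initializer once, ahead of time,
# and keep only the resulting data.
global_values = [evaluate(init) for init in initializers]
```

After both steps the program refers to a single precomputed global in place of the constant subtree, which mirrors the "purely static data embedded in the module" outcome described above.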
Numeric precision reduction (--iree-opt-numeric-precision-reduction (off))
Analyzes program constant data and program flow to identify math operations which can be safely evaluated with reduced precision (currently with a minimum of 8-bit integers, but being extended to infer any bit depth) and inserts appropriate casts. In conjunction with Constant Expression Hoisting, Constant Evaluation and other automatic optimizations, this can produce programs where large portions (up to the whole program) have had their numeric operations and constant data rewritten to lower precision types.
This feature is actively evolving and will be the subject of dedicated documentation when ready.
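Until that documentation exists, the basic idea for constant data alone can be illustrated with a toy check: if every value in a constant is a whole number that fits in a signed 8-bit integer, it can be stored in the narrower type without loss. This is a simplified sketch, not IREE's analysis, which also reasons about program flow. The function names and the "i8"/"f32" type labels are illustrative assumptions.

```python
def fits_int8(values):
    """True when every value is a whole number within the signed 8-bit range."""
    return all(float(v).is_integer() and -128 <= v <= 127 for v in values)


def reduce_precision(values):
    """Return (type_name, values), narrowing to 8-bit integers when lossless."""
    if fits_int8(values):
        return "i8", [int(v) for v in values]
    return "f32", list(values)


narrowed = reduce_precision([1.0, -3.0, 127.0])  # whole numbers in range
kept = reduce_precision([0.5, 2.0])              # 0.5 is not a whole number
too_big = reduce_precision([300.0])              # outside the 8-bit range
```

Only the first constant is narrowed. The other two keep their original type because narrowing would change their values.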