(compute, capacity, movement, latency)
rate at which executable operations are retired assuming ideal data availability and resolved dependencies.
instruction throughput
execution width
pipeline structure
specialized units
frequency behavior
maximum live state that can be retained without eviction, spill, or recomputation. capacity is tiered. each tier introduces a discrete eviction boundary.
registers
caches
main memory
persistent working set
maximum rate at which state can cross a boundary between capacity domains. movement is layered and directional. each boundary has an independent ceiling.
intra-core transfer
cache hierarchy transfer
memory subsystem
interconnect
storage io
minimum time required to resolve a dependency chain. latency bounds critical paths and tail behavior independent of steady-state throughput.
memory access latency
control flow latency
synchronization latency
io latency
remote access latency
this section classifies common low-level workloads by the tuple element that first violates its invariant. the intent is diagnostic compression rather than exhaustive taxonomy.
typical cases
characteristic structure
typical cases
characteristic structure
typical cases
characteristic structure
typical cases
characteristic structure
typical cases
characteristic structure
performance optimization is constraint management under fixed semantics.
the objective is to delay violation of the dominant invariant governing execution. time, throughput, and latency are observable projections of which constraint binds first.
given:
optimization seeks to:
optimality is defined relative to a binding invariant.
an implementation may be optimal along one axis while pathological along another. claims of optimality are only meaningful when qualified by the dominant constraint.
hardware scaling is effective if and only if:
otherwise hardware scaling amplifies inefficiency already present in the mapping.
determine which axis saturates or dominates delay. optimization without this identification is unguided.
reduce demand on the binding axis before addressing others. improvements to non-binding axes do not change first-order behavior.
effective optimizations exchange pressure across axes:
all such trades must preserve semantics and invariants.
structural changes:
parametric changes:
structural changes alter which invariant binds. parametric changes only approach an existing bound.