Streaming SIMD Extension
8 new 128-bit SSE registers (OS support)
Execute two SSE instructions simultaneously
- Double-cycle through existing 64-bit hardware
- 2 micro-ops per SSE instruction
- Latency often two cycles or greater
Explicit scalar SSE instructions
Prefetch to various levels of cache