Avoid the RAM Latency: Keeping the Cache Hot and on Linear Access is the Ultimate C++ Optimization
In this benchmark, we explore the importance of keeping data within the CPU cache to avoid expensive retrieval from RAM. By simply ensuring linear data access and accessing by blocks that fit in L1 and L2, we can achieve massive performance gains without changing the underlying algorithm.