A Systems Engineer’s Guide to Benchmarking with RDTSC
Compiling Python to Run Anywhere
What Makes System Calls Expensive: A Linux Internals Deep Dive
Big O vs Hardware: Better Complexity ≠ Better Performance
One Law to Rule Them All: The Iron Law of Software Performance
Why This Python Performance Trick Doesn’t Matter Anymore
Python Performance: Why 'if not list' is 2x Faster Than Using len()
Hardware-Aware Coding: CPU Architecture Concepts Every Developer Should Know
Context Switching and Performance: What Every Developer Should Know
How Unix Spell Ran in 64kB RAM
Linux Context Switching Internals: Part 1 - Process State and Memory
The CAP Theorem of Clustering: Why Every Algorithm Must Sacrifice Something
Disillusioning the Magic of the fork System Call
An Unreachable Hidden XKCD Easter Egg inside CPython
CPython's Garbage Collector and its Impact on Application Performance
The Pythonic Emptiness
A Selective Survey of Efficient Speculative Decoding Techniques for LLM Inference
CPython Runtime Internals: Key Data Structures & Runtime Bootstrapping
The Design & Implementation of the CPython Virtual Machine
Are Function Calls Still Slow in Python? An Analysis of Recent Optimizations in CPython