Lesson 24 • Advanced
Profiling & Optimising Python Performance
High-performance Python isn't about writing "faster code" — it's about finding bottlenecks and eliminating them with scientific precision. You cannot optimise what you do not measure.
What You'll Learn in This Lesson
- How to profile CPU usage with cProfile and line_profiler
- How to measure memory usage with tracemalloc and memory_profiler
- How to find the real bottleneck in your code (it's rarely where you think)
- Practical optimisation techniques: caching, algorithm choice, data structures
- How to write benchmarks using timeit and interpret results correctly
- Production-level performance patterns used in real Python systems
🔥 1. Why Profiling Matters
Beginners try to "guess" what's slow.
Advanced developers measure what's slow.
| Approach | Method | Result |
|---|---|---|
| ❌ Guessing | "This loop looks slow" | Waste time optimising the wrong code |
| ✔ Profiling | Measure actual execution time | Find and fix real bottlenecks |
The 80/20 Rule:
20% of code → 80% of runtime
Optimising the wrong 80% gives no improvement!
⚙️ 2. Timing Functions with time.perf_counter()
For quick micro-benchmarks:
Timing with perf_counter
Quick micro-benchmarks
import time
start = time.perf_counter()
# code block
end = time.perf_counter()
print("Elapsed:", end - start)

Use this for comparing:
- two ways of looping
- two algorithms
- two function implementations
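The timeit module automates this repeat-and-measure pattern and avoids common clock pitfalls. A minimal sketch comparing a manual loop against a comprehension (the snippets and repeat counts are illustrative):

```python
import timeit

# Time 1,000 runs of each snippet
loop_time = timeit.timeit(
    "result = []\nfor x in range(1000):\n    result.append(x * x)",
    number=1_000,
)
comp_time = timeit.timeit(
    "[x * x for x in range(1000)]",
    number=1_000,
)

print(f"loop: {loop_time:.4f}s  comprehension: {comp_time:.4f}s")
```

Run each comparison several times — single measurements are noisy.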
But for full programs, we need real profilers.
🧠 3. Profiling With cProfile — The Standard Tool
Run a script with profiling from command line:
cProfile Command
Run a script with profiling
python -m cProfile myscript.py
Or profile a specific function in your code:
Profile a Function
Profile a specific function
import cProfile

def slow():
    # A loop that runs 5 million times
    for _ in range(5_000_000):
        pass

# Profile this specific function call
cProfile.run("slow()")
# Output shows: number of calls, total time, time per call

| Column | What It Shows |
|---|---|
| ncalls | Number of times function was called |
| tottime | Total time in this function (excluding subcalls) |
| cumtime | Cumulative time (including subcalls) |
📊 4. Making Results Readable With pstats
Readable Profiling Results
Use pstats to sort and filter
import cProfile, pstats

profiler = cProfile.Profile()
profiler.enable()

# code under measurement
for _ in range(3_000_000):
    pass

profiler.disable()

stats = pstats.Stats(profiler)
stats.sort_stats("tottime").print_stats(10)

| Sort By | What It Shows | Best For Finding |
|---|---|---|
| "tottime" | Time in function itself | The actual slow functions |
| "cumtime" | Time including all sub-calls | Functions that call slow things |
| "ncalls" | Number of times called | Unexpectedly hot loops |
Tip: Start with sort_stats("tottime") to find where time is actually spent, then use "cumtime" to trace back to which high-level functions are triggering the slow code.

🧠 5. Line-by-Line Profiling With line_profiler
Install:
Install line_profiler
Install the profiler
pip install line_profiler
Use:
Line Profiler Decorator
Mark functions for line-by-line profiling
@profile
def slow():
    total = 0
    for i in range(10_000_000):
        total += i

Run:
Run Line Profiler
Run and view results
kernprof -l myscript.py
python -m line_profiler myscript.py.lprof
Shows exactly which line is slow.
This is invaluable for:
- nested loops
- ML preprocessing
- tight functions
- recursive code
🧩 6. Memory Profiling
Install:
Install memory_profiler
Install the memory profiler
pip install memory_profiler
Usage:
Memory Profiler
Track memory usage line-by-line
from memory_profiler import profile

@profile
def load_items():
    items = [i for i in range(5_000_000)]
    return items

| Memory Issue | Symptom | Common Cause |
|---|---|---|
| Memory spike | Sudden +500MB on one line | Loading large dataset at once |
| Memory leak | Memory grows over time | Data accumulating in loops |
| High baseline | Program starts with 200MB+ | Heavy imports (pandas, tensorflow) |
Shows memory growth line-by-line.
Useful for:
- ✔ large lists
- ✔ pandas
- ✔ numpy allocations
- ✔ memory leaks
- ✔ generators vs lists performance
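The standard-library tracemalloc module (mentioned at the top of this lesson) needs no installation and can confirm these symptoms programmatically. A minimal sketch — the allocation below is just an illustrative stand-in for real workload data:

```python
import tracemalloc

tracemalloc.start()

# Stand-in for a real workload's allocations
items = [i for i in range(100_000)]

# Current and peak traced memory, in bytes
current, peak = tracemalloc.get_traced_memory()
print(f"Current: {current / 1024:.0f} KiB, peak: {peak / 1024:.0f} KiB")

# Show the top allocation sites, grouped by line
snapshot = tracemalloc.take_snapshot()
for stat in snapshot.statistics("lineno")[:3]:
    print(stat)

tracemalloc.stop()
```

Unlike memory_profiler, tracemalloc tracks every Python allocation, so it is also handy for hunting leaks across long-running processes.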
Tip: Release memory deliberately — del large objects when you're done with them, and use gc.collect() to force cleanup.

⚡ 7. Real Techniques for Faster Python
1. Use built-ins over manual loops
Built-ins vs Manual Loops
Built-ins use C-level optimisations
sum(values)   # faster: the loop runs in C

vs

total = 0
for x in values:   # slower: the loop runs in the interpreter
    total += x

Built-ins use C-level optimisations.
2. Prefer list comprehensions
List Comprehensions
Faster than explicit loops
[x*x for x in nums]   # faster

vs

result = []
for x in nums:
    result.append(x*x)

3. Use generators for large data
Generators
Saves huge amounts of memory
(x*x for x in nums)

This saves huge amounts of memory.
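You can see the difference with sys.getsizeof. A quick sketch (the sizes printed will vary by Python version and platform):

```python
import sys

nums = range(1_000_000)

squares_list = [x * x for x in nums]   # materialises every value up front
squares_gen = (x * x for x in nums)    # produces values one at a time

# The list holds a million object pointers; the generator is a tiny fixed-size object
print(sys.getsizeof(squares_list))
print(sys.getsizeof(squares_gen))
```

The trade-off: a generator can only be iterated once and doesn't support indexing or len().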
4. Use numpy for heavy math
Pure Python loops are slow.
NumPy performs operations in C — often 50–200× faster.
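A minimal sketch of the idea, evaluating a polynomial over a million values with one vectorised expression instead of a Python loop (assumes numpy is installed):

```python
import numpy as np

nums = np.arange(1_000_000, dtype=np.float64)

# One vectorised expression replaces a million-iteration Python loop;
# all the arithmetic happens in compiled C code
result = nums * nums + 2 * nums + 1

print(result[:3])  # [1. 4. 9.]
```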
5. Cache Results With functools.lru_cache
lru_cache
Cache results for instant performance
from functools import lru_cache

@lru_cache(maxsize=None)   # unbounded cache
def fib(n):
    if n < 2:
        return n
    return fib(n-1) + fib(n-2)

Transforms slow recursive functions → instant.
6. Use Multiprocessing for CPU
Multiprocessing
Run tasks on multiple cores
from multiprocessing import Pool

# The guard is required so worker processes don't re-run this block on import
if __name__ == "__main__":
    with Pool() as pool:
        results = pool.map(func, items)
Runs tasks on multiple cores.
🏎️ 8. Avoiding the Biggest Performance Mistakes
| ❌ Mistake | Why It's Slow | ✅ Better Approach |
|---|---|---|
| Unnecessary list copies | Copies entire list in memory | Use slices or itertools |
| Python loops for math | Interpreted = slow | NumPy vectorized operations |
| String concatenation in loop | Creates new string each time | Use ''.join(list) |
| Opening files repeatedly | Disk I/O is expensive | Open once, read/write many |
| Blocking I/O in async | Blocks the entire event loop | Use run_in_executor() |
Summary of Common Traps:
- ❌ Unnecessary list copies
- ❌ Using Python loops for math
- ❌ Excessive string concatenation
- ❌ Opening files repeatedly
- ❌ Overuse of classes when simple functions work
- ❌ Blocking I/O in async code
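For example, the string-concatenation trap and its fix can be sketched as:

```python
words = ["profile", "measure", "optimise"] * 1_000

# Slow: strings are immutable, so each += builds a brand-new string
text = ""
for w in words:
    text += w

# Fast: join computes the total length and allocates the result once
text2 = "".join(words)

assert text == text2
```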
🧪 9. Real-World Example: Speeding Up JSON Parsing
Slow version:
Slow JSON Parsing
Standard json module
import json
data = [json.loads(x) for x in lines]

Optimised:
Fast JSON Parsing
orjson is 5-20x faster
import orjson
data = [orjson.loads(x) for x in lines]

Requires: pip install orjson
orjson is 5–20× faster than Python's JSON parser.
🎉 Conclusion
By mastering profiling and optimisation, you gain the ability to:
✔ Identify real bottlenecks
✔ Build faster APIs and scripts
✔ Optimise ML preprocessing
✔ Save CPU & memory in production
✔ Think like a performance engineer
Performance comes from measure → diagnose → optimise, not guessing.
📋 Quick Reference — Profiling & Performance
| Tool / Syntax | What it does |
|---|---|
| cProfile.run('fn()') | Profile function call counts and time |
| timeit.timeit('expr', number=1000) | Benchmark small code snippets |
| line_profiler | Profile line-by-line execution time |
| memory_profiler | Track memory usage per line |
| __slots__ | Reduce class memory footprint |
🎉 Great work! You've completed this lesson.
You now know how to measure, diagnose, and fix performance bottlenecks — the professional workflow every senior engineer uses.
Up next: Memory Management — understand how Python's garbage collector works and prevent memory leaks.