Lesson 25 • Advanced
Memory Management & Garbage Collection Internals
Python may look simple on the surface, but underneath it runs a powerful and complex memory management system. To write high-performance Python — whether you're building ML pipelines, backend servers, or tools that process millions of objects — you need to understand how Python allocates and frees memory: reference counting, garbage collection cycles, memory fragmentation, and how to track leaks and optimise memory usage.
What You'll Learn in This Lesson
- How Python allocates memory at the object level using PyMalloc
- How reference counting works and when it fails (circular refs)
- How the cyclic garbage collector detects and breaks reference cycles
- How to detect memory leaks using tracemalloc and objgraph
- How __slots__ reduces per-instance memory by 40–70%
- How weak references prevent memory leaks in caches and observers
🔥 1. How Python Allocates Memory
Python uses a private memory manager (PyMalloc) layered on top of the OS allocator.
There are three layers:
| Layer | What It Does | Speed |
|---|---|---|
| Object-specific allocators | Custom optimized allocators for ints, lists, dicts, strings | ⚡ Fastest |
| Python memory manager | Handles small object pools, caches freed memory | ⚡ Fast |
| OS-level allocator | malloc(), free() — used for large blocks | 🐢 Slow |
Python tries to avoid calling the OS too often, because OS allocations are slow.
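As a minimal sketch of where the layers split: in CPython, requests up to about 512 bytes go through PyMalloc's pooled allocator, while larger blocks fall through to the OS allocator (this threshold is an implementation detail, not part of the language spec). We can't call PyMalloc directly, but `sys.getsizeof` shows which side of the line an object lands on:

```python
import sys

# Small objects (<= ~512 bytes) are served from pymalloc's pools;
# larger ones go straight to the OS-level allocator.
small = (1, 2, 3)            # a small tuple: pooled by pymalloc
large = bytearray(100_000)   # far over the threshold: OS allocation

print(sys.getsizeof(small) <= 512)  # True
print(sys.getsizeof(large) > 512)   # True
```

This is why creating and destroying many small objects is cheap: the freed blocks stay in Python's pools and are handed back out without touching the OS.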
⚙️ 2. Reference Counting — The Core Mechanism
Every Python object has an internal counter: how many references point to it.
You can inspect it:
Reference Counting
Inspect object reference counts with sys.getrefcount():

```python
import sys

x = []
print(sys.getrefcount(x))  # at least 2: 'x' plus the temporary argument reference

# Whenever you do:
y = x
# ...the refcount increases.

# Whenever a reference goes away:
del y
# ...the refcount decreases.
```

When it reaches 0, Python immediately frees the memory.
| Action | Effect on Refcount | Example |
|---|---|---|
| Create object | +1 | x = [] |
| Assign to another variable | +1 | y = x |
| Delete reference | -1 | del y |
| Leave function scope | -1 | Local variables cleaned up |
Reference counting gives:
- ✔ deterministic cleanup
- ✔ predictable object lifetime
- ✔ fast destruction
But it has one big problem…
🧠 3. The Problem: Reference Cycles
Example:
Reference Cycles
Understand how circular references prevent garbage collection:

```python
a = []
b = []
a.append(b)
b.append(a)

# Both objects reference each other.
# Even after `del a` and `del b`, their refcounts never reach 0.
```

🌀 4. Garbage Collection for Cycles
Python's cyclic garbage collector scans container objects (lists, dicts, sets, classes) to find reference cycles.
| Generation | Contains | Checked |
|---|---|---|
| Gen 0 | Newest objects | Most frequently |
| Gen 1 | Survived 1+ collections | Less often |
| Gen 2 | Long-lived objects | Rarely |
The cycle detector periodically:
- Scans a generation
- Finds unreachable cycles
- Frees them
This prevents memory leaks caused by circular references.
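The cycle collection described above can be observed directly. This sketch builds the cycle from section 3 inside a small helper class and uses a weak reference as a probe to watch the collector free it (the `Node` class is just for illustration):

```python
import gc
import weakref

class Node:
    def __init__(self):
        self.other = None

# Build a reference cycle between two instances:
a, b = Node(), Node()
a.other, b.other = b, a

probe = weakref.ref(a)  # lets us observe when 'a' is actually freed

del a, b        # refcounts never hit 0 -- the cycle keeps both alive
gc.collect()    # the cycle detector finds and frees the unreachable pair

print(probe() is None)  # True: the object is gone after collection
```

Without the `gc.collect()` call (or an automatic collection), the pair would linger: reference counting alone cannot free them.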
⚡ 5. Viewing & Controlling the GC
You can inspect thresholds:
Controlling Garbage Collection
Inspect and control the garbage collector:

```python
import gc

print(gc.get_threshold())
# Typical default: (700, 10, 10)
# Meaning:
# - collect gen0 once container allocations minus deallocations exceed 700
# - collect gen1 every 10 gen0 collections
# - collect gen2 every 10 gen1 collections

# Force a collection:
gc.collect()

# Disable automatic GC (not recommended unless profiling):
gc.disable()
```

📦 6. Memory Fragmentation
Python memory isn't always "returned" to the OS immediately.
| Reason | What Happens |
|---|---|
| Freed blocks stay in pools | Python keeps them for reuse |
| Partially used arenas | OS can't reclaim until completely empty |
| Long-lived objects | Create "holes" in memory |
| Extension modules | Allocate outside Python's control |
This is why a Python process may appear large even after freeing objects.
Tools like Heapy, tracemalloc, and Pympler help inspect fragmentation.
🧪 7. Detecting Memory Leaks
Python leaks often happen because:
- ❌ lingering references
- ❌ global caches
- ❌ closures holding variables
- ❌ large lists still in scope
- ❌ unclosed file handles
- ❌ cycles involving custom classes
Using tracemalloc:

Using tracemalloc
Track memory allocations to find leaks:

```python
import tracemalloc

tracemalloc.start()
# ... run the code you want to measure ...
snapshot = tracemalloc.take_snapshot()
top = snapshot.statistics('lineno')
for item in top[:5]:
    print(item)
# Shows exactly where memory is being allocated.
```

Using objgraph:

Using objgraph
Visualize object growth to detect leaks:

```python
# pip install objgraph
import objgraph

objgraph.show_growth()
# Prints object types whose counts grew since the last call --
# helpful for finding leaking objects over time.
```

🧩 8. Efficient Memory Techniques
✅ Prefer generators over lists

Generators vs Lists
Use generators to reduce memory usage:

```python
# A generator expression computes values lazily:
nums = range(1_000_000)
gen = (x * x for x in nums)

# Instead of materialising everything at once:
# lst = [x*x for x in nums]

# The generator holds one item at a time, so it uses far less memory.
print(next(gen))  # 0
print(next(gen))  # 1
```

✅ Use __slots__ for large class collections
Using __slots__
Reduce memory footprint of class instances:

```python
class Point:
    __slots__ = ("x", "y")  # fixed attribute set, no per-instance __dict__

    def __init__(self, x, y):
        self.x = x
        self.y = y

# Saves memory by removing the per-object dict.
p = Point(1, 2)
print(p.x, p.y)  # 1 2
```

✅ Avoid unnecessary large structures
Replace:
- huge dictionaries → array, struct, or NumPy
- nested lists → NumPy arrays
- long string concatenation → io.StringIO
✅ Reuse objects when possible
Allocate once and reuse, instead of allocating repeatedly inside loops.
✅ Clean up large variables manually
Manual Cleanup
Explicitly free large objects and run garbage collection:

```python
import gc

big_object = [i for i in range(1_000_000)]
# ... use big_object ...

del big_object
gc.collect()
# Useful between stages in ML/data pipelines.
print("Memory cleaned")
```

🔥 9. Memory & Speed Tradeoffs
Optimising memory may reduce speed, and vice versa.
Examples:
- Lists: faster, more memory
- Generators: less memory, slower iteration
- C extensions: ultra fast, but limited flexibility
Your optimisation depends on whether the bottleneck is:
- RAM usage
- CPU speed
- I/O wait
- GC pauses
Profiling tells you which.
🔥 10. How Python Stores Objects in Memory (Deep Internal View)
Every Python object is stored in a PyObject structure that contains:
- Reference count
- Type pointer
- Object data
But different types store additional metadata:
✔ Integers
Small integers (from –5 to 256) are pre-allocated and reused → "integer cache".
This means:
Integer Caching
Understand Python's small integer cache:

```python
a = 5
b = 5
# Both names point to the same cached object:
print(a is b)  # True

# Larger integers are usually separate objects:
c = 500
d = 500
print(c is d)  # May be False (implementation detail; never use `is` for numbers)
```

✔ Strings
Short strings and identifiers are interned (cached forever) to speed up comparisons.
Used heavily in:
- variable names
- dictionary keys
- tokens in parsers
✔ Lists
Python lists are dynamic arrays with over-allocation (extra capacity) to avoid constant resizing.
Illustrative example:
- Before append: capacity 10
- After the append that overflows: capacity ~14 (extra slots reserved)
This saves CPU time but increases memory use.
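You can watch the over-allocation happen with `sys.getsizeof`: the reported size stays flat while spare capacity is used up, then jumps at each resize (exact numbers vary by CPython version):

```python
import sys

lst = []
sizes = []
for i in range(20):
    lst.append(i)
    sizes.append(sys.getsizeof(lst))

# Flat runs between jumps show appends that reused spare capacity:
print(sizes)
print(len(set(sizes)) < len(sizes))  # True: sizes repeat between resizes
```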
✔ Dictionaries
Use open addressing with "sparse tables".
They resize when the load factor gets too high (~66%).
🧬 11. Arena Allocation (The Deepest Python Memory Detail)
CPython allocates memory in units:
Arenas → Pools → Blocks
| Layer | Size | Purpose |
|---|---|---|
| Arena | ~256 KB | Large chunk from OS |
| Pool | 4 KB | For objects of same size |
| Block | variable | Individual object |
This explains two things:
- Python rarely returns memory to the OS: even if an object is deleted, its arena stays allocated.
- Long-running servers keep growing in memory: an arena is not released until every block in it is free.
🧠 12. Why Lists & Dicts "Grow" in Memory
Python over-allocates:
List:
When you append items, the list grows faster than needed.
Example internal growth pattern:
0 → 4 → 8 → 16 → 25 → 35 → 46 → ...
Dict:
When keys increase, it resizes to maintain fast O(1) access.
This resizing:
- ✔ improves speed
- ❌ increases memory footprint
Understanding this helps you design efficient data structures.
🧨 13. Object Lifetimes — From Creation to Deallocation
1. Object created → refcount = 1
2. More references → refcount increases
3. All references dropped → refcount becomes 0
4. Python immediately frees the object's memory
5. The freed memory may stay inside the arena
6. The cyclic GC occasionally clears unreachable cycles
This means:
Python is deterministic for most objects…
…but not for cycles.
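The deterministic case can be demonstrated with a weak reference as a probe (the `Resource` class is just a placeholder for illustration). Note that the "freed immediately" behaviour is specific to CPython's reference counting; other implementations may defer it:

```python
import weakref

class Resource:
    pass

r = Resource()
probe = weakref.ref(r)  # observes the object without keeping it alive

print(probe() is r)   # True: the object is alive

del r                 # the last strong reference is gone: refcount hits 0
print(probe() is None)  # True on CPython: freed immediately, no GC pass needed
```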
🔍 14. Memory Leak Patterns in Real Python Code
Here are the 7 most common memory leak patterns seen in production:
1. Growing global lists

Growing Global Lists
A common memory leak pattern:

```python
cache = []

def add(x):
    cache.append(x)  # this module-level cache grows forever!

for i in range(100):
    add(i)
print(len(cache))  # 100
```

2. Caches that never expire (Flask, Django, ML models)
3. Closures capturing large objects

Closure Memory Trap
Closures can keep large objects alive:

```python
def make():
    big = [1] * 1_000_000

    def inner():
        return len(big)  # 'big' is kept alive by the closure!

    return inner

fn = make()
print(fn())  # 1000000
```

4. Referencing objects inside loops unintentionally
5. Pandas DataFrames not deleted
6. Open file handles never closed
7. Cycles between class instances
🧪 15. Real-World Debugging — Finding a Leak in a Web Server
Imagine you run a FastAPI app, and memory keeps rising.
Step 1: Enable tracemalloc
Start tracking memory allocations:

```python
import tracemalloc

tracemalloc.start()
print("tracemalloc started")
```

Step 2: Snapshot
Step 2: Take Snapshot
Capture memory state at a point in time:

```python
import tracemalloc

tracemalloc.start()
# ... run some code ...
s1 = tracemalloc.take_snapshot()
print("Snapshot taken")
```

Step 3: Compare after operations
Step 3: Compare Snapshots
Find what's consuming memory:

```python
import tracemalloc

tracemalloc.start()

# Initial state
s1 = tracemalloc.take_snapshot()

# Simulate some allocations
data = [i for i in range(100_000)]

# New state
s2 = tracemalloc.take_snapshot()
stats = s2.compare_to(s1, "lineno")
print(stats[:3])
```

You instantly see:
- ✔ which file
- ✔ which line
- ✔ how much leaked
Step 4: Identify dangling references
Using objgraph:

Step 4: Find Dangling References
Use objgraph to visualize reference chains:

```python
# pip install objgraph
import objgraph

objgraph.show_most_common_types()

# To see what's holding a reference to a specific object:
# objgraph.show_backrefs(target_object)
```

This reveals why something never got garbage-collected.
⚡ 16. Avoiding Fragmentation in Large Applications
Memory fragmentation is a silent killer for long-running apps.
Techniques used by big companies:
- ✔ Restart worker processes periodically (Gunicorn / Celery)
- ✔ Keep objects small
- ✔ Reuse buffers
- ✔ Pre-allocate large structures
- ✔ Move heavy data to NumPy arrays
- ✔ Use memory pools (custom allocators)
- ✔ Offload work to Rust/C for stable memory control
Major apps like Instagram and Dropbox use multi-process setups for this exact reason.
📦 17. Working With Huge Data Without Crashing RAM
If you process:
- a 10 GB CSV
- 50 million database rows
- a million images
- huge logs

...you must avoid loading everything at once.
Use chunking

File Chunking
Read large files in chunks to save memory:

```python
def read_chunks(file, chunk=1024):
    while True:
        data = file.read(chunk)
        if not data:
            break
        yield data

# Usage:
# with open("bigfile.txt") as f:
#     for chunk in read_chunks(f):
#         process(chunk)
```

Use generators
Generator Streaming
Process data one item at a time:

```python
def stream_data():
    for i in range(1_000_000):
        yield {"id": i, "value": i * 2}

# Process one row at a time; nothing is materialised up front:
for row in stream_data():
    if row["id"] >= 5:
        break  # stop early: the generator never builds the full list
    print(row)
```

Use mmap
Memory-map files to avoid RAM explosion.
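A minimal sketch with the standard library's mmap module. The file name and contents are invented for the example; the point is that the OS pages bytes in on demand, so slicing reads only what you touch instead of copying the whole file into a Python object:

```python
import mmap
import os
import tempfile

# Create a small sample file to map (stand-in for a huge real file):
path = os.path.join(tempfile.mkdtemp(), "big.bin")
with open(path, "wb") as f:
    f.write(b"hello mmap world")

with open(path, "rb") as f:
    with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
        print(mm[:5])            # b'hello': only these bytes are read
        print(mm.find(b"mmap"))  # 6: search without loading the whole file
```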
🔧 18. Advanced Optimisation Tools
🟧 1. Cython
Compiles Python code to C → 10×–200× speed + fixed memory layout.
🟩 2. Numba
JIT compiler for numeric loops.
🟦 3. PyPy
Alternative Python interpreter with a fast JIT.
Great for long-running loops.
🟪 4. Mypyc
Compiles typed Python into C extensions.
📊 19. Memory & Performance Profiling Workflow (Professional Method)
Here's the exact workflow used in production:
1. Identify whether the bottleneck is CPU or RAM: use psutil, htop, profiling.
2. Profile CPU: cProfile, line_profiler.
3. Profile memory: tracemalloc, Pympler, Heapy.
4. Check GC behaviour: too many collections? Too few?
5. Find reference cycles: gc.get_objects(), objgraph.
6. Fix or rewrite the hotspot: use NumPy/Numba/Rust if needed.
7. Benchmark again: verify the improvement.
This is the method used by performance engineers at scale.
🧠 20. Python Memory Myths (Corrected)
❌ Myth: Python returns memory to OS when freed
✔ Truth: Almost never — only FULL arenas are returned.
❌ Myth: Garbage collection slows Python
✔ Truth: GC rarely runs unless many container objects are created.
❌ Myth: Variables disappear after function exit
✔ Truth: Closures, globals, caches can keep them alive forever.
❌ Myth: Generators are slower than lists
✔ Truth: They are massively more memory-efficient and usually faster for pipelines.
🎓 21. Final Summary of Python Memory Mastery
You now understand:
- ✔ PyMalloc
- ✔ Object reference counting
- ✔ Garbage collector generations
- ✔ Memory fragmentation
- ✔ Cycles + leak detection
- ✔ Efficient memory coding
- ✔ NumPy vs lists
- ✔ Slots & reusable objects
- ✔ Profiling tools
- ✔ Large-scale memory engineering
This knowledge puts you way above normal Python developers — this is senior-level backend engineer understanding.
🔥 Practical Engineering Summary
How Python Actually Manages Memory (High-Level Recap)
Python uses a multi-layered system:
1. Reference Counting: every object tracks how many references point to it. When the counter hits zero, memory is freed instantly.
2. Garbage Collector (GC): handles cycles (objects referencing each other). Works in three generations, promoting "older" objects that survive.
3. PyMalloc: an internal allocator designed to reduce fragmentation, reuse freed memory, and avoid expensive OS calls.
4. OS Allocator: used only for large blocks. Python returns memory to the OS only when a full arena is unused.
🧠 The Biggest Causes of Memory Problems in Real Systems
1. Reference cycles: especially between custom class instances.
2. Containers that never shrink: lists, dicts, and sets can grow endlessly if not managed.
3. Hidden references: closures, globals, long-lived objects.
4. Fragmentation: Python pools memory and often cannot release it back to the OS.
5. Large objects kept alive accidentally: Pandas DataFrames, NumPy arrays, big lists.
6. Not streaming data: loading 5 GB into RAM instead of processing in chunks.
7. Unclosed resources: sockets, file handles, DB connections.
These are the real culprits when you see "Python memory leak".
⚙️ Practical Checklist for Writing Memory-Safe Python
Here is what senior engineers follow:
- ✔ Use generators for large data: avoid loading huge datasets at once
- ✔ Avoid unnecessary copies: slice carefully, avoid converting between structures
- ✔ Prefer NumPy for math-heavy work: lists of lists are slow and heavy
- ✔ Clear large structures manually: del big_list; gc.collect()
- ✔ Use context managers for resources: files, locks, DB sessions
- ✔ Avoid unbounded in-memory caches: use TTL-based caching (Redis, an LRU cache)
- ✔ Beware of closures keeping unneeded variables: a common memory trap
- ✔ Monitor memory over time: especially in long-running backend services
- ✔ Restart worker processes in production: Gunicorn, Celery, and Uvicorn workers are often auto-restarted to reclaim memory
This checklist alone prevents 90% of real-world problems.
🔥 Ultimate Takeaways (The "If You Remember Only 10 Things…" List)
Memorise this list — it's the essence of professional Python memory engineering:
1. Reference counting frees most objects instantly.
2. GC only handles cycles — not everything.
3. Python rarely returns memory to the OS.
4. Lists/dicts grow but do not shrink automatically.
5. Fragmentation is normal — not a bug.
6. Profiling > guessing. Always measure first.
7. Generators prevent RAM explosion.
8. NumPy is mandatory for large numeric workloads.
9. Unclosed resources cause real leaks.
10. Long-running apps must recycle workers.
If you follow these principles, you'll never struggle with memory issues again.
🎉 Final Conclusion — You Now Understand Memory Like a Senior Engineer
By completing all parts, you now understand:
- ✔ Python's internal allocator
- ✔ Reference counting
- ✔ Garbage collection
- ✔ Fragmentation
- ✔ Cycles
- ✔ Object lifetime
- ✔ Memory profiling
- ✔ Leak detection
- ✔ Optimisation techniques
- ✔ High-scale memory architecture
This is deep Python internals knowledge that most developers never learn.
With this mastery, you're ready to:
- 🔥 write high-performance code
- 🔥 build scalable backends
- 🔥 optimise ML pipelines
- 🔥 debug memory like a professional
- 🔥 build fast, efficient apps and systems
📋 Quick Reference — Memory Management
| Concept / Tool | What it does |
|---|---|
| sys.getrefcount(obj) | Check reference count of an object |
| gc.collect() | Manually trigger garbage collection |
| weakref.ref(obj) | Hold reference without preventing GC |
| __slots__ | Reduce per-instance memory overhead |
| tracemalloc | Trace memory allocations |
🎉 Great work! You've completed this lesson.
You now understand reference counting, the garbage collector, and how to avoid memory leaks in long-running Python programs.
Up next: Type Hints — add static typing to Python code for better tooling and fewer bugs.