## Memory Management in Python

Python manages memory automatically. CPython, the reference implementation, combines reference counting, a cyclic garbage collector, and a specialized allocator for small objects to handle allocation and deallocation.

### Key Concepts

1. **Memory Allocation and Deallocation**: Python's memory manager allocates objects on a private heap and releases them automatically, so you never call `malloc` or `free` yourself.

2. **Garbage Collection**: Memory that is no longer in use is reclaimed automatically. In CPython most objects are freed by reference counting as soon as they become unreachable; a supplementary cyclic garbage collector finds and frees groups of objects that only reference each other.

3. **Memory Leaks**: A memory leak occurs when memory that is no longer needed is not released. This can lead to increased memory usage and can slow down or crash a program.

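4. **Memory Pools**: CPython serves small objects (up to 512 bytes) from memory pools managed by its own allocator, `pymalloc`, instead of requesting each one from the operating system. This reduces fragmentation and improves performance.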

5. **Reference Counting**: Each object in Python maintains a count of references pointing to it. When the reference count drops to zero, the memory occupied by the object is deallocated.

6. **The gc Module**: The `gc` module provides an interface to the garbage collection facility. It allows for manual garbage collection and provides functions to interact with the garbage collector.
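
The memory-leak concept above can be observed directly. The following is a minimal sketch, assuming a hypothetical module-level cache (`_cache`) and request handler (`handle_request`) that are not part of the original notebook. It uses the standard-library `tracemalloc` module to show traced memory growing while references linger and shrinking once they are dropped.

```python
import tracemalloc

_cache = []  # hypothetical module-level cache that is never trimmed


def handle_request(size):
    """Hypothetical handler that keeps every buffer it creates."""
    _cache.append(bytearray(size))  # lingering reference: the "leak"


tracemalloc.start()
baseline, _ = tracemalloc.get_traced_memory()

for _ in range(100):
    handle_request(10_000)
grown, _ = tracemalloc.get_traced_memory()

_cache.clear()  # drop the references so the buffers can be freed
released, _ = tracemalloc.get_traced_memory()
tracemalloc.stop()

print(f"growth while cached: {grown - baseline} bytes")
print(f"freed after clear:   {grown - released} bytes")
```

In Python, a "leak" of this kind is usually not lost memory but a reference that is still reachable from somewhere, such as a cache, a global, or a registry.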

### Best Practices

- Use built-in data types and structures whenever possible, as they are optimized for performance.
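- Avoid creating circular references where you can. Reference counting alone cannot free them, so they stay in memory until the cyclic garbage collector runs.
- Use weak references (the `weakref` module) for caches and back-references, so that holding a reference does not by itself keep a large object alive.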
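
As an illustration of the weak-reference advice, here is a small sketch using `weakref.WeakValueDictionary` as a cache. The `Thumbnail` class and the cache key are hypothetical names chosen for the example. The immediate removal of the entry relies on CPython's reference counting; other implementations may remove it later.

```python
import weakref


class Thumbnail:
    """Hypothetical stand-in for an object that is expensive to rebuild."""

    def __init__(self, name):
        self.name = name


# Entries disappear automatically once nothing else references the value.
cache = weakref.WeakValueDictionary()

thumb = Thumbnail("logo")
cache["logo"] = thumb
cached_while_in_use = "logo" in cache

del thumb  # remove the only strong reference
cached_after_release = "logo" in cache

print(cached_while_in_use, cached_after_release)
```

The cache never extends the lifetime of its values, so it cannot be the reason an object stays in memory.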


In [1]:
## Reference Counting

import sys

a = []
print(sys.getrefcount(a))  # Output: 2 -> One for the variable 'a' and one for the argument passed to getrefcount()

2


In [2]:
b = a
print(sys.getrefcount(b))  # Output: 3 -> One for 'b', one for 'a', and one for the argument passed to getrefcount()

3


In [3]:
del b
print(sys.getrefcount(a))  # Output: 2 -> One for the variable 'a' and one for the argument passed to getrefcount()

2


In [4]:
## Garbage Collection

import gc

gc.enable()  # Enable automatic garbage collection

In [5]:
gc.disable()  # Disable automatic cyclic collection (reference counting still frees objects)

In [6]:
gc.collect()  # Manually run a full collection; returns the number of unreachable objects found

0

In [7]:
## Get garbage collection statistics

print(gc.get_stats())  # One dict per generation: collections run, objects collected, objects found uncollectable

[{'collections': 183, 'collected': 1287, 'uncollectable': 0}, {'collections': 16, 'collected': 205, 'uncollectable': 0}, {'collections': 2, 'collected': 74, 'uncollectable': 0}]
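
The three dictionaries in the output above correspond to the collector's three generations. As a rough sketch of how to inspect the collector further, the block below reads its thresholds and current counters alongside the same statistics. The exact numbers depend on the Python version and on what the interpreter has done so far, so no particular output is assumed.

```python
import gc

enabled = gc.isenabled()         # whether automatic collection is on
thresholds = gc.get_threshold()  # collection thresholds, one per generation
counts = gc.get_count()          # current collection counts, one per generation
stats = gc.get_stats()           # per-generation statistics since startup

print(f"automatic collection enabled: {enabled}")
print(f"thresholds: {thresholds}")
print(f"current counts: {counts}")
for generation, entry in enumerate(stats):
    print(
        f"generation {generation}: "
        f"{entry['collections']} collections, "
        f"{entry['collected']} objects collected"
    )
```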


## Memory Management Best Practices

1. **Use Built-in Data Types**: Python's built-in data types (like lists, dictionaries, sets) are implemented in C and are usually more memory-efficient than equivalent structures built from custom classes.

2. **Avoid Circular References**: Objects in a reference cycle are not freed by reference counting and stay in memory until the cyclic garbage collector runs. Use weak references (via the `weakref` module) to break cycles.

3. **Use Generators**: For large datasets, use generators instead of lists to save memory. Generators yield items one at a time and do not store the entire dataset in memory.

4. **Profile Memory Usage**: Use tools like the standard-library `tracemalloc` module or the third-party `memory_profiler` package to identify memory bottlenecks in your code.

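5. **Explicitly Delete Unused Objects**: `del` removes a name binding, not the object itself; the object is freed only once no references to it remain. Dropping references to large objects you no longer need is most useful in long-running applications.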

6. **Use Context Managers**: For managing resources (like file handles), use context managers (`with` statement) to ensure proper cleanup.

7. **Optimize Data Structures**: Choose the right data structure for your needs. For example, use tuples instead of lists for fixed collections of items, since a tuple is immutable and does not reserve spare capacity for growth.

8. **Limit Global Variables**: Objects referenced by global variables stay alive for as long as the module does, which can keep memory in use longer than necessary and makes code harder to understand. Limit their use and prefer function arguments.

9. **Use `__slots__` in Classes**: If you have a class with many instances, consider using `__slots__` to reduce memory overhead.

10. **Regularly Review and Refactor Code**: Periodically review your code for memory inefficiencies and refactor as needed.
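
To make the `__slots__` recommendation concrete, the sketch below compares a regular class with a slotted one. `PointDict` and `PointSlots` are hypothetical classes written for this comparison. Note that `sys.getsizeof` is shallow, so the per-instance `__dict__` is added in by hand; the exact byte counts vary between Python versions.

```python
import sys


class PointDict:
    """Regular class: each instance carries a per-instance __dict__."""

    def __init__(self, x, y):
        self.x = x
        self.y = y


class PointSlots:
    """Slotted class: attributes live in fixed slots, with no __dict__."""

    __slots__ = ("x", "y")

    def __init__(self, x, y):
        self.x = x
        self.y = y


plain = PointDict(1, 2)
slotted = PointSlots(1, 2)

# getsizeof does not follow references, so count the instance dict explicitly.
plain_size = sys.getsizeof(plain) + sys.getsizeof(plain.__dict__)
slotted_size = sys.getsizeof(slotted)
has_dict = hasattr(slotted, "__dict__")

print(f"regular instance + __dict__: {plain_size} bytes")
print(f"slotted instance:            {slotted_size} bytes")
print(f"slotted instance has __dict__: {has_dict}")
```

The trade-off is flexibility: a slotted instance cannot gain attributes that are not listed in `__slots__`.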

In [8]:
import gc

class MyObject:
    def __init__(self, name):
        self.name = name
        print(f"Object {self.name} created")

    def __del__(self):
        print(f"Object {self.name} deleted")
    

In [9]:
obj1 = MyObject("obj1")
obj2 = MyObject("obj2")

## Create circular references
obj1.ref = obj2
obj2.ref = obj1

Object obj1 created
Object obj2 created


In [10]:
del obj1
del obj2

## Manually trigger garbage collection
gc.collect()

Object obj1 deleted
Object obj2 deleted


2
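
The cycle above was only reclaimed once `gc.collect()` ran. A possible alternative, sketched here with a hypothetical `Node` class, is to store the back-reference as a weak reference so that no cycle forms in the first place. The immediate cleanup after `del` assumes CPython's reference counting.

```python
import weakref


class Node:
    """Hypothetical tree node whose parent link is a weak reference."""

    def __init__(self, name):
        self.name = name
        self.children = []
        self._parent = None  # set to a weakref.ref by add_child

    @property
    def parent(self):
        # Calling the weak reference returns the parent, or None if it is gone.
        return self._parent() if self._parent is not None else None

    def add_child(self, child):
        self.children.append(child)          # strong reference: parent -> child
        child._parent = weakref.ref(self)    # weak reference: child -> parent


root = Node("root")
leaf = Node("leaf")
root.add_child(leaf)
parent_name = leaf.parent.name

probe = weakref.ref(root)  # lets us observe when root is freed
del root                   # no cycle, so no gc.collect() is needed
root_freed = probe() is None

print(parent_name, root_freed, leaf.parent)
```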

In [11]:
## Generators For Memory Efficiency

def generate_numbers(n):
    for i in range(n):
        yield i
        
## Using the generator
for num in generate_numbers(1000):
    print(num)
    if num > 10:
        break


0
1
2
3
4
5
6
7
8
9
10
11
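
To put a rough number on the saving, the following sketch compares the size of a fully built list with the size of an equivalent generator object. The variable names are illustrative, and `sys.getsizeof` reports only the container itself, not the integers it refers to, so the real difference is larger than shown.

```python
import sys

n = 100_000

squares_list = [i * i for i in range(n)]  # all values stored at once
squares_gen = (i * i for i in range(n))   # values produced on demand

list_size = sys.getsizeof(squares_list)
gen_size = sys.getsizeof(squares_gen)

print(f"list object:      {list_size} bytes")
print(f"generator object: {gen_size} bytes")

# Both produce the same values; the generator can be consumed only once.
total = sum(squares_gen)
print(total == sum(squares_list))
```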


In [12]:
## Profiling Memory Usage with `tracemalloc`

import tracemalloc
# tracemalloc records where each traced memory block was allocated

def create_list():
    return [i for i in range(10000)]

def main():
    tracemalloc.start()
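    create_list()  # The result is discarded, so the list is freed before the snapshot and does not appear in the statistics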
    snapshot = tracemalloc.take_snapshot()  # Take a snapshot of the current memory usage
    top_stats = snapshot.statistics('lineno')  # Get statistics by line number

    print("[ Top 10 Memory Usage ]")
    for stat in top_stats[:10]:
        print(stat)

main()

[ Top 10 Memory Usage ]
/var/folders/hd/3zq3znjx2xn4yscbm_651t2r0000gn/T/ipykernel_13709/3841724893.py:6: size=72 B, count=1, average=72 B
/opt/miniconda3/lib/python3.12/tracemalloc.py:551: size=72 B, count=1, average=72 B
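
The output above shows only a few bytes because the list returned by `create_list()` was discarded before the snapshot was taken. As a sketch of one way to measure the allocation itself, the variant below keeps a reference to the list and compares two snapshots with `Snapshot.compare_to`. The variable names are illustrative, and the reported sizes depend on the Python version.

```python
import tracemalloc


def create_list():
    return [i for i in range(10000)]


tracemalloc.start()
before = tracemalloc.take_snapshot()

data = create_list()  # keep a reference so the list is alive at snapshot time

after = tracemalloc.take_snapshot()
tracemalloc.stop()

# Differences between the snapshots, grouped by source line.
diffs = after.compare_to(before, "lineno")
total_growth = sum(diff.size_diff for diff in diffs)

print(f"total growth between snapshots: {total_growth} bytes")
print("[ Top 3 differences ]")
for diff in diffs[:3]:
    print(diff)
```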
