<a href="https://colab.research.google.com/github/Suruchi264/NLP-DL-ML/blob/main/Memory_Management.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

***PYTHON MEMORY MANAGEMENT***

Memory management in Python combines reference counting, automatic garbage collection of reference cycles, and various internal optimizations to allocate and deallocate memory efficiently. Understanding these mechanisms can help developers write more efficient and robust applications.




1.   Key Concepts in Python Memory Management
2.   Memory Allocation and Deallocation
3.   Reference Counting
4.   Garbage Collection
5.   The gc Module
6.   Memory Management Best Practices

***REFERENCE COUNTING***

Reference counting is the primary method Python uses to manage memory. Each object in Python maintains a count of the references pointing to it. When the reference count drops to zero, the memory occupied by the object is deallocated.
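The rule above can be observed directly. The following is a minimal sketch, assuming CPython (other interpreters may free objects later); the class name `Tracked` and the `events` list are hypothetical names used only for illustration. `weakref.finalize` registers a callback that runs when the object is deallocated.

```python
import weakref

class Tracked:
    """Hypothetical plain class; its instances support weak references."""

events = []

obj = Tracked()
# Run events.append("deallocated") at the moment obj is deallocated
weakref.finalize(obj, events.append, "deallocated")

alias = obj                    # a second reference to the same object
del obj                        # one reference (alias) remains, so the object survives
still_alive = (events == [])

del alias                      # last reference removed: the count reaches zero
freed_now = (events == ["deallocated"])

print(still_alive, freed_now)  # True True on CPython
```

The callback fires only after the last name is removed, which is the point at which the count reaches zero.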

In [None]:
### Reference Counting

import sys

a = []
## 2 (one reference from 'a' and one from getrefcount(a))
print(sys.getrefcount(a))

2


In [None]:
b=a
print(sys.getrefcount(b))


# When you call sys.getrefcount(b), the argument passed to the function is itself a temporary reference to the object that b points to. This temporary reference exists only for the duration of the function call.

# So, in a fresh CPython 3.11 session, the three references are:

# The variable a
# The variable b
# The argument passed to sys.getrefcount() (the temporary reference within the function call)

# The recorded output below is higher than 3, most likely because earlier cell runs in the same notebook session left extra references to the list. The exact number can also vary with the Python version.
# The key takeaway is that sys.getrefcount() is primarily useful for understanding relative changes in reference counts rather than getting an absolute, precise count that directly reflects just the variables you've defined.


5


In [None]:
del b
print(sys.getrefcount(a))

# Before del b was executed, the list object was referenced by:

# a
# b
# Any references left over from earlier cell runs in this session.

# When you execute del b, you remove the variable name b, not the object. The object that b was referencing still exists as long as there are other references to it.

# When you then call print(sys.getrefcount(a)):

# The variable a still references the list object.
# A new temporary reference is created for the argument of this new sys.getrefcount() call.
# The temporary reference from the earlier sys.getrefcount(b) call is already gone: it was released as soon as that call returned.

# The count should therefore be exactly one lower than it was immediately before del b. In a fresh CPython 3.11 session that means 3 before and 2 after. Recorded notebook outputs can differ when cells are re-run or executed out of order.

# Again, the most important thing to understand is the relative change in reference counts as you add and remove references, rather than relying on sys.getrefcount() for a precise absolute count in all scenarios.

5


In [None]:
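## remove b if it still exists; a second plain `del b` raises NameError once b is gone
globals().pop("b", None)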

In [None]:
print(sys.getrefcount(a))

4
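Because absolute counts depend on the session, comparing against a baseline is more reliable. The sketch below assumes CPython; the names `data` and `holder` are illustrative. It measures the change in the count as references are added and removed.

```python
import sys

data = []
baseline = sys.getrefcount(data)      # absolute value depends on the environment

holder = [data, data]                 # store two new references in a list
after_add = sys.getrefcount(data)

holder.clear()                        # drop both references again
after_clear = sys.getrefcount(data)

print(after_add - baseline)           # 2
print(after_clear - baseline)         # 0
```

The differences stay the same regardless of how many other references the session happens to hold.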


***GARBAGE COLLECTION***

Python includes a cyclic garbage collector to handle reference cycles. Reference cycles occur when objects reference each other, directly or through a chain of other objects, so their reference counts never reach zero even when nothing else in the program refers to them.
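To see why reference counting alone is not enough, here is a small sketch, assuming CPython. The class `Node` and the attribute `partner` are hypothetical names. A weak reference is used as a probe because it does not keep the object alive.

```python
import gc
import weakref

class Node:
    """Hypothetical class used to build a two-object cycle."""

gc.disable()                      # keep the automatic collector out of the way

first = Node()
second = Node()
first.partner = second            # first -> second
second.partner = first            # second -> first (cycle)

probe = weakref.ref(first)        # weak reference: does not affect the count
del first, second                 # names are gone, but the cycle remains

alive_before = probe() is not None
found = gc.collect()              # the cyclic collector detects and frees the cycle
alive_after = probe() is not None

gc.enable()                       # restore automatic collection

print(alive_before, alive_after)  # True False on CPython
```

After both names are deleted the objects are unreachable, yet they stay in memory until the collector runs.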

In [None]:
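import gc
## enable automatic garbage collection (it is on by default)
gc.enable()

In [None]:
## disable automatic garbage collection; manual gc.collect() calls still work
gc.disable()

In [None]:
## run a full collection now; returns the number of unreachable objects found
gc.collect()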

1649

In [None]:
### get garbage collection stats
print(gc.get_stats())

[{'collections': 713, 'collected': 6704, 'uncollectable': 0}, {'collections': 64, 'collected': 2753, 'uncollectable': 0}, {'collections': 6, 'collected': 1731, 'uncollectable': 0}]


In [None]:
## objects the collector found unreachable but could not free (normally empty)
print(gc.garbage)

[]
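The statistics printed above are easier to read one generation at a time. This is a small sketch using only functions from the standard `gc` module; the variable names are illustrative, and the numbers will differ on every run.

```python
import gc

# One dict per generation, from youngest (0) to oldest (2)
generation_stats = gc.get_stats()
for generation, stats in enumerate(generation_stats):
    print(
        f"generation {generation}: "
        f"collections={stats['collections']}, "
        f"collected={stats['collected']}, "
        f"uncollectable={stats['uncollectable']}"
    )

# Thresholds that control how often each generation is collected automatically
thresholds = gc.get_threshold()
print("thresholds:", thresholds)

# Whether the automatic collector is currently switched on
print("automatic collection enabled:", gc.isenabled())
```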


***MEMORY MANAGEMENT BEST PRACTICES***



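1.   **Use Local Variables**: Local variables are released when their function returns (unless something else still references the object), so they are typically freed sooner than global variables, which live until the program exits or the name is deleted.


2.   **Avoid Circular References**: Objects in a reference cycle are not freed by reference counting alone. They stay in memory until the cyclic garbage collector runs, and can leak if the collector is disabled.


3.   **Use Generators**: Generators produce items one at a time and keep only the current item in memory, which makes them memory efficient for large sequences.


4.   **Explicitly Delete Objects**: Use the del statement to remove names you no longer need. The object itself is freed once no references to it remain.


5.   **Profile Memory Usage**: Use memory profiling tools like tracemalloc and memory_profiler to identify memory leaks and optimize memory usage.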
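Items 4 and 5 can be combined in one short experiment. The sketch below assumes CPython and uses `tracemalloc.get_traced_memory()` to measure how much traced memory is held before and after a `del`. The variable names and the list size are arbitrary choices for illustration.

```python
import tracemalloc

tracemalloc.start()
before, _ = tracemalloc.get_traced_memory()   # (current, peak) in bytes

data = [str(i) for i in range(50_000)]        # allocate a list of strings
held, _ = tracemalloc.get_traced_memory()

del data                                      # last reference removed, memory released
after, _ = tracemalloc.get_traced_memory()
tracemalloc.stop()

print(f"allocated by the list: {held - before} bytes")
print(f"released by del:       {held - after} bytes")
```

The exact byte counts vary between Python versions, so only the direction of the change is meaningful here.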

In [1]:
## Handling circular references
import gc

class MyObject:
    def __init__(self,name):
        self.name = name
        print(f"Object {self.name} created")

    def __del__(self):
        print(f"Object {self.name} deleted")

## create circular reference
obj1 = MyObject("obj1")
obj2 = MyObject("obj2")
obj1.ref = obj2
obj2.ref = obj1

del obj1
del obj2

## Manually trigger garbage collection; the return value is the number of unreachable objects found
gc.collect()

Object obj1 created
Object obj2 created
Object obj1 deleted
Object obj2 deleted


2
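An alternative to relying on the collector is to avoid creating the cycle in the first place. One common approach, sketched here under the assumption of CPython, is to make the back-reference weak with the standard `weakref` module. The `Parent` and `Child` classes are hypothetical.

```python
import weakref

class Parent:
    def __init__(self):
        self.children = []            # strong references: parent keeps children alive

class Child:
    def __init__(self, parent):
        # Weak back-reference: does not keep the parent alive, so no cycle forms
        self._parent = weakref.ref(parent)
        parent.children.append(self)

    @property
    def parent(self):
        return self._parent()         # None once the parent has been freed

parent = Parent()
child = Child(parent)
linked = child.parent is parent

del parent                            # no cycle, so reference counting frees it at once
orphaned = child.parent is None

print(linked, orphaned)               # True True on CPython
```

Because only one direction holds a strong reference, reference counting alone frees the parent without waiting for gc.collect().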

In [2]:
## Generators for memory efficiency
## Generators produce items one at a time, so only the current item is held in memory.

def generate_numbers(n):
    for i in range(n):
        yield i

## using the generator
for num in generate_numbers(10000):
    print(num)
    if num>10:
        break

0
1
2
3
4
5
6
7
8
9
10
11
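The memory difference can be made visible with `sys.getsizeof`. This is a rough sketch: `getsizeof` reports only the size of the container object itself, not the items it refers to, so the real gap is larger than the numbers suggest. The names and the value of `n` are illustrative.

```python
import sys

n = 100_000
squares_list = [i * i for i in range(n)]   # all n results exist at once
squares_gen = (i * i for i in range(n))    # results are computed on demand

list_bytes = sys.getsizeof(squares_list)   # size of the list container only
gen_bytes = sys.getsizeof(squares_gen)     # size of the generator object

print(f"list container: {list_bytes} bytes")
print(f"generator:      {gen_bytes} bytes")

# Both produce the same total, but the generator never stores all items
total_from_list = sum(squares_list)
total_from_gen = sum(squares_gen)
print(total_from_list == total_from_gen)   # True
```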


In [4]:
## Profiling Memory Usage with tracemalloc

import tracemalloc
def create_list():
    return [i for i in range(10000)]

def main():
    tracemalloc.start()
    ## the returned list is not stored, so it is freed before the snapshot is taken
    create_list()
    snapshot = tracemalloc.take_snapshot()
    ## group allocations by source line, largest first
    top_stats = snapshot.statistics('lineno')

    print("[ Top 10 ]")
    for stat in top_stats[:10]:
        print(stat)

main()

[ Top 10 ]
/usr/local/lib/python3.11/dist-packages/IPython/core/compilerop.py:101: size=10.9 KiB, count=107, average=104 B
/usr/local/lib/python3.11/dist-packages/zmq/sugar/attrsettr.py:44: size=2310 B, count=42, average=55 B
/usr/lib/python3.11/codeop.py:125: size=2234 B, count=23, average=97 B
/usr/lib/python3.11/json/decoder.py:353: size=1959 B, count=24, average=82 B
/usr/local/lib/python3.11/dist-packages/zmq/utils/jsonapi.py:24: size=1795 B, count=11, average=163 B
/usr/local/lib/python3.11/dist-packages/traitlets/traitlets.py:744: size=1551 B, count=22, average=70 B
/usr/local/lib/python3.11/dist-packages/debugpy/_vendored/pydevd/_pydevd_bundle/pydevd_net_command.py:101: size=1509 B, count=1, average=1509 B
/usr/local/lib/python3.11/dist-packages/debugpy/_vendored/pydevd/_pydevd_bundle/_debug_adapter/pydevd_schema.py:11968: size=1456 B, count=7, average=208 B
/usr/local/lib/python3.11/dist-packages/traitlets/traitlets.py:1535: size=1245 B, count=18, average=69 B
/usr/local/lib/p