Given an array of size n and an integer k, count the distinct numbers in every window of size k.

```
Input: arr[] = {1, 2, 1, 3, 4, 2, 3};
       k = 4
Output: 3 4 4 3

Explanation:
First window is {1, 2, 1, 3}, count of distinct numbers is 3
Second window is {2, 1, 3, 4}, count of distinct numbers is 4
Third window is {1, 3, 4, 2}, count of distinct numbers is 4
Fourth window is {3, 4, 2, 3}, count of distinct numbers is 3

Input: arr[] = {1, 2, 4, 4};
       k = 2
Output: 2 2 1

Explanation:
First window is {1, 2}, count of distinct numbers is 2
Second window is {2, 4}, count of distinct numbers is 2
Third window is {4, 4}, count of distinct numbers is 1

```

**Naive Approach:** The naive solution is to examine every window of size k in the array and count the distinct elements of each window from scratch.

Algorithm:

1. For every start index i from 0 to n - k, take the elements from i to i + k - 1. This is the window.
2. For each element in the window, scan the elements before it in the same window and check whether it has already occurred.
3. If the element has not occurred earlier in the window, i.e. it has no duplicate between i and the index just before it, increase the count.
4. Print the count for the window.


In [1]:
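def countDistinctUtil(arr):
    """Count distinct elements by comparing each element with all earlier ones."""
    n = len(arr)
    totalDistinctElement = 0

    for i in range(n):
        # Look for an earlier occurrence of arr[i] among arr[0..i-1].
        j = 0
        while j < i and arr[i] != arr[j]:
            j += 1

        # j reached i, so arr[i] has not appeared before.
        if i == j:
            totalDistinctElement += 1

    return totalDistinctElement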

In [2]:
def countDistinct(arr, k):
    
    for i in range(len(arr)-k+1):
        print(countDistinctUtil(arr[i:i+k]),end=" ")

In [3]:
arr = [1, 2, 1, 3, 4, 2, 3] 
k = 4

countDistinct(arr, k)

3 4 4 3 

The time complexity of this approach is **O((n - k + 1) * k^2)**: there are n - k + 1 windows, and for each window the util function compares every element with all the elements before it, which takes O(k^2) time.
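The quadratic cost per window comes from the pairwise comparison inside the util function. As an intermediate step that is not part of the original solution, the sketch below swaps that comparison for Python's built-in `set`, which brings the cost per window down to O(k) on average. The function name `count_distinct_with_set` is made up for this illustration, and it returns a list instead of printing.

```python
def count_distinct_with_set(arr, k):
    """Return the distinct count of every length-k window, building one set per window."""
    # len(arr) - k + 1 windows; an empty range (and an empty list) if k > len(arr).
    return [len(set(arr[i:i + k])) for i in range(len(arr) - k + 1)]


print(count_distinct_with_set([1, 2, 1, 3, 4, 2, 3], 4))  # [3, 4, 4, 3]
```

Every window is still rebuilt from scratch, so the total work remains proportional to (n - k + 1) * k.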

**Efficient Approach:** Hashing gives a more efficient solution. The hash map needs up to O(n) extra space, but the time complexity improves to O(n) on average.

The trick is to reuse the count of the previous window while sliding the window. The naive approach recomputes the count of distinct elements for every window and discards all the computation done in the previous step, even though k - 1 elements are shared between adjacent windows. Instead, a hash map can store the elements of the current window as element-frequency pairs. At each step one element is added to the map and one is removed, while the number of distinct elements is tracked alongside. For example, assume that the elements from index i to i + k - 1 are stored in the hash map. To move to the window from i + 1 to i + k, reduce the frequency of the i-th element by 1 and increase the frequency of the (i + k)-th element by 1.
Insertion into and deletion from a hash map take constant time on average.
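To make a single slide concrete, here is a minimal sketch of one update step. It assumes `collections.Counter` as the hash map and uses a helper name, `slide`, that is invented for this illustration. Keys whose frequency drops to zero are deleted, so the number of keys equals the number of distinct elements in the window.

```python
from collections import Counter


def slide(freq, outgoing, incoming):
    """Move the window one step: drop `outgoing`, add `incoming`, return the distinct count."""
    freq[outgoing] -= 1
    if freq[outgoing] == 0:
        del freq[outgoing]      # the element no longer occurs in the window
    freq[incoming] += 1         # Counter treats a missing key as 0
    return len(freq)


arr = [1, 2, 1, 3, 4, 2, 3]
k = 4

freq = Counter(arr[:k])                 # first window {1, 2, 1, 3}
print(len(freq))                        # 3
print(slide(freq, arr[0], arr[k]))      # window {2, 1, 3, 4} -> 4
```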

<img src="https://media.geeksforgeeks.org/wp-content/cdn-uploads/20190702115832/CountdistinctElementsIneveryWindow.png" >

Algorithm:
1. Create an empty hash map. Let the hash map be hM.
2. Initialize the count of distinct elements, dist_count, to 0.
3. Traverse the first window and insert its elements into hM. The elements are used as keys and their counts as values in hM. Increase dist_count whenever a new key is added.
4. Print the distinct count for the first window.
5. Traverse the remaining array (one new window per element) and repeat steps 6 to 11 for each window.
6. Remove the first element of the previous window.
7. If the removed element appeared only once, remove it from hM and decrease the distinct count, i.e. do "dist_count--". Otherwise (it appeared multiple times in hM), decrement its count in hM.
8. Add the current element (the last element of the new window).
9. If the added element is not present in hM, add it to hM and increase the distinct count, i.e. do "dist_count++".
10. Otherwise (the added element is already present in hM), increment its count in hM.
11. Print the distinct count for the new window.

In [4]:
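from collections import defaultdict

def countDistinct(arr, k):
    hashMap = defaultdict(int)  # element -> frequency in the current window
    n = len(arr)
    distinctCount = 0  # renamed so it no longer shadows the function name

    # Build the frequency map for the first window.
    for i in range(k):
        if hashMap[arr[i]] == 0:
            distinctCount += 1
        hashMap[arr[i]] += 1

    print(distinctCount, end=" ")

    # Slide the window one position at a time.
    for i in range(k, n):
        # Remove the element that leaves the window.
        outgoing = arr[i - k]
        if hashMap[outgoing] == 1:
            distinctCount -= 1
            del hashMap[outgoing]
        else:
            hashMap[outgoing] -= 1

        # Add the element that enters the window.
        if hashMap[arr[i]] == 0:
            distinctCount += 1
        hashMap[arr[i]] += 1

        print(distinctCount, end=" ")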

In [5]:
arr = [1, 2, 1, 3, 4, 2, 3]  
k = 4
countDistinct(arr, k)

3 4 4 3 
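The functions above print their results, which makes them awkward to test or reuse. The sketch below is one possible variant of the hashing approach that returns the counts as a list. The name `distinct_counts` and the decision to return an empty list when k is not between 1 and n are assumptions made for this example, not part of the original solution.

```python
from collections import Counter


def distinct_counts(arr, k):
    """Return a list with the number of distinct elements in each length-k window."""
    n = len(arr)
    if k <= 0 or k > n:
        return []  # assumption: no valid window, so nothing to report

    freq = Counter(arr[:k])  # element -> frequency in the first window
    counts = [len(freq)]

    for i in range(k, n):
        outgoing = arr[i - k]
        freq[outgoing] -= 1
        if freq[outgoing] == 0:
            del freq[outgoing]  # keep only elements that are still in the window
        freq[arr[i]] += 1
        counts.append(len(freq))

    return counts


print(distinct_counts([1, 2, 1, 3, 4, 2, 3], 4))  # [3, 4, 4, 3]
print(distinct_counts([1, 2, 4, 4], 2))           # [2, 2, 1]
```

Because zero-frequency keys are deleted, the map never holds more than k keys at a time.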