Question 1: Reverse List by N Elements

Problem Statement:

Write a function that takes a list and an integer n, and returns the list with every group of n elements reversed. If there are fewer than n elements left at the end, reverse all of them.

Requirements:

You must not use any built-in slicing or reverse functions to directly reverse the sublists.
The result should reverse the elements in groups of size n.

Example:

Input: [1, 2, 3, 4, 5, 6, 7, 8], n=3
Output: [3, 2, 1, 6, 5, 4, 8, 7]

Input: [1, 2, 3, 4, 5], n=2
Output: [2, 1, 4, 3, 5]

Input: [10, 20, 30, 40, 50, 60, 70], n=4
Output: [40, 30, 20, 10, 70, 60, 50]

In [1]:
from typing import Dict, List
import pandas as pd

In [2]:
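def reverse_by_n_elements(lst: List[int], n: int) -> List[int]:
    if n <= 0:
        raise ValueError("n must be a positive integer")
    result = []
    for start in range(0, len(lst), n):
        end = min(start + n, len(lst))  # the last group may be shorter than n
        # Walk the group backwards by index; no slicing or reversed() needed
        for j in range(end - 1, start - 1, -1):
            result.append(lst[j])
    return result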
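
As an alternative reading of the "no slicing, no reverse functions" requirement, the groups can also be reversed by swapping elements from both ends of each group. The sketch below is only an illustration; the name reverse_by_n_swapping is hypothetical and not part of the assignment template.

```python
from typing import List


def reverse_by_n_swapping(lst: List[int], n: int) -> List[int]:
    """Reverse every group of n elements using two-pointer swaps."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    result = list(lst)  # work on a copy so the caller's list is untouched
    for start in range(0, len(result), n):
        left = start
        right = min(start + n, len(result)) - 1  # last index of this group
        while left < right:
            result[left], result[right] = result[right], result[left]
            left += 1
            right -= 1
    return result


print(reverse_by_n_swapping([1, 2, 3, 4, 5, 6, 7, 8], 3))
```

Each element is moved at most once, so the work stays linear in the length of the list.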


Question 2: Lists & Dictionaries

Problem Statement:

Write a function that takes a list of strings and groups them by their length. The result should be a dictionary where:

The keys are the string lengths.
The values are lists of strings that have the same length as the key.

Requirements:

Each string should appear in the list corresponding to its length.
The result should be sorted by the lengths (keys) in ascending order.

Example:

Input: ["apple", "bat", "car", "elephant", "dog", "bear"]
Output: {3: ['bat', 'car', 'dog'], 4: ['bear'], 5: ['apple'], 8: ['elephant']}

Input: ["one", "two", "three", "four"]
Output: {3: ['one', 'two'], 4: ['four'], 5: ['three']}

In [3]:
def group_by_length(lst: List[str]) -> Dict[int, List[str]]:
    length_dict = {}
    for string in lst:
        key = len(string)
        if key not in length_dict:
            length_dict[key] = []
        length_dict[key].append(string)
    return dict(sorted(length_dict.items()))
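
The same grouping can be sketched with collections.defaultdict, which removes the explicit "key not in dict" check. The function name group_by_length_defaultdict is made up for this illustration.

```python
from collections import defaultdict
from typing import Dict, List


def group_by_length_defaultdict(words: List[str]) -> Dict[int, List[str]]:
    groups: Dict[int, List[str]] = defaultdict(list)
    for word in words:
        groups[len(word)].append(word)  # a missing key starts as an empty list
    # Rebuild as a plain dict with keys in ascending order
    return {length: groups[length] for length in sorted(groups)}


print(group_by_length_defaultdict(["apple", "bat", "car", "elephant", "dog", "bear"]))
```

Since Python 3.7 a dict keeps insertion order, which is why inserting the keys in sorted order is enough to satisfy the ordering requirement.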


Question 3: Flatten a Nested Dictionary
You are given a nested dictionary that contains various details (including lists and sub-dictionaries). Your task is to write a Python function that flattens the dictionary such that:

Nested keys are concatenated into a single key with levels separated by a dot (.).
List elements should be referenced by their index, enclosed in square brackets (e.g., sections[0]).
For example, if a key points to a list, the index of the list element should be appended to the key string, followed by a dot to handle further nested dictionaries.

Requirements:

Nested Dictionary: Flatten nested dictionaries into a single level, concatenating keys.
Handling Lists: Flatten lists by using the index as part of the key.
Key Separator: Use a dot (.) as a separator between nested key levels.
Empty Input: The function should handle empty dictionaries gracefully.
Nested Depth: You can assume the dictionary has a maximum of 4 levels of nesting.
Example:

Input:

{
    "road": {
        "name": "Highway 1",
        "length": 350,
        "sections": [
            {
                "id": 1,
                "condition": {
                    "pavement": "good",
                    "traffic": "moderate"
                }
            }
        ]
    }
}
Output:

{
    "road.name": "Highway 1",
    "road.length": 350,
    "road.sections[0].id": 1,
    "road.sections[0].condition.pavement": "good",
    "road.sections[0].condition.traffic": "moderate"
}

In [4]:
def flatten_dict(nested_dict: Dict, sep: str = '.') -> Dict:
    def recurse(d, parent_key=''):
        items = []
        for k, v in d.items():
            new_key = f"{parent_key}{sep}{k}" if parent_key else k
            if isinstance(v, dict):
                items.extend(recurse(v, new_key).items())
            elif isinstance(v, list):
                for idx, item in enumerate(v):
                    items.extend(recurse({f'{new_key}[{idx}]': item}).items())
            else:
                items.append((new_key, v))
        return dict(items)
    
    return recurse(nested_dict)
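
The key-building rules (dot between dictionary levels, [index] for list elements) can also be written as one recursive function over arbitrary values rather than over dictionaries only. This is a minimal sketch under that assumption; flatten_any and its prefix parameter are hypothetical names.

```python
from typing import Any, Dict


def flatten_any(value: Any, prefix: str = '', sep: str = '.') -> Dict[str, Any]:
    flat: Dict[str, Any] = {}
    if isinstance(value, dict):
        for key, child in value.items():
            # Join with the separator only when there is a parent key
            child_key = f"{prefix}{sep}{key}" if prefix else str(key)
            flat.update(flatten_any(child, child_key, sep))
    elif isinstance(value, list):
        for idx, child in enumerate(value):
            # List elements are addressed by index in square brackets
            flat.update(flatten_any(child, f"{prefix}[{idx}]", sep))
    else:
        flat[prefix] = value  # leaf value: store under the accumulated key
    return flat


print(flatten_any({"a": {"b": [1, {"c": 2}]}}))
```

Lists of plain values and lists of dictionaries go through the same branch, so mixed lists need no special case.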


Question 4: Generate Unique Permutations

Problem Statement:

You are given a list of integers that may contain duplicates. Your task is to generate all unique permutations of the list. The output should not contain any duplicate permutations.

Example:

Input:

[1, 1, 2]
Output:

[
    [1, 1, 2],
    [1, 2, 1],
    [2, 1, 1]
]

In [5]:
from itertools import permutations

def unique_permutations(nums: List[int]) -> List[List[int]]:
    return sorted([list(perm) for perm in set(permutations(nums))])
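
Building every permutation and then de-duplicating with a set does n! work even when most results are duplicates. A common alternative is backtracking over the sorted input and skipping a repeated value unless its identical left neighbour has already been placed. The sketch below illustrates that idea; unique_permutations_backtrack is a hypothetical name.

```python
from typing import List


def unique_permutations_backtrack(nums: List[int]) -> List[List[int]]:
    ordered = sorted(nums)           # equal values become adjacent
    used = [False] * len(ordered)
    current: List[int] = []
    result: List[List[int]] = []

    def backtrack() -> None:
        if len(current) == len(ordered):
            result.append(current.copy())
            return
        for i, value in enumerate(ordered):
            if used[i]:
                continue
            # Skip a duplicate unless the equal element before it is in use,
            # so equal values are always placed in left-to-right order
            if i > 0 and ordered[i - 1] == value and not used[i - 1]:
                continue
            used[i] = True
            current.append(value)
            backtrack()
            current.pop()
            used[i] = False

    backtrack()
    return result


print(unique_permutations_backtrack([1, 1, 2]))
```

Because the input is sorted first, the permutations come out in lexicographic order without a final sort.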


Question 5: Find All Dates in a Text

Problem Statement:

You are given a string that contains dates in various formats (such as "dd-mm-yyyy", "mm/dd/yyyy", "yyyy.mm.dd", etc.). Your task is to identify and return all the valid dates present in the string.

You need to write a function find_all_dates that takes a string as input and returns a list of valid dates found in the text. The dates can be in any of the following formats:

dd-mm-yyyy
mm/dd/yyyy
yyyy.mm.dd
You are required to use regular expressions to identify these dates.

Example:

Input:

text = "I was born on 23-08-1994, my friend on 08/23/1994, and another one on 1994.08.23."
Output:

["23-08-1994", "08/23/1994", "1994.08.23"]

In [6]:
import re

def find_all_dates(text: str) -> List[str]:
    # Digit lookarounds stop matches inside longer digit runs.
    # Calendar validity (e.g. month <= 12) is not checked here.
    pattern = r'(?<!\d)(?:\d{2}-\d{2}-\d{4}|\d{2}/\d{2}/\d{4}|\d{4}\.\d{2}\.\d{2})(?!\d)'
    return re.findall(pattern, text)
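
The problem statement asks for valid dates, and a regular expression alone accepts strings such as 31-02-2020. One way to tighten this, sketched below, is to let the regex find candidates and then confirm each one with datetime.strptime. The name find_valid_dates and the group names dmy, mdy and ymd are assumptions made for this illustration.

```python
import re
from datetime import datetime
from typing import List

# One named group per supported format; lookarounds reject longer digit runs
DATE_PATTERN = re.compile(
    r'(?<!\d)(?:(?P<dmy>\d{2}-\d{2}-\d{4})'
    r'|(?P<mdy>\d{2}/\d{2}/\d{4})'
    r'|(?P<ymd>\d{4}\.\d{2}\.\d{2}))(?!\d)'
)
DATE_FORMATS = {'dmy': '%d-%m-%Y', 'mdy': '%m/%d/%Y', 'ymd': '%Y.%m.%d'}


def find_valid_dates(text: str) -> List[str]:
    found = []
    for match in DATE_PATTERN.finditer(text):
        fmt = DATE_FORMATS[match.lastgroup]  # format of the group that matched
        try:
            datetime.strptime(match.group(), fmt)  # raises on impossible dates
        except ValueError:
            continue
        found.append(match.group())
    return found


text = "I was born on 23-08-1994, my friend on 08/23/1994, and another one on 1994.08.23."
print(find_valid_dates(text))
```

Matches are returned in the order they appear in the text, and candidates that are not real calendar dates are dropped.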


Question 6: Decode Polyline, Convert to DataFrame with Distances
You are given a polyline string, which encodes a series of latitude and longitude coordinates. Polyline encoding is a method to efficiently store latitude and longitude data using fewer bytes. The Python polyline module allows you to decode this string into a list of coordinates.

Write a function that performs the following operations:

Decode the polyline string using the polyline module into a list of (latitude, longitude) coordinates.
Convert these coordinates into a Pandas DataFrame with the following columns:
latitude: Latitude of the coordinate.
longitude: Longitude of the coordinate.
distance: The distance (in meters) between the current row's coordinate and the previous row's coordinate. The first row has a distance of 0 because there is no previous point.
Calculate the distance using the Haversine formula for points in successive rows.
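
For reference, the Haversine formula itself can be written out with the standard math module. This is a minimal sketch, assuming a spherical Earth with a mean radius of 6,371,000 m; the function name haversine_m and the constant EARTH_RADIUS_M are hypothetical and not taken from any library.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # assumed mean Earth radius in meters


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in meters between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    # a is the squared half-chord length between the two points
    a = math.sin(d_phi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


# One degree of longitude along the equator
print(round(haversine_m(0.0, 0.0, 0.0, 1.0)))
```

Results from a library may differ slightly from this sketch if it uses a different Earth radius.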

In [8]:
!pip install haversine


In [9]:
import polyline
from haversine import haversine

def polyline_to_dataframe(polyline_str: str) -> pd.DataFrame:
    # polyline.decode returns a list of (latitude, longitude) tuples
    coords = polyline.decode(polyline_str)

    data = []
    for i, (lat, lon) in enumerate(coords):
        if i == 0:
            distance = 0.0  # no previous point for the first row
        else:
            # haversine() returns kilometers by default; convert to meters
            distance = haversine(coords[i - 1], (lat, lon)) * 1000
        data.append([lat, lon, distance])

    return pd.DataFrame(data, columns=['latitude', 'longitude', 'distance'])

Question 7: Matrix Rotation and Transformation
Write a function that performs the following operations on a square matrix (n x n):

Rotate the matrix by 90 degrees clockwise.
After rotation, for each element in the rotated matrix, replace it with the sum of all elements in the same row and column (in the rotated matrix), excluding itself.
The function should return the transformed matrix.

Example
For the input matrix:

matrix = [[1, 2, 3],[4, 5, 6],[7, 8, 9]]
Rotate the matrix by 90 degrees clockwise:

rotated_matrix = [[7, 4, 1],[8, 5, 2],[9, 6, 3]]
Replace each element with the sum of all elements in the same row and column, excluding itself:

final_matrix = [[22, 19, 16],[23, 20, 17],[24, 21, 18]]
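
The numbers in this example can be checked by hand: an element belongs to both its row total and its column total, so excluding it from both means subtracting it twice. For the top-left entry, 12 (row) + 24 (column) - 2 * 7 = 22. The short sketch below applies that rule to the rotated matrix; the helper name row_col_sum_excluding_self is made up for this check.

```python
from typing import List


def row_col_sum_excluding_self(grid: List[List[int]]) -> List[List[int]]:
    row_sums = [sum(row) for row in grid]
    col_sums = [sum(col) for col in zip(*grid)]  # zip(*grid) yields the columns
    return [
        # The element is part of both totals, so remove it twice
        [row_sums[i] + col_sums[j] - 2 * grid[i][j] for j in range(len(grid[i]))]
        for i in range(len(grid))
    ]


rotated_matrix = [[7, 4, 1], [8, 5, 2], [9, 6, 3]]
print(row_col_sum_excluding_self(rotated_matrix))
```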

In [10]:
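def rotate_and_multiply_matrix(matrix: List[List[int]]) -> List[List[int]]:
    n = len(matrix)
    # Column i of the input, read bottom to top, becomes row i of the rotated matrix
    rotated_matrix = [[matrix[n - j - 1][i] for j in range(n)] for i in range(n)]

    # Compute each row and column total once
    row_sums = [sum(row) for row in rotated_matrix]
    col_sums = [sum(rotated_matrix[k][j] for k in range(n)) for j in range(n)]

    transformed_matrix = []
    for i in range(n):
        new_row = []
        for j in range(n):
            # The element is counted in both totals, so subtract it twice to exclude it
            new_row.append(row_sums[i] + col_sums[j] - 2 * rotated_matrix[i][j])
        transformed_matrix.append(new_row)

    return transformed_matrix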


Question 8: Time Check
You are given a dataset, dataset-1.csv, containing columns id, id_2, and timestamp (startDay, startTime, endDay, endTime). The goal is to verify the completeness of the time data by checking whether the timestamps for each unique (id, id_2) pair cover a full 24-hour period (from 12:00:00 AM to 11:59:59 PM) and span all 7 days of the week (from Monday to Sunday).

Create a function that accepts dataset-1.csv as a DataFrame and returns a boolean series that indicates if each (id, id_2) pair has incorrect timestamps. The boolean series must have multi-index (id, id_2).

In [11]:
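def time_check(df: pd.DataFrame) -> pd.Series:
    full_days = {'Monday', 'Tuesday', 'Wednesday', 'Thursday', 'Friday', 'Saturday', 'Sunday'}
    # Seconds since midnight, computed on a copy so the caller's DataFrame is not mutated
    work = df.assign(start_s=pd.to_timedelta(df['startTime']).dt.total_seconds(),
                     end_s=pd.to_timedelta(df['endTime']).dt.total_seconds())

    def is_incorrect(group: pd.DataFrame) -> bool:
        days = set(group['startDay']) | set(group['endDay'])
        # Heuristic: earliest start is 00:00:00 and latest end is 23:59:59; gaps are not detected
        full_day = group['start_s'].min() == 0 and group['end_s'].max() == 24 * 60 * 60 - 1
        return not (days == full_days and full_day)

    # True marks an (id, id_2) pair whose timestamps are incomplete
    return work.groupby(['id', 'id_2']).apply(is_incorrect)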
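
A stricter completeness check would treat every row as an interval on a seconds-of-week axis and verify that the merged intervals leave no gap from Monday 00:00:00 to Sunday 23:59:59. The sketch below shows that idea on plain tuples. It assumes full English day names and HH:MM:SS times, which the problem statement does not confirm, and the names to_week_seconds and covers_full_week are hypothetical.

```python
from typing import Iterable, List, Tuple

DAYS = ['Monday', 'Tuesday', 'Wednesday', 'Thursday', 'Friday', 'Saturday', 'Sunday']
DAY_S = 24 * 60 * 60
WEEK_S = 7 * DAY_S


def to_week_seconds(day: str, clock: str) -> int:
    """Seconds elapsed since Monday 00:00:00 (assumes 'HH:MM:SS' input)."""
    hours, minutes, seconds = (int(part) for part in clock.split(':'))
    return DAYS.index(day) * DAY_S + hours * 3600 + minutes * 60 + seconds


def covers_full_week(rows: Iterable[Tuple[str, str, str, str]]) -> bool:
    """rows are (startDay, startTime, endDay, endTime) tuples for one (id, id_2) pair."""
    intervals: List[Tuple[int, int]] = []
    for start_day, start_time, end_day, end_time in rows:
        start = to_week_seconds(start_day, start_time)
        end = to_week_seconds(end_day, end_time) + 1  # end second is inclusive
        if end <= start:
            # Interval wraps past Sunday night: split it at the week boundary
            intervals.append((start, WEEK_S))
            intervals.append((0, end))
        else:
            intervals.append((start, end))

    covered_until = 0
    for start, end in sorted(intervals):
        if start > covered_until:
            return False  # gap before this interval begins
        covered_until = max(covered_until, end)
    return covered_until >= WEEK_S


full_week = [(day, '00:00:00', day, '23:59:59') for day in DAYS]
print(covers_full_week(full_week))
```

To connect this with the DataFrame version, each (id, id_2) group could pass its four columns as tuples to covers_full_week, and the returned value would be negated to flag incorrect pairs.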
