# Task 2: Deep-SORT Overview and Implementation

## Objective:
The goal of this task is to implement the **Deep-SORT** (Simple Online and Realtime Tracking with a Deep Association Metric) algorithm to track both players and the soccer ball in a video. The tracking solution must use an object detector and display bounding boxes over the players and ball in the video. Deep-SORT enhances SORT by adding deep learning-based appearance features, which helps it handle occlusions and interactions between objects. Additionally, the task requires submitting a video showing the tracking results overlaid on the test video.

## Key Steps Taken:

### 1. **Deep-SORT Understanding and Integration:**
We first explored the **Deep-SORT** algorithm by reviewing papers and resources. The algorithm leverages two main components for object tracking: the **Kalman Filter** and the **Hungarian Algorithm**. The Kalman filter is used for predicting the object's position in the next frame, based on its previous states (like position, velocity, etc.). The Hungarian algorithm is used to associate detections to existing tracks by minimizing the cost of assignments.  
Deep-SORT extends traditional SORT by introducing **appearance features** using a pre-trained neural network. This allows the algorithm to differentiate between objects even when they are occluded or overlap, providing more robust tracking in challenging conditions.

### 2. **Architecture Diagram:**
The architecture of the solution was illustrated using **Excalidraw**, a tool compatible with GitHub rendering. The diagram depicted how the tracking system integrates with object detection and the Kalman filter to predict and associate objects across frames. The Hungarian algorithm ensures that the detections from the object detector are correctly associated with the existing tracks.

### 3. **Object Detection with YOLO:**
We used the **YOLO (You Only Look Once)** object detection model to detect the objects of interest, namely the **players** and the **soccer ball**. YOLO is a powerful, fast object detection algorithm that can detect multiple objects in real-time. In the implementation, we specifically selected the `person` and `sports ball` classes from the YOLO model to focus on tracking players and the ball.
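
The class selection described above can be sketched in plain Python. This is only an illustration: the tuple layout, the `filter_detections` helper, and the `min_conf` threshold are assumptions rather than part of the notebook, and the class ids assume the COCO ordering used by the standard YOLO checkpoints (0 for `person`, 32 for `sports ball`).

```python
# Hypothetical raw detections: (x1, y1, x2, y2, confidence, class_id).
# Class ids assume the COCO ordering: 0 is "person", 32 is "sports ball".
KEEP_CLASSES = {0: "person", 32: "sports ball"}


def filter_detections(raw, min_conf=0.3):
    """Keep only confident detections of the classes of interest."""
    kept = []
    for x1, y1, x2, y2, conf, cls_id in raw:
        if cls_id in KEEP_CLASSES and conf >= min_conf:
            # Store the box as (top-left x, top-left y, width, height)
            tlwh = (x1, y1, x2 - x1, y2 - y1)
            kept.append((tlwh, conf, KEEP_CLASSES[cls_id]))
    return kept


raw = [
    (100, 50, 140, 150, 0.91, 0),    # player, kept
    (300, 200, 312, 212, 0.55, 32),  # ball, kept
    (10, 10, 60, 40, 0.80, 2),       # other class, dropped
    (400, 60, 430, 140, 0.10, 0),    # low-confidence person, dropped
]
filtered = filter_detections(raw)
for tlwh, conf, name in filtered:
    print(name, tlwh, conf)
```

Converting to `(x, y, width, height)` at this stage matches the `tlwh` format that the `Detection` class expects later in the notebook.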

### 4. **Deep-SORT Implementation:**
Deep-SORT was implemented by integrating YOLO with a feature extraction model (pre-trained MARS model) to obtain appearance features for each detection. The Deep-SORT tracker then tracks these detections across video frames using the Kalman filter for prediction and the Hungarian algorithm for data association. This allowed us to track the players and the soccer ball accurately.  
The tracker was initialized with each detection's bounding box and the associated appearance features, which were extracted from the regions of interest (players and ball) in each frame.

### 5. **Visualization:**
After the detection and tracking process, bounding boxes were drawn over the players and soccer ball in each frame. These boxes were updated as the objects moved throughout the video. The tracking IDs were also displayed alongside the objects to indicate the specific tracks being followed.

### 6. **Test Video Processing:**
The code was tested on a provided video, where the system successfully detected and tracked both players and the soccer ball. Bounding boxes were superimposed on the test video, and the final result was saved for submission.

### 7. **Video Submission:**
The processed video, with the bounding boxes overlaying the detected players and the soccer ball, was prepared as the final submission. The video showed the effectiveness of Deep-SORT in accurately tracking multiple objects across frames, including handling occlusions when the players and ball temporarily disappeared from view.

## Deep-SORT Components Overview:

### 1. **Kalman Filter:**
The Kalman filter is used for estimating the state of a tracked object, including its position and velocity. It works by making predictions about the object’s future state, and then correcting these predictions using measurements from the detector. The Kalman filter helps to predict where an object should be in the next frame, which is crucial for maintaining continuous tracking even when there is uncertainty or noise in the measurements.
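
The predict-then-correct cycle can be illustrated with a one-dimensional constant-velocity filter. This is a simplified sketch, not the eight-dimensional filter used by the tracker: the matrices `F`, `H`, `Q`, `R`, the helper `kf_step`, and the sample measurements are all assumptions chosen for illustration.

```python
import numpy as np

# State is [position, velocity]; only the position is measured.
F = np.array([[1.0, 1.0], [0.0, 1.0]])  # constant-velocity transition, dt = 1
H = np.array([[1.0, 0.0]])              # measurement picks out the position
Q = np.eye(2) * 1e-2                    # process noise (assumed)
R = np.array([[1.0]])                   # measurement noise (assumed)


def kf_step(mean, cov, z):
    """One predict + update cycle of a linear Kalman filter."""
    # Predict: propagate the state and grow the uncertainty
    mean = F @ mean
    cov = F @ cov @ F.T + Q
    # Update: weigh the innovation by the Kalman gain
    S = H @ cov @ H.T + R
    K = cov @ H.T @ np.linalg.inv(S)
    mean = mean + K @ (z - H @ mean)
    cov = (np.eye(2) - K @ H) @ cov
    return mean, cov


# Start with a vague prior and feed noisy positions of an object moving ~1 unit/frame
mean, cov = np.array([0.0, 0.0]), np.eye(2) * 10.0
for z in [1.0, 2.1, 2.9, 4.2, 5.0]:
    mean, cov = kf_step(mean, cov, np.array([z]))

print("estimated position and velocity:", mean)
```

After a handful of frames the estimate settles near the last measured position with a velocity close to one unit per frame, even though velocity is never observed directly.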

### 2. **Hungarian Algorithm:**
The Hungarian algorithm is used for **data association**. Given a set of predicted track states and the detections in a frame, it finds the one-to-one assignment of detections to tracks that minimizes the total cost in a cost matrix. In Deep-SORT the cost is typically based on the appearance distance between feature vectors, gated by the Mahalanobis distance to the Kalman prediction, with IoU overlap as an alternative cost. Detections left unmatched are then used by the tracker to start new tracks. This helps to ensure that each object is matched with the correct track even in crowded scenes.
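
A small example makes the assignment step concrete. The sketch below uses `scipy.optimize.linear_sum_assignment`, which solves the same linear assignment problem; the cost values are made up for illustration.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

# Rows are existing tracks, columns are new detections; entries are costs.
cost = np.array([
    [0.1, 0.9, 0.8],
    [0.7, 0.2, 0.9],
])

rows, cols = linear_sum_assignment(cost)
matches = [(int(r), int(c)) for r, c in zip(rows, cols)]
unmatched_detections = [j for j in range(cost.shape[1]) if j not in cols]

print("matches:", matches)                             # [(0, 0), (1, 1)]
print("unmatched detections:", unmatched_detections)   # [2]
```

With two tracks and three detections, one detection is necessarily left over; in a tracker that detection would be a candidate for a new track.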

## Outcome:
The system was able to detect and track both the **soccer players** and the **soccer ball** throughout the video. The bounding boxes were accurately drawn around the objects, and the tracking IDs were displayed, allowing the viewer to easily follow the movement of each player and the ball. The video was processed successfully and saved as output, ready for submission.

## Conclusion:
This task demonstrated the implementation of Deep-SORT for tracking objects in a sports video. By combining YOLO for object detection and Deep-SORT for object tracking, we were able to create a robust system capable of tracking players and the soccer ball even in complex and dynamic environments. The use of the Kalman filter and Hungarian algorithm ensured that the objects were tracked accurately across frames, making the system effective for real-time tracking in sports videos.


### Setup and Installation

In this section, we install the necessary packages and dependencies to implement the Deep SORT object tracking solution.

1. **Cleaning Up Previous Builds**:  
   The command `!rm -rf build dist *.egg-info` removes any previous build directories or egg-info files that may interfere with fresh installations.

2. **Installing System Packages**:  
   The command `!apt-get install -y libblas-dev liblapack-dev libsuitesparse-dev` installs system libraries like BLAS, LAPACK, and SuiteSparse, which are essential for numerical computations and optimizations in machine learning algorithms.

3. **Installing Python Libraries**:  
   The script installs several Python libraries required for the project:
   - `yt-dlp`: A tool for downloading videos from YouTube.
   - `torch`, `torchvision`: Libraries for deep learning and computer vision.
   - `opencv-python`: Used for image processing and video manipulation.
   - `matplotlib`: For plotting and visualizing results.
   - `filterpy`: Provides Kalman filter implementations, which are crucial for the Deep SORT tracking algorithm.
   - `scikit-learn`: A machine learning library for various utilities.
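   - `lap`: Library for solving the linear assignment problem (it implements the Jonker-Volgenant algorithm, an alternative to the Hungarian algorithm).
   - `scipy`: A library for scientific computing.
   - `ultralytics`: Provides the `YOLO` class used for object detection.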

4. **Downloading Pre-trained Models**:  
   The pre-trained `mars-small128` model is downloaded using `wget`. It is an appearance (re-identification) network trained on MARS (Motion Analysis and Re-identification Set), a person re-identification dataset, and it supplies the appearance features used by the Deep SORT tracker.

5. **Cloning YOLOv5 Repository**:  
   The YOLOv5 repository is cloned from GitHub using `git clone`. YOLO provides the object detections that the Deep SORT tracker consumes.

6. **Installing TensorFlow**:  
   Finally, TensorFlow is installed with the command `!pip install tensorflow`. The `mars-small128.pb` appearance model is a TensorFlow graph file, so TensorFlow is needed to load and run it.


In [1]:
# Install required packages
!rm -rf build dist *.egg-info
!apt-get install -y libblas-dev liblapack-dev libsuitesparse-dev
!pip install --upgrade yt-dlp
!pip install torch torchvision --quiet
!pip install opencv-python --quiet
!pip install matplotlib --quiet
!pip install filterpy --quiet
!pip install scikit-learn --quiet
!pip install lap --quiet
!pip install scipy --quiet
!pip install ultralytics --quiet

# Download pre-trained MARS model
!wget https://github.com/Qidian213/deep_sort_yolov3/raw/master/model_data/mars-small128.pb

#yolov5 directory
!git clone https://github.com/ultralytics/yolov5.git

# Install TensorFlow
!pip install tensorflow --quiet

Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
liblapack-dev is already the newest version (3.10.0-2ubuntu1).
The following additional packages will be installed:
  libamd2 libbtf1 libcamd2 libccolamd2 libcholmod3 libcolamd2 libcxsparse3 libgraphblas-dev
  libgraphblas6 libklu1 libldl2 libmetis5 libmongoose2 librbio2 libsliplu1 libspqr2
  libsuitesparseconfig5 libumfpack5
Suggested packages:
  liblapack-doc
The following NEW packages will be installed:
  libamd2 libblas-dev libbtf1 libcamd2 libccolamd2 libcholmod3 libcolamd2 libcxsparse3
  libgraphblas-dev libgraphblas6 libklu1 libldl2 libmetis5 libmongoose2 librbio2 libsliplu1
  libspqr2 libsuitesparse-dev libsuitesparseconfig5 libumfpack5
0 upgraded, 20 newly installed, 0 to remove and 49 not upgraded.
Need to get 22.6 MB of archives.
After this operation, 170 MB of additional disk space will be used.
Get:1 http://archive.ubuntu.com/ubuntu jammy/main amd64 libsuitesparseconfig5 amd64 

### Downloading YouTube Video with yt-dlp

In this section, we use the `yt-dlp` library to download a video from YouTube with specific options set to control the download and output format.

1. **Importing yt-dlp**:  
   The script starts by importing the `yt_dlp` module (the Python package behind the `yt-dlp` tool), which downloads video and audio from YouTube and other sites.

2. **Setting Download Options**:  
   A dictionary `ydl_opts` is created to specify various download options:
   - `'format'`: This option selects the best video and audio streams, or defaults to the best available if the preferred format is unavailable.
   - `'outtmpl'`: Defines the output file naming pattern, using the video title as the file name and the appropriate extension.
   - `'merge_output_format'`: Sets MP4 as the container to use when separate video and audio streams are merged.
   - `'postprocessors'`: Specifies post-processing instructions. In this case, it uses `FFmpegVideoConvertor` to convert the video to MP4 if necessary.
   - `'keepvideo'`: Retains intermediate files during the video merging process (if applicable), which can be useful for debugging or further processing.

3. **Downloading the Video**:  
   The `yt_dlp.YoutubeDL` object is initialized with the specified options, and the `download` method is called with the URL of the video to download (`'https://www.youtube.com/watch?v=l3NJNFmg09k'`). This fetches the video, applies the options, and saves it in the desired format.


In [2]:
import yt_dlp # Import the yt-dlp module
# Set options for yt-dlp
ydl_opts = {
    'format': 'bestvideo+bestaudio/best',  # Combines best video and audio or falls back to best available
    'outtmpl': '%(title)s.%(ext)s',       # Saves the file using the video's title and correct extension
    'merge_output_format': 'mp4',         # Ensures final output is in MP4 format
    'postprocessors': [
        {
            'key': 'FFmpegVideoConvertor',
            'preferedformat': 'mp4'       # Explicitly converts video to MP4 if necessary
        }
    ],
    'keepvideo': True                     # Keeps intermediate video files (if merging is needed)
}

# Download video from YouTube
with yt_dlp.YoutubeDL(ydl_opts) as ydl:
    ydl.download(['https://www.youtube.com/watch?v=l3NJNFmg09k'])

[youtube] Extracting URL: https://www.youtube.com/watch?v=l3NJNFmg09k
[youtube] l3NJNFmg09k: Downloading webpage
[youtube] l3NJNFmg09k: Downloading ios player API JSON
[youtube] l3NJNFmg09k: Downloading mweb player API JSON
[youtube] l3NJNFmg09k: Downloading player 89dfc5b3
[youtube] l3NJNFmg09k: Downloading m3u8 information
[info] l3NJNFmg09k: Downloading 1 format(s): 136+251
[download] Destination: Football match.f136.mp4
[download] 100% of   11.07MiB in 00:00:00 at 20.94MiB/s  
[download] Destination: Football match.f251.webm
[download] 100% of  497.43KiB in 00:00:00 at 8.35MiB/s   
[Merger] Merging formats into "Football match.mp4"
[VideoConvertor] Not converting media file "Football match.mp4"; already is in target format mp4


### Setting Up the Environment and Importing Libraries

This section sets up the environment to work with TensorFlow 1.x and imports necessary libraries for video processing and tracking.

1. **Disabling TensorFlow 2.x Behavior**:  
   Since the code is designed to work with TensorFlow 1.x, we begin by importing `tensorflow.compat.v1` and disabling TensorFlow 2.x behavior using `tf.disable_v2_behavior()`. This ensures compatibility with older models and methods built for TensorFlow 1.x.

2. **Importing Libraries**:  
   Several libraries are imported for use throughout the implementation:
   - `os` and `sys`: For handling system paths and operating system interactions.
   - `torch`: The PyTorch library, imported here and used in the next cell to check for GPU availability.
   - `numpy`: A fundamental library for numerical computing, used for handling arrays and mathematical operations.
   - `cv2`: OpenCV library for computer vision tasks such as image and video processing.
   - `tqdm`: A library for showing progress bars in loops, used here for tracking the progress of video frame processing.
   - `scipy.linalg`: Part of the SciPy library, used for linear algebra operations.
   - `scipy.optimize.linear_sum_assignment`: Solves the linear sum assignment problem; it is used later in `linear_assignment_fn` to match detections to tracks.

These libraries form the basis of the operations required to process video frames, track objects, and utilize deep learning models.


In [3]:
import tensorflow.compat.v1 as tf
tf.disable_v2_behavior()

# Import libraries
import os
import sys
import torch
import numpy as np
import cv2
from tqdm import tqdm
import scipy.linalg
from scipy.optimize import linear_sum_assignment

Instructions for updating:
non-resource variables are not supported in the long term


### Setting Up GPU for Computation

This section of the code handles the setup for utilizing the GPU for deep learning tasks, specifically with PyTorch.

1. **Checking GPU Availability**:  
   The code first checks if a GPU is available using the `torch.cuda.is_available()` function from the PyTorch library. If a GPU is detected, the code proceeds to configure the environment to use the first available GPU.

2. **Configuring CUDA for GPU**:  
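   If a GPU is available, the environment variable `CUDA_VISIBLE_DEVICES` is set to `"0"`, which restricts CUDA-aware libraries such as PyTorch to the GPU with index 0. This variable is only reliably honored when it is set before CUDA is initialized, so on a single-GPU machine the assignment mainly documents intent. The code also prints the name of the GPU in use via `torch.cuda.get_device_name(0)`.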

3. **Fallback to CPU**:  
   If no GPU is available, the code prints a message indicating that CUDA is not available and that the computation will fall back to using the CPU for processing.

This setup is crucial for ensuring that the code runs efficiently by utilizing hardware acceleration when available, while still being functional on systems without a GPU.


In [4]:
import os
import torch

# Automatically select the first available GPU
if torch.cuda.is_available():
    os.environ["CUDA_VISIBLE_DEVICES"] = "0"  # Use the first GPU
    print(f"CUDA is available. Using GPU: {torch.cuda.get_device_name(0)}")
else:
    print("CUDA is not available. Falling back to CPU.")

CUDA is available. Using GPU: Tesla T4


### Detection Class for Object Detection

The `Detection` class represents a single object detection: its bounding box, confidence score, class label, and the appearance feature vector extracted by a deep learning model. These detections are what the tracker associates across video frames.

1. **Initialization (`__init__`)**:  
   The constructor initializes an object with four key attributes:
   - `tlwh`: Bounding box coordinates in the format (top-left x, top-left y, width, height).
   - `confidence`: The confidence score associated with the detection.
   - `class_name`: The class label of the detected object.
   - `feature`: A feature vector that describes the appearance of the object, which is used for matching in tracking.

2. **Class Name Retrieval (`get_class`)**:  
   This method returns the class name (label) of the detected object.

3. **Bounding Box Conversion (`to_tlbr`)**:  
   Converts the bounding box from the format `(top-left x, top-left y, width, height)` to the format `(min x, min y, max x, max y)` for easier processing.

4. **Bounding Box Conversion to `xyah` Format (`to_xyah`)**:  
   Converts the bounding box into a different format: `(center x, center y, aspect ratio, height)` where the aspect ratio is the width divided by the height. This is useful for some tracking algorithms.

5. **Conversion to Dictionary (`to_dict`)**:  
   Converts the detection attributes (bounding box, confidence, class name, feature) to a dictionary format, which is useful for serialization or logging.

6. **Confidence Update (`set_confidence`)**:  
   Updates the confidence score of the detection. This allows for dynamic changes in confidence during processing.

7. **Adjust Bounding Box (`adjust_bbox`)**:  
   Scales the bounding box coordinates by a given factor. This is useful if the image size changes or for other scaling purposes in video frames.

This class is central to storing and managing detections, especially in multi-object tracking scenarios where accurate bounding box adjustments and feature extraction are critical.
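
The two box conversions can be checked by hand on a small example. The standalone functions below are a sketch that mirrors the arithmetic described for `to_tlbr` and `to_xyah`; the function names and the sample box are assumptions for illustration.

```python
import numpy as np


def tlwh_to_tlbr(tlwh):
    """(top-left x, top-left y, width, height) -> (min x, min y, max x, max y)."""
    box = np.asarray(tlwh, dtype=np.float32).copy()
    box[2:] += box[:2]  # bottom-right = top-left + size
    return box


def tlwh_to_xyah(tlwh):
    """(top-left x, top-left y, width, height) -> (center x, center y, w/h, height)."""
    box = np.asarray(tlwh, dtype=np.float32).copy()
    box[:2] += box[2:] / 2  # shift the corner to the box center
    box[2] /= box[3]        # replace width with the aspect ratio
    return box


tlwh = [10, 20, 30, 60]
tlbr = tlwh_to_tlbr(tlwh)  # [10, 20, 40, 80]
xyah = tlwh_to_xyah(tlwh)  # [25, 50, 0.5, 60]
print(tlbr, xyah)
```

The `xyah` form is the one the Kalman filter in this notebook uses as its measurement, which is why its noise terms are scaled by the fourth component (the height).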


In [5]:
from ultralytics import YOLO

class Detection:
    def __init__(self, tlwh, confidence, class_name, feature):
        self.tlwh = np.asarray(tlwh, dtype=np.float32)
        self.confidence = float(confidence)
        self.class_name = class_name
        self.feature = np.asarray(feature, dtype=np.float32)

    def get_class(self):
        return self.class_name

    def to_tlbr(self):
        """Convert to (min x, min y, max x, max y)."""
        ret = self.tlwh.copy()
        ret[2:] += ret[:2]
        return ret

    def to_xyah(self):
        """Convert to (center x, center y, aspect ratio, height)."""
        ret = self.tlwh.copy()
        ret[:2] += ret[2:] / 2
        ret[2] = ret[2] / ret[3]
        return ret

    def to_dict(self):
        """Convert detection to dictionary format."""
        return {'tlwh': self.tlwh.tolist(), 'confidence': self.confidence, 'class_name': self.class_name, 'feature': self.feature.tolist()}

    def set_confidence(self, confidence):
        """Update the detection's confidence."""
        self.confidence = float(confidence)

    def adjust_bbox(self, scale):
        """Scale the bounding box by a factor."""
        self.tlwh[:2] *= scale
        self.tlwh[2:] *= scale

Creating new Ultralytics Settings v0.0.6 file ✅ 
View Ultralytics Settings with 'yolo settings' or at '/root/.config/Ultralytics/settings.json'
Update Settings with 'yolo settings key=value', i.e. 'yolo settings runs_dir=path/to/dir'. For help see https://docs.ultralytics.com/quickstart/#ultralytics-settings.


### Kalman Filter for Object Tracking

The `KalmanFilter` class implements a standard Kalman filter used for tracking objects based on measurements such as bounding boxes. It estimates the state of the object (position, velocity) and updates the estimate based on new measurements. Here's a breakdown of the key components:

1. **Initialization (`__init__`)**:  
   The constructor builds the motion (state transition) and measurement matrices, which define how the state evolves between frames and how it is observed. It also sets the weights that scale the position and velocity noise relative to the bounding box height.

2. **State Initialization (`initiate`)**:  
   This method initializes the filter with an initial measurement (typically from the object detector). It sets up the initial state (position and velocity) and the corresponding uncertainty (covariance matrix). The covariance represents the uncertainty in the initial estimate.

3. **Prediction (`predict`)**:  
   The `predict` method computes the predicted state (position and velocity) and its uncertainty (covariance) for the next time step. It uses the motion model (a transition matrix) to predict the object's new state. The method also accounts for the uncertainty of the object's movement.

4. **Projection (`project`)**:  
   This method projects the predicted state into the measurement space. It calculates the expected measurement based on the predicted state and the corresponding uncertainty, which helps in comparing it with actual measurements to correct the state.

5. **Update (`update`)**:  
   The `update` method corrects the predicted state using a new measurement. It computes the Kalman gain, which determines how much weight the measurement receives relative to the prediction. The innovation (the difference between the measurement and the state projected into measurement space) is multiplied by the gain and added to the mean, and the covariance is reduced to reflect the improved estimate.

6. **Gating Distance (`gating_distance`)**:  
   This method computes the squared Mahalanobis distance between the projected track state and each candidate measurement, that is, the difference between predicted and actual measurements scaled by the prediction uncertainty. With `only_position=True`, only the box center is compared. The method only returns the distances; a caller can compare them against a threshold and reject associations whose distance is too large.

This Kalman filter is essential for maintaining a robust and real-time estimate of an object's state, even in noisy environments with varying levels of uncertainty, such as video tracking.
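
To show how a gating distance might be used, the sketch below computes squared Mahalanobis distances for two candidate measurements and compares them with a chi-square threshold. The 95% quantile with four degrees of freedom is an assumption made here for illustration, as are the helper name and the numbers; the notebook's own threshold is not shown in this section.

```python
import numpy as np
import scipy.linalg
from scipy.stats import chi2


def squared_mahalanobis(mean, cov, measurements):
    """Squared Mahalanobis distance of each measurement row from (mean, cov)."""
    chol = np.linalg.cholesky(cov)
    d = (measurements - mean).T
    z = scipy.linalg.solve_triangular(chol, d, lower=True)
    return np.sum(z * z, axis=0)


# Projected track state in (center x, center y, aspect ratio, height) space
mean = np.array([100.0, 50.0, 0.5, 80.0])
cov = np.diag([16.0, 16.0, 0.01, 16.0])

measurements = np.array([
    [102.0, 51.0, 0.5, 81.0],  # close to the prediction
    [140.0, 90.0, 0.5, 80.0],  # far from the prediction
])

gate = chi2.ppf(0.95, df=4)  # roughly 9.49 for four measured dimensions
d2 = squared_mahalanobis(mean, cov, measurements)
admissible = d2 <= gate
print(d2, admissible)  # [0.375, 200.0] and [True, False]
```

Because the distance is scaled by the covariance, the same pixel offset counts as a larger deviation for a confident track than for an uncertain one.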


In [6]:
import numpy as np
import scipy.linalg

class KalmanFilter:
    def __init__(self):
        ndim, dt = 4, 1.

        # Motion model and measurement matrix
        self._motion_mat = np.eye(2 * ndim, 2 * ndim)
        for i in range(ndim):
            self._motion_mat[i, ndim + i] = dt

        self._update_mat = np.eye(ndim, 2 * ndim)

        # Noise weights for position and velocity (scaled by box height when used)
        self._std_weight_position = 1. / 20
        self._std_weight_velocity = 1. / 160

    def initiate(self, measurement):
        """Initialize state with measurement."""
        mean_pos = measurement
        mean_vel = np.zeros_like(mean_pos)
        mean = np.r_[mean_pos, mean_vel]

        # Standard deviation based on measurement
        std = [
            2 * self._std_weight_position * measurement[3],  # center x
            2 * self._std_weight_position * measurement[3],  # center y
            1e-2,  # aspect ratio
            2 * self._std_weight_position * measurement[3],  # height
            10 * self._std_weight_velocity * measurement[3],  # x velocity
            10 * self._std_weight_velocity * measurement[3],  # y velocity
            1e-5,  # aspect ratio velocity
            10 * self._std_weight_velocity * measurement[3]]  # height velocity

        covariance = np.diag(np.square(std))
        return mean, covariance

    def predict(self, mean, covariance):
        """Predict next state."""
        std_pos = [
            self._std_weight_position * mean[3],
            self._std_weight_position * mean[3],
            1e-2,
            self._std_weight_position * mean[3]]

        std_vel = [
            self._std_weight_velocity * mean[3],
            self._std_weight_velocity * mean[3],
            1e-5,
            self._std_weight_velocity * mean[3]]

        motion_cov = np.diag(np.square(np.r_[std_pos, std_vel]))

        mean = np.dot(self._motion_mat, mean)
        covariance = np.linalg.multi_dot((self._motion_mat, covariance, self._motion_mat.T)) + motion_cov

        return mean, covariance

    def project(self, mean, covariance):
        """Project state to measurement space."""
        std = [
            self._std_weight_position * mean[3],
            self._std_weight_position * mean[3],
            1e-1,
            self._std_weight_position * mean[3]]

        innovation_cov = np.diag(np.square(std))

        mean = np.dot(self._update_mat, mean)
        covariance = np.linalg.multi_dot((self._update_mat, covariance, self._update_mat.T))

        return mean, covariance + innovation_cov

    def update(self, mean, covariance, measurement):
        """Correct the state with new measurement."""
        projected_mean, projected_cov = self.project(mean, covariance)

        chol_factor, lower = scipy.linalg.cho_factor(
            projected_cov, lower=True, check_finite=False)

        kalman_gain = scipy.linalg.cho_solve(
            (chol_factor, lower), np.dot(covariance, self._update_mat.T).T,
            check_finite=False).T

        innovation = measurement - projected_mean

        new_mean = mean + np.dot(innovation, kalman_gain.T)
        new_covariance = covariance - np.linalg.multi_dot((
            kalman_gain, projected_cov, kalman_gain.T))

        return new_mean, new_covariance

    def gating_distance(self, mean, covariance, measurements, only_position=False):
        """Calculate gating distance between predicted state and measurements."""
        mean, covariance = self.project(mean, covariance)

        if only_position:
            mean, covariance = mean[:2], covariance[:2, :2]
            measurements = measurements[:, :2]

        cholesky_factor = np.linalg.cholesky(covariance)
        d = measurements - mean
        z = scipy.linalg.solve_triangular(
            cholesky_factor, d.T, lower=True, check_finite=False, overwrite_b=True)

        squared_maha = np.sum(z * z, axis=0)
        return squared_maha

### Linear Assignment and Intersection over Union (IoU) for Object Tracking

This code snippet contains two key functions related to object tracking: `linear_assignment_fn` and `iou`. These functions help in solving the data association problem and evaluating the overlap between bounding boxes.

1. **Linear Assignment (`linear_assignment_fn`)**:
   - **Purpose**: This function solves the linear assignment problem, which is crucial for associating detected objects in one frame with the tracked objects in another. It ensures that the best matches are made based on a cost matrix.
   - **How it works**:
     - The `linear_sum_assignment` function from `scipy.optimize` is used to find the optimal matching between two sets of objects (e.g., detections and tracked objects).
     - The function returns the matched pairs, as well as unmatched detections and tracked objects. If the cost of the match exceeds a predefined threshold (`max_cost`), it is treated as an unmatched pair.
     - The cost matrix represents how poorly each detection matches each tracked object, so lower is better (e.g., `1 - IoU` or a feature distance).

2. **Intersection over Union (IoU) (`iou`)**:
   - **Purpose**: The `iou` function calculates the intersection over union (IoU) between a given bounding box and a set of candidate bounding boxes.
   - **How it works**:
     - The function computes the overlap between two bounding boxes by finding their intersection area and dividing it by the union area.
     - It works with a bounding box and a matrix of candidate bounding boxes, calculating the IoU for each candidate.
     - IoU is a standard metric for evaluating object detection and tracking performance. A high IoU indicates a good match, while a low IoU means the bounding boxes are not aligned.

Together, these functions play a critical role in object tracking by helping to assign detections to existing tracks and evaluate how well the detections correspond to the tracked objects based on spatial overlap (IoU).
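
A worked IoU example helps to see how overlap turns into an assignment cost. The sketch below is a scalar version written for illustration (the helper `iou_tlwh` and the boxes are assumptions); it uses the same `(x, y, width, height)` box format as the `iou` function in this notebook and converts overlap to cost as `1 - IoU`.

```python
import numpy as np


def iou_tlwh(a, b):
    """IoU of two boxes given as (top-left x, top-left y, width, height)."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    # Width and height of the overlap rectangle, clamped at zero
    iw = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    ih = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = iw * ih
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


track_box = (0.0, 0.0, 10.0, 10.0)
detections = [
    (5.0, 5.0, 10.0, 10.0),    # partial overlap: 25 / 175
    (0.0, 0.0, 10.0, 10.0),    # identical box
    (50.0, 50.0, 10.0, 10.0),  # no overlap
]

ious = np.array([iou_tlwh(track_box, d) for d in detections])
costs = 1.0 - ious  # lower cost means a better match
print(ious, costs)
```

For the first detection the intersection is a 5 by 5 square, so the IoU is 25 / (100 + 100 - 25), about 0.143, and the cost is about 0.857. Passing such costs to `linear_assignment_fn` with a `max_cost` below 1 would reject pairs that barely overlap.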


In [7]:
from scipy.optimize import linear_sum_assignment
import numpy as np

def linear_assignment_fn(cost_matrix, max_cost=1e+5):
    """
    Solve the linear assignment problem and return matches, unmatched a and b indices.
    """
    row_ind, col_ind = linear_sum_assignment(cost_matrix)
    matches = []
    unmatched_a = [i for i in range(cost_matrix.shape[0]) if i not in row_ind]
    unmatched_b = [j for j in range(cost_matrix.shape[1]) if j not in col_ind]

    for i, j in zip(row_ind, col_ind):
        if cost_matrix[i, j] > max_cost:
            unmatched_a.append(i)
            unmatched_b.append(j)
        else:
            matches.append((i, j))

    return matches, unmatched_a, unmatched_b

def iou(bbox, candidates):
    """
    Calculate intersection over union between a bounding box and candidate boxes.
    """
    # Calculate corners of the bounding boxes
    bbox_tl, bbox_br = bbox[:2], bbox[:2] + bbox[2:]
    candidates_tl = candidates[:, :2]
    candidates_br = candidates[:, :2] + candidates[:, 2:]

    # Compute intersection area
    tl = np.maximum(bbox_tl, candidates_tl)
    br = np.minimum(bbox_br, candidates_br)
    wh = np.maximum(0., br - tl)

    area_intersection = wh[:, 0] * wh[:, 1]
    area_bbox = bbox[2] * bbox[3]
    area_candidates = candidates[:, 2] * candidates[:, 3]

    # Return IoU for each candidate
    return area_intersection / (area_bbox + area_candidates - area_intersection)

### Nearest Neighbor Distance Metric for Object Tracking

This code defines the `NearestNeighborDistanceMetric` class, which is used for computing the nearest neighbor distance between target objects and detected features during tracking. The class supports both Euclidean and cosine distance metrics for evaluating the similarity between feature vectors.

#### Key Features:
1. **Metric Selection**:
   - The class supports two distance metrics for matching:
     - **Euclidean Distance**: Measures the straight-line distance between feature vectors.
     - **Cosine Distance**: One minus the cosine similarity between feature vectors, so it compares their direction rather than their magnitude; often used in high-dimensional feature spaces.
   - The distance metric to use is specified during initialization (`"euclidean"` or `"cosine"`).

2. **Partial Fit (`partial_fit`)**:
   - This method updates the feature samples associated with each target. It adds the new features of the tracked targets and stores them for future matching.
   - If a **budget** is set, it limits the number of stored features per target by keeping only the most recent ones.

3. **Distance Calculation (`distance`)**:
   - This method computes a cost matrix between the target objects and detected features. The cost matrix represents the distance (or similarity) between each target and all the detected features based on the selected distance metric.

4. **Distance Functions**:
   - **Euclidean Distance** (`_nn_euclidean_distance`): Calculates the Euclidean distance between every stored sample of a target and every detected feature. For each detected feature, the minimum over the stored samples is returned.
   - **Cosine Distance** (`_nn_cosine_distance`): Computes the cosine distance, which is defined as one minus the cosine similarity. The vectors are normalized before the computation to ensure the comparison is based purely on direction rather than magnitude.

#### Usage:
- This class is crucial in tracking algorithms where multiple targets need to be matched with detection results over time. It helps in assigning detections to existing tracks based on the similarity of their feature representations, which is particularly useful in object tracking systems such as Deep SORT.
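
The nearest neighbor cosine distance can be tried on toy vectors. This standalone sketch follows the description above; the function name, the two-dimensional "features", and the gallery are assumptions for illustration (real appearance features are much longer vectors).

```python
import numpy as np


def nn_cosine_distance(gallery, queries):
    """For each query, the smallest cosine distance to any gallery sample."""
    g = np.asarray(gallery, dtype=float)
    q = np.asarray(queries, dtype=float)
    # Normalize rows so the dot product equals the cosine similarity
    g = g / np.linalg.norm(g, axis=1, keepdims=True)
    q = q / np.linalg.norm(q, axis=1, keepdims=True)
    return (1.0 - g @ q.T).min(axis=0)


# Stored samples for one track and three newly detected features
gallery = [[1.0, 0.0], [0.0, 1.0]]
queries = [[2.0, 0.0], [1.0, 1.0], [-1.0, 0.0]]

dist = nn_cosine_distance(gallery, queries)
print(dist)  # approximately [0.0, 0.293, 1.0]
```

The first query points in the same direction as a stored sample, so its distance is zero regardless of its length. Taking the minimum over the gallery means a detection only needs to resemble one of the stored appearances of a track to obtain a low cost.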


In [8]:
class NearestNeighborDistanceMetric:
    """
    Computes the nearest neighbor distance for tracking, using either
    Euclidean or cosine distance metrics.
    """
    def __init__(self, metric, matching_threshold, budget=None):
        self.matching_threshold = matching_threshold
        self.budget = budget
        self.samples = {}
        # Select distance metric
        self._metric = self._nn_euclidean_distance if metric == "euclidean" else self._nn_cosine_distance

    def partial_fit(self, features, targets, active_targets):
        """Add new features for active targets."""
        for feature, target in zip(features, targets):
            # Update the sample list for the target
            self.samples.setdefault(target, []).append(feature)
            if self.budget:
                self.samples[target] = self.samples[target][-self.budget:]  # Limit the sample size
        # Keep only active target samples
        self.samples = {k: self.samples[k] for k in active_targets}

    def distance(self, features, targets):
        """Compute distance matrix between features and targets."""
        cost_matrix = np.zeros((len(targets), len(features)))
        for i, target in enumerate(targets):
            cost_matrix[i, :] = self._metric(self.samples[target], features)
        return cost_matrix

    def _nn_euclidean_distance(self, x, y):
        """Smallest Euclidean distance from each row of y to any row of x."""
        x, y = np.asarray(x), np.asarray(y)  # samples are stored as a Python list
        return np.linalg.norm(x[:, np.newaxis] - y[np.newaxis, :], axis=2).min(axis=0)

    def _nn_cosine_distance(self, x, y):
        """Smallest cosine distance from each row of y to any row of x."""
        x, y = np.asarray(x), np.asarray(y)  # samples are stored as a Python list
        x = x / np.linalg.norm(x, axis=1, keepdims=True)
        y = y / np.linalg.norm(y, axis=1, keepdims=True)
        return (1. - np.dot(x, y.T)).min(axis=0)

### Track Class for Object Tracking

The `Track` class is designed to represent a single tracked object in an object tracking system. It manages the state of the object, such as its position, velocity, and other relevant features, while interacting with a Kalman filter to predict and update the object's state over time.

#### Key Features:
1. **Track Initialization**:
   - Each track is initialized with a **mean state** (position, aspect ratio, height, and velocity) and **covariance** (uncertainty in the state).
   - Tracks are assigned a unique **track ID** and have two key parameters: `n_init` (the number of successful updates required for a track to become confirmed) and `max_age` (the maximum number of missed frames before a track is deleted).

2. **Track State Management**:
   - A track starts as `'Tentative'`, meaning it has just been created from a detection and is not yet trusted. Once its hit count reaches `n_init`, it transitions to `'Confirmed'`.
   - A track is marked `'Deleted'` if it misses a detection while still tentative, or if its `time_since_update` exceeds `max_age`.

3. **Track Prediction and Update**:
   - The `predict` method uses the Kalman filter (`kf`) to propagate the track's state forward by one time step, updating the position and velocity of the object.
   - The `update` method refines the predicted state with a matched detection via a Kalman filter update step, appends the detection's feature to the track, increments `hits`, and resets `time_since_update` to zero.

4. **Track Deletion and Missed Updates**:
   - The `mark_missed` method is called for every track that received no detection in the current frame. A tentative track is deleted immediately; a confirmed track is deleted only once its `time_since_update` exceeds `max_age`.
   - This lets confirmed tracks survive short occlusions while spurious tentative tracks are discarded quickly.

5. **Bounding Box Conversion**:
   - The track state is stored as an 8-dimensional vector (center x, center y, aspect ratio, height, and their velocities). The `to_tlwh` method converts its first four components to a bounding box in the format `(top-left x, top-left y, width, height)`.
   - The `to_tlbr` method converts the state to a bounding box in the format `(min x, min y, max x, max y)`.

6. **Track Features**:
   - Tracks maintain a list of features (`self.features`), which are updated with the features of the detected objects as new detections come in.

7. **Track Status**:
   - The track's state can be queried to determine if it is `'Tentative'`, `'Confirmed'`, or `'Deleted'` using the methods `is_tentative()`, `is_confirmed()`, and `is_deleted()`.
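
The two bounding-box conversions can be checked by hand with a small example. This is a plain-Python sketch of the same arithmetic that `to_tlwh` and `to_tlbr` perform; the function names `xyah_to_tlwh` and `tlwh_to_tlbr` are illustrative.

```python
def xyah_to_tlwh(state):
    """Convert (center x, center y, aspect ratio, height) to (left, top, width, height)."""
    cx, cy, aspect, height = state[:4]
    width = aspect * height
    return [cx - width / 2, cy - height / 2, width, height]

def tlwh_to_tlbr(box):
    """Convert (left, top, width, height) to (min x, min y, max x, max y)."""
    left, top, width, height = box
    return [left, top, left + width, top + height]

# The last four entries are velocities and are ignored by the conversion
state = [100.0, 50.0, 0.5, 40.0, 0.0, 0.0, 0.0, 0.0]
tlwh = xyah_to_tlwh(state)   # width = 0.5 * 40 = 20
tlbr = tlwh_to_tlbr(tlwh)
```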

#### Usage:
- The `Track` class is essential for maintaining the state of tracked objects over time in object tracking applications. It works closely with the Kalman filter to predict future states and update the track when new detections are available. The class ensures that only valid tracks are maintained, and it supports object tracking in scenarios where objects may be temporarily lost or occluded.
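
The track lifecycle can be replayed without a Kalman filter. The sketch below mirrors the counters used by `predict`, `update`, and `mark_missed`; the `simulate` helper is hypothetical, and `max_age=2` is chosen only to keep the example short.

```python
def simulate(matches, n_init=3, max_age=2):
    """Replay per-frame match flags for a track that was created on frame 0."""
    state, hits, time_since_update = "Tentative", 1, 0
    history = [state]
    for matched in matches:
        time_since_update += 1              # predict step
        if matched:
            hits += 1                       # update step
            time_since_update = 0
            if state == "Tentative" and hits >= n_init:
                state = "Confirmed"
        elif state == "Tentative" or time_since_update > max_age:
            state = "Deleted"               # mark_missed step
        history.append(state)
        if state == "Deleted":
            break
    return history

confirmed_then_lost = simulate([True, True, False, False, False])
dropped_early = simulate([True, False])
```

The first track is confirmed on its third hit and then survives two missed frames before being deleted on the third. The second misses a frame while still tentative and is deleted at once.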


In [9]:
class Track:
    """
    Represents a target track with state `(x, y, aspect_ratio, height)` and velocities.
    """
    def __init__(self, mean, covariance, track_id, n_init, max_age, feature=None, class_name=None):
        # Initialize track state, covariance, and metadata
        self.mean = mean
        self.covariance = covariance
        self.track_id = track_id
        self.hits = 1  # Number of successful updates
        self.age = 1  # Age of the track
        self.time_since_update = 0  # Time since last update

        # Initial state is 'Tentative'
        self.state = 'Tentative'
        self.features = [feature] if feature is not None else []

        # Track initialization parameters
        self._n_init = n_init
        self._max_age = max_age
        self.class_name = class_name

    def predict(self, kf):
        """Propagate the state forward by one time step."""
        self.mean, self.covariance = kf.predict(self.mean, self.covariance)
        self.age += 1
        self.time_since_update += 1

    def update(self, kf, detection):
        """Update track with a new detection and perform Kalman filter update."""
        self.mean, self.covariance = kf.update(self.mean, self.covariance, detection.to_xyah())
        self.features.append(detection.feature)

        # Update the track's hit count and reset time since last update
        self.hits += 1
        self.time_since_update = 0

        # If it's still tentative and has enough hits, make it confirmed
        if self.state == 'Tentative' and self.hits >= self._n_init:
            self.state = 'Confirmed'

    def mark_missed(self):
        """Mark track as missed if conditions are met."""
        if self.state == 'Tentative' or self.time_since_update > self._max_age:
            self.state = 'Deleted'

    def is_tentative(self):
        """Check if track is in tentative state."""
        return self.state == 'Tentative'

    def is_confirmed(self):
        """Check if track is confirmed."""
        return self.state == 'Confirmed'

    def is_deleted(self):
        """Check if track is deleted."""
        return self.state == 'Deleted'

    def to_tlwh(self):
        """Convert track's state to bounding box format (top left x, top left y, width, height)."""
        ret = self.mean[:4].copy()
        ret[2] *= ret[3]  # Convert aspect ratio to width
        ret[:2] -= ret[2:] / 2  # Convert center to top-left coordinates
        return ret

    def to_tlbr(self):
        """Convert track's state to bounding box format (min x, min y, max x, max y)."""
        ret = self.to_tlwh()
        ret[2:] += ret[:2]  # Convert width and height to bottom-right coordinates
        return ret

    def get_class(self):
        """Return the class name of the tracked object."""
        return self.class_name

# Tracker Class for Multi-Object Tracking

The `Tracker` class is designed for **multi-object tracking** using a combination of **Kalman filtering** for state prediction and a **distance metric** (such as Euclidean or cosine) to match tracks to new detections.

## Key Features and Functionality:

### 1. Track Initialization:
- The tracker uses a **Kalman filter** (`self.kf`) for predicting and updating the state of each track.
- Tracks are initiated using new detections and assigned a unique `track_id` from `self._next_id`.

### 2. State Prediction:
- The `predict` method propagates the state of all active tracks forward by one time step using the Kalman filter. This is essential for predicting where each object might be in the next frame.
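
As an illustration of what the prediction step does to the state mean, the sketch below assumes the constant-velocity motion model of the reference Deep SORT implementation; the notebook's `KalmanFilter` class is defined elsewhere and may differ. Only the mean is propagated here, and the covariance update is omitted. `predict_mean` is a hypothetical helper.

```python
def predict_mean(mean, dt=1.0):
    """Advance an 8-D state (cx, cy, a, h, vcx, vcy, va, vh) by one time step."""
    position, velocity = mean[:4], mean[4:]
    # Constant-velocity model: each position component moves by its velocity
    new_position = [p + dt * v for p, v in zip(position, velocity)]
    return new_position + list(velocity)

mean = [100.0, 50.0, 0.5, 40.0, 2.0, -1.0, 0.0, 0.5]
predicted = predict_mean(mean)
```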

### 3. Track Update:
- The `update` method takes new detections as input and:
  - **Matches tracks to detections** based on a distance metric.
  - **Updates matched tracks** using the Kalman filter.
  - **Marks unmatched tracks as missed** (i.e., they haven't been updated in the current frame).
  - **Initializes new tracks** for unmatched detections.

### 4. Track Matching:
- The `_match` method performs the matching of tracks to detections using the following steps:
  1. **Matching confirmed tracks**: Builds a cost matrix from the appearance distance between each track's stored feature vectors and the detection features (Euclidean or cosine, depending on the metric), then gates it with the Kalman filter so that implausible pairs are excluded.
  2. **Matching remaining tracks**: Uses **Intersection over Union (IoU)** as the cost for unconfirmed tracks and for confirmed tracks that went unmatched in step 1 but were updated in the previous frame (`time_since_update == 1`). These tracks are matched against the detections left over from step 1 based on bounding box overlap.
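
To make the IoU cost concrete, here is a scalar sketch for boxes in `(left, top, width, height)` format. The notebook's own `iou` helper is defined elsewhere; `iou_tlwh` below is an illustrative stand-in, not that function.

```python
def iou_tlwh(box_a, box_b):
    """Intersection over Union of two boxes given as (left, top, width, height)."""
    ax1, ay1, aw, ah = box_a
    bx1, by1, bw, bh = box_b
    ax2, ay2, bx2, by2 = ax1 + aw, ay1 + ah, bx1 + bw, by1 + bh
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    intersection = inter_w * inter_h
    union = aw * ah + bw * bh - intersection
    return intersection / union if union > 0 else 0.0

track_box = (0.0, 0.0, 10.0, 10.0)
detections = [(5.0, 0.0, 10.0, 10.0), (20.0, 20.0, 5.0, 5.0)]
# Cost is 1 - IoU: 0 for a perfect overlap, 1 for disjoint boxes
iou_costs = [1.0 - iou_tlwh(track_box, det) for det in detections]
```

The first detection overlaps half of the track box (IoU of 1/3, cost 2/3); the second does not overlap at all (cost 1).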

### 5. Cost Matrix:
- A **cost matrix** is calculated to evaluate how well tracks match detections:
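  - **For confirmed tracks**, the appearance cost matrix is gated: entries whose Kalman filter gating distance exceeds the chi-square threshold (`9.4877`) are set to infinity so they cannot be matched.
  - **For unconfirmed and recently missed tracks**, the **IoU cost matrix** (`1 - IoU`) is computed.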
  - **Matching** is done using the **Hungarian algorithm** (`linear_sum_assignment`).
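
The thresholded assignment step can be illustrated on a tiny cost matrix. To keep the sketch dependency-free, it uses a brute-force search over permutations as a stand-in for the Hungarian algorithm; it finds the same minimum-cost assignment but is only practical for very small matrices. The `assign` helper is hypothetical, and unlike the tracker it does not clamp large costs before solving.

```python
from itertools import permutations

def assign(cost_matrix, max_cost):
    """Brute-force minimum-cost assignment (stand-in for the Hungarian algorithm).

    Assumes there are at least as many columns as rows. Pairs whose cost
    exceeds `max_cost` are rejected after the assignment is solved.
    """
    n_rows, n_cols = len(cost_matrix), len(cost_matrix[0])
    best = min(
        permutations(range(n_cols), n_rows),
        key=lambda cols: sum(cost_matrix[r][c] for r, c in enumerate(cols)),
    )
    matches = [(r, c) for r, c in enumerate(best) if cost_matrix[r][c] <= max_cost]
    matched_rows = {r for r, _ in matches}
    matched_cols = {c for _, c in matches}
    unmatched_rows = [r for r in range(n_rows) if r not in matched_rows]
    unmatched_cols = [c for c in range(n_cols) if c not in matched_cols]
    return matches, unmatched_rows, unmatched_cols

# Rows are tracks, columns are detections
costs = [
    [0.10, 0.90, 0.80],
    [0.85, 0.95, 0.60],
]
matches, unmatched_tracks, unmatched_detections = assign(costs, max_cost=0.4)
```

Track 0 is matched to detection 0. Track 1 is assigned detection 2 by the solver, but the cost of 0.60 exceeds the threshold, so both end up unmatched.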


In [10]:
class Tracker:
    """
    Multi-target tracker that uses Kalman Filter and a distance metric for matching targets.
    """
    def __init__(self, metric, max_iou_distance=0.7, max_age=30, n_init=3):
        self.metric = metric
        self.max_iou_distance = max_iou_distance
        self.max_age = max_age
        self.n_init = n_init

        self.kf = KalmanFilter()  # Kalman filter instance for state propagation
        self.tracks = []  # List to hold active tracks
        self._next_id = 1  # Unique ID for each track

    def predict(self):
        """Propagate the state of all tracks one time step forward."""
        for track in self.tracks:
            track.predict(self.kf)

    def update(self, detections):
        """Update track states with new detections."""
        matches, unmatched_tracks, unmatched_detections = self._match(detections)

        # Update matched tracks
        for track_idx, detection_idx in matches:
            self.tracks[track_idx].update(self.kf, detections[detection_idx])

        # Mark unmatched tracks as missed
        for track_idx in unmatched_tracks:
            self.tracks[track_idx].mark_missed()

        # Initialize new tracks for unmatched detections
        for detection_idx in unmatched_detections:
            self._initiate_track(detections[detection_idx])

        # Clean up deleted tracks
        self.tracks = [t for t in self.tracks if not t.is_deleted()]

        # Update the metric with new features of confirmed tracks
        active_targets = [t.track_id for t in self.tracks if t.is_confirmed()]
        features, targets = [], []
        for track in self.tracks:
            if track.is_confirmed():
                features.extend(track.features)
                targets.extend([track.track_id] * len(track.features))
                track.features = []  # Clear features for the next cycle
        self.metric.partial_fit(np.asarray(features), np.asarray(targets), active_targets)

    def _match(self, detections):
        """Match tracks with detections using the cost matrix."""
        def gated_metric(tracks, dets, track_indices, detection_indices):
            features = np.array([dets[i].feature for i in detection_indices])
            targets = np.array([tracks[i].track_id for i in track_indices])
            cost_matrix = self.metric.distance(features, targets)
            return self._gate_cost_matrix(cost_matrix, tracks, dets, track_indices, detection_indices)

        # Separate confirmed and unconfirmed tracks
        confirmed_tracks = [i for i, t in enumerate(self.tracks) if t.is_confirmed()]
        unconfirmed_tracks = [i for i, t in enumerate(self.tracks) if not t.is_confirmed()]

        # Step 1: Match confirmed tracks
        matches_a, unmatched_tracks_a, unmatched_detections = self._min_cost_matching(
            gated_metric, self.tracks, detections, confirmed_tracks)

        # Step 2: Match unconfirmed tracks using IOU
        iou_track_candidates = unconfirmed_tracks + [k for k in unmatched_tracks_a if self.tracks[k].time_since_update == 1]
        unmatched_tracks_a = [k for k in unmatched_tracks_a if self.tracks[k].time_since_update != 1]
        matches_b, unmatched_tracks_b, unmatched_detections = self._min_cost_matching(
            self._iou_cost, self.tracks, detections, iou_track_candidates, unmatched_detections)

        # Combine matches and unmatched tracks
        matches = matches_a + matches_b
        unmatched_tracks = list(set(unmatched_tracks_a + unmatched_tracks_b))

        return matches, unmatched_tracks, unmatched_detections

    def _min_cost_matching(self, distance_metric, tracks, detections, track_indices, detection_indices=None):
        """Perform the Hungarian algorithm for matching tracks with detections."""
        if detection_indices is None:
            detection_indices = list(range(len(detections)))
        if len(detection_indices) == 0 or len(track_indices) == 0:
            return [], track_indices, detection_indices

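        # Compute the cost matrix; rows follow track_indices, columns follow detection_indices
        cost_matrix = distance_metric(tracks, detections, track_indices, detection_indices)
        # IoU matching uses max_iou_distance; appearance matching uses the metric's threshold
        max_distance = (self.max_iou_distance if distance_metric == self._iou_cost
                        else self.metric.matching_threshold)
        cost_matrix[cost_matrix > max_distance] = max_distance + 1e-5

        # Hungarian algorithm (SciPy); imported here so the method is self-contained
        from scipy.optimize import linear_sum_assignment
        rows, cols = linear_sum_assignment(cost_matrix)

        # Map matrix positions back to track/detection indices and drop rejected pairs
        matches = [(track_indices[r], detection_indices[c])
                   for r, c in zip(rows, cols) if cost_matrix[r, c] <= max_distance]
        matched_tracks = {t for t, _ in matches}
        matched_detections = {d for _, d in matches}
        unmatched_tracks = [t for t in track_indices if t not in matched_tracks]
        unmatched_detections = [d for d in detection_indices if d not in matched_detections]
        return matches, unmatched_tracks, unmatched_detections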

    def _gate_cost_matrix(self, cost_matrix, tracks, detections, track_indices, detection_indices):
        """Apply gating to the cost matrix based on Kalman filter distance."""
        gating_threshold = 9.4877  # Chi-squared 0.95 quantile for 4 degrees of freedom
        measurements = np.asarray([detections[i].to_xyah() for i in detection_indices])

        for row, track_idx in enumerate(track_indices):
            track = tracks[track_idx]
            gating_distance = self.kf.gating_distance(track.mean, track.covariance, measurements, only_position=False)
            cost_matrix[row, gating_distance > gating_threshold] = np.inf  # Invalid matches

        return cost_matrix

    def _iou_cost(self, tracks, detections, track_indices, detection_indices):
        """Calculate IOU-based cost for matching."""
        cost_matrix = np.zeros((len(track_indices), len(detection_indices)), dtype=np.float32)
        for row, track_idx in enumerate(track_indices):
            track = tracks[track_idx]
            bbox = track.to_tlwh()  # Track's bounding box
            candidates = np.array([detections[i].tlwh for i in detection_indices])  # Detections' bounding boxes
            cost_matrix[row, :] = 1. - iou(bbox, candidates)  # IOU cost

        return cost_matrix

    def _initiate_track(self, detection):
        """Initiate a new track with the given detection."""
        mean, covariance = self.kf.initiate(detection.to_xyah())
        class_name = detection.get_class()
        self.tracks.append(Track(mean, covariance, self._next_id, self.n_init, self.max_age, detection.feature, class_name))
        self._next_id += 1  # Increment track ID for the next track

# Overview of Box Encoder Creation for Feature Extraction

The code provides a system for extracting features from images using a pre-trained TensorFlow model. This system includes functions for creating a box encoder, processing images, and feeding them into the model to extract meaningful features. The key components of the system are:

## 1. **`create_box_encoder` Function**
This function is responsible for creating the box encoder. It takes a path to the pre-trained model (`model_filename`) and an optional batch size parameter. The function calls the `gdet_create_box_encoder` function to load the model and return an encoder that can be used to extract features from images.

### Key Features:
- Takes the `model_filename` (path to the pre-trained model) and `batch_size` (default value 32) as parameters.
- Returns an encoder function that can process batches of images and return their features.

## 2. **`gdet_create_box_encoder` Function**
This function loads the TensorFlow model from the specified file and returns an encoder function that can be used to encode images into feature vectors. The steps include:
- **Loading the Model**: It reads the model file and loads it into a TensorFlow graph.
- **Creating a Session**: A TensorFlow session is created to run the model.
- **Retrieving Tensors**: The function retrieves the input and output tensors of the model.
- **Preprocessing**: It prepares the images by resizing and normalizing them before feeding them into the model.
- **Encoding**: The function processes the images in batches and runs the model to get the feature vectors for each image.

### Key Features:
- Loads the pre-trained model and retrieves the required tensors.
- Preprocesses the images and runs them through the model.
- Returns an encoder function that yields one feature vector per input image as a NumPy array.
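
The batching logic of the encoder can be shown in isolation. The sketch below splits a list into consecutive batches the same way the encoder's loop does; `iter_batches` is a hypothetical helper, and integers stand in for image patches.

```python
import math

def iter_batches(items, batch_size):
    """Yield consecutive slices of `items` with at most `batch_size` elements each."""
    n_batches = math.ceil(len(items) / batch_size)
    for batch_idx in range(n_batches):
        start = batch_idx * batch_size
        # Slicing past the end is safe, so the last batch is simply shorter
        yield items[start:start + batch_size]

patches = list(range(70))  # stand-ins for 70 image patches
batches = list(iter_batches(patches, batch_size=32))
batch_sizes = [len(batch) for batch in batches]
```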

## 3. **`preprocess_image` Function**
This helper function is used to resize and normalize images before they are fed into the model. It ensures that all images are resized to the target shape expected by the model and that pixel values are normalized to the range `[0, 1]`.

### Key Features:
- **Resizing**: The image is resized to match the shape expected by the model.
- **Normalization**: The pixel values are normalized by dividing by `255.0`, converting them to the range `[0, 1]`.

## Summary:
The overall system allows for efficient feature extraction from images using a pre-trained TensorFlow model. The process includes:
1. Creating an encoder with `create_box_encoder`.
2. Loading the model and processing images in batches to extract features using `gdet_create_box_encoder`.
3. Preprocessing images to ensure they are in the correct format for the model using `preprocess_image`.

In this notebook, the resulting feature vectors serve as the appearance descriptors that `NearestNeighborDistanceMetric` compares when associating detections with existing tracks.


In [15]:
def create_box_encoder(model_filename, batch_size=32):
    """
    Create a box encoder for feature extraction.
    """
    encoder = gdet_create_box_encoder(model_filename, batch_size=batch_size)
    return encoder


def gdet_create_box_encoder(model_filename, input_name="images", output_name="features", batch_size=32):
    """
    Load model and return an encoder function.
    """
    # Load the TensorFlow model
    model = tf.Graph()
    with model.as_default():
        graph_def = tf.GraphDef()
        with tf.gfile.GFile(model_filename, 'rb') as fid:
            serialized_graph = fid.read()
            graph_def.ParseFromString(serialized_graph)
            tf.import_graph_def(graph_def, name='')

    # Start a session to run the graph
    session = tf.Session(graph=model)

    # Get the input and output tensors from the model
    input_tensor = model.get_tensor_by_name(f"{input_name}:0")
    output_tensor = model.get_tensor_by_name(f"{output_name}:0")

    # Extract the shape of the input tensor
    image_shape = input_tensor.get_shape().as_list()[1:]

    def encoder(images, batch_size=batch_size):
        """
        Encode images into features.
        """
        features = []
        n_batches = int(np.ceil(len(images) / batch_size))  # Number of batches
        for batch_idx in range(n_batches):
            start = batch_idx * batch_size
            end = min((batch_idx + 1) * batch_size, len(images))
            batch_images = images[start:end]

            # Preprocess the images before feeding them into the model
            batch_images = np.array([preprocess_image(im, image_shape) for im in batch_images])

            # Feed the batch into the model
            feed_dict = {input_tensor: batch_images}
            batch_features = session.run(output_tensor, feed_dict=feed_dict)

            # Collect the features
            features.extend(batch_features)

        return np.array(features)

    return encoder

def preprocess_image(image, image_shape):
    """
    Resize and normalize the image.
    """
    image = cv2.resize(image, (image_shape[1], image_shape[0]))  # Resize to target shape
    image = image.astype(np.float32) / 255.0  # Normalize to [0, 1]
    return image

# SportsTrackerWithDeepSort Class Overview

The `SportsTrackerWithDeepSort` class is designed to track objects (specifically people and sports balls) in a video frame using the YOLOv5 object detection model and the DeepSORT tracking algorithm. This class combines feature extraction, object detection, and object tracking in a multi-target tracking scenario. The components include YOLO for detection, a custom feature encoder for DeepSORT, and the tracking algorithm itself.

## Class Components

### 1. **`__init__` Method**
- **Purpose**: Initializes the model, metric, encoder, and tracking components.
- **Key Features**:
  - Uses a CUDA-compatible device if available for faster processing.
  - Loads a pre-trained YOLOv5 model (`yolov5m.pt`) for object detection.
  - Defines the classes of interest (`person` and `sports ball`).
  - Sets the confidence threshold and image size for YOLO processing.
  - Initializes a `NearestNeighborDistanceMetric` for DeepSORT and a `Tracker` for multi-object tracking.
  - Loads a pre-trained feature extraction model (`mars-small128.pb`) to generate features for object tracking.

### 2. **`load_yolo_model` Method**
- **Purpose**: Loads the YOLOv5 model using the `YOLO` class.
- **Key Features**:
  - Returns the YOLOv5 model to be used for detecting objects in frames.

### 3. **`process_frame` Method**
- **Purpose**: Processes a single video frame, detects objects, extracts features, and updates the tracker.
- **Key Features**:
  - Runs YOLOv5 detection on the input frame, focusing on detecting "person" and "sports ball".
  - Filters detections based on size (non-zero width/height) and confidence score.
  - Converts bounding box coordinates into the format expected by the tracker.
  - For valid detections, extracts image patches from the frame, runs feature extraction on them, and uses these features in the tracking process.
  - Calls `predict` and `update` methods of the `Tracker` to update the state of the tracks.
  - Returns a frame with overlaid bounding boxes and track information.
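
The box handling in `process_frame` comes down to two small operations: converting corner coordinates to `(left, top, width, height)` and clipping a box to the frame before cropping. The sketch below shows both in plain Python; the helper names and the 1280x720 frame size are illustrative assumptions.

```python
def xyxy_to_tlwh(x1, y1, x2, y2):
    """Corner format (x1, y1, x2, y2) to (left, top, width, height)."""
    return [x1, y1, x2 - x1, y2 - y1]

def clip_box(x1, y1, x2, y2, frame_width, frame_height):
    """Clamp a corner-format box to the frame; return None if nothing is left."""
    x1, y1 = max(0, x1), max(0, y1)
    x2, y2 = min(frame_width, x2), min(frame_height, y2)
    if x2 - x1 < 1 or y2 - y1 < 1:
        return None
    return x1, y1, x2, y2

# A box hanging over the left edge is trimmed; one fully outside is rejected
inside = clip_box(-5, 10, 40, 700, frame_width=1280, frame_height=720)
outside = clip_box(1300, 10, 1350, 60, frame_width=1280, frame_height=720)
tlwh = xyxy_to_tlwh(*inside)
```

For a NumPy image, the clipped patch would then be `frame[y1:y2, x1:x2]`, with the row (y) range first.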

### 4. **`visualize_detections` Method**
- **Purpose**: Draws bounding boxes and labels on the frame to visualize the tracked objects.
- **Key Features**:
  - Loops through the confirmed tracks and draws bounding boxes on the objects.
  - Labels each tracked object with its ID and class (either "Person" or "Ball").
  - Returns the frame with visual annotations.

## How It Works

1. **Detection**:
   - The YOLOv5 model is used to detect objects of interest (person and sports ball) in a frame.
   
2. **Feature Extraction**:
   - For each detected object, image patches are extracted, and features are generated using the pre-trained feature encoder (`mars-small128.pb`).

3. **Tracking**:
   - The features are passed to the `Tracker` (DeepSORT) to associate detections with existing tracks and create new tracks for unmatched detections.

4. **Visualization**:
   - The updated tracking information is drawn on the frame, including bounding boxes and object labels (Person, Ball).

## Key Libraries and Models
- **YOLOv5**: Object detection model used for detecting people and sports balls.
- **DeepSORT**: Tracking algorithm for associating detections across frames using cosine distance and Kalman filter for state propagation.
- **TensorFlow (for encoder)**: Used for loading the pre-trained feature extraction model.

### Benefits:
- **Multi-object Tracking**: Tracks both people and sports balls frame by frame; whether this runs in real time depends on the hardware and the video resolution.
- **Feature-based Tracking**: Uses feature extraction to improve tracking accuracy, especially for objects that are difficult to distinguish based on bounding boxes alone.

### Use Cases:
- **Sports Analytics**: Can be used in sports videos to track players and balls.
- **Surveillance**: Applicable for tracking people and objects of interest in surveillance footage.


In [24]:
class SportsTrackerWithDeepSort:
    def __init__(self):
        """Initialize model, metric, and encoder."""
        self.device = 'cuda' if torch.cuda.is_available() else 'cpu'
        self.model = self.load_yolo_model()
        self.names = self.model.names
        self.desired_classes = {'person': 0, 'sports ball': 32}
        self.confidence_threshold = 0.5
        self.img_size = 640

        self.metric = NearestNeighborDistanceMetric("cosine", matching_threshold=0.4)
        self.tracker = Tracker(self.metric)

        model_filename = 'mars-small128.pb'
        self.encoder = create_box_encoder(model_filename, batch_size=1)

    def load_yolo_model(self):
        """Load YOLOv5 model."""
        return YOLO('yolov5m.pt')

    def process_frame(self, frame):
      """Process frame and track objects."""
      results = self.model.predict(frame, device=self.device, imgsz=self.img_size,
                                   classes=list(self.desired_classes.values()))
      result = results[0]

      detections = []
      boxes = result.boxes
      for box in boxes:
          x1y1x2y2 = box.xyxy[0].cpu().numpy()
          conf = box.conf[0].cpu().numpy()
          cls = int(box.cls[0].cpu().numpy())
          if cls in self.desired_classes.values():
              x1, y1, x2, y2 = map(int, x1y1x2y2)
              w, h = x2 - x1, y2 - y1
              # Check if width or height is zero and confidence threshold
              if w > 0 and h > 0 and conf > self.confidence_threshold:  # Ensure the patch is not empty and confident
                bbox = [x1, y1, w, h]
                detections.append({'box': bbox, 'score': float(conf), 'class': cls})

      bbox_xywh, confs, classes = [], [], []
      for det in detections:
          x, y, w, h = det['box']
          x_c, y_c = x + w / 2, y + h / 2
          bbox_xywh.append([x_c, y_c, w, h])
          confs.append(det['score'])
          classes.append(det['class'])

      detections = []
      if bbox_xywh:
          patches, kept = [], []
          for idx, (x, y, w, h) in enumerate(bbox_xywh):
              # Calculate the patch coordinates, clipped to the frame
              x1, y1 = max(0, int(x - w / 2)), max(0, int(y - h / 2))
              x2, y2 = min(frame.shape[1], int(x + w / 2)), min(frame.shape[0], int(y + h / 2))

              # Check if the patch is too small
              if x2 - x1 < 1 or y2 - y1 < 1:
                  continue  # Skip this patch

              # Extract the patch and remember which detection it belongs to,
              # so boxes and features stay aligned when a patch is skipped
              patches.append(frame[y1:y2, x1:x2])
              kept.append(idx)

          # Continue only if there are valid patches
          if patches:
              features = self.encoder(patches)
              detections = [
                  Detection(bbox_xywh[i], confs[i], classes[i], feat)
                  for i, feat in zip(kept, features)
              ]

      self.tracker.predict()
      self.tracker.update(detections)

      return self.visualize_detections(frame, self.tracker.tracks)

    def visualize_detections(self, frame, tracks):
        """Draw boxes and labels on the frame."""
        frame_copy = frame.copy()
        for track in tracks:
            if not track.is_confirmed() or track.time_since_update > 1:
                continue
            bbox = track.to_tlbr()
            x1, y1, x2, y2 = map(int, bbox)
            track_id = track.track_id
            class_id = track.get_class()
            color = (255, 0, 0) if class_id == 0 else (0, 255, 0)

            cv2.rectangle(frame_copy, (x1, y1), (x2, y2), color, 2)
            label = f"ID {track_id}: {'Person' if class_id == 0 else 'Ball'}"
            cv2.putText(frame_copy, label, (x1, y1 - 10), cv2.FONT_HERSHEY_SIMPLEX, 0.5, color, 2)
        return frame_copy


# `process_video_with_deep_sort` Function Overview

The `process_video_with_deep_sort` function is designed to process a video, apply object detection and tracking using the `SportsTrackerWithDeepSort` class, and save the processed video with the tracked objects to an output file. This function integrates the DeepSORT tracking system, YOLOv5 for detection, and visualizes tracking results.

## Function Components

### 1. **Video Input**
- **Purpose**: Opens and reads the input video.
- **Key Features**:
  - Uses OpenCV's `cv2.VideoCapture` to load the video from the specified `video_path`.
  - Checks if the video was successfully opened. If not, raises a `ValueError`.

### 2. **Video Properties**
- **Purpose**: Retrieves key properties of the video (frame dimensions, FPS, and total frames).
- **Key Features**:
  - The width, height, frames per second (FPS), and total number of frames are extracted from the video using OpenCV's `cv2.CAP_PROP_*` constants.
  
### 3. **Video Writer Setup**
- **Purpose**: Prepares to save the processed video to the `output_path`.
- **Key Features**:
  - The `cv2.VideoWriter` is initialized with the appropriate codec (`mp4v`), frame rate, and frame size to write the processed frames into an output video file.

### 4. **Progress Bar**
- **Purpose**: Displays a progress bar during video processing.
- **Key Features**:
  - Uses `tqdm` to show the progress of video processing (frame-by-frame processing).

### 5. **Frame Processing Loop**
- **Purpose**: Reads frames from the input video and processes them using DeepSORT for object tracking.
- **Key Features**:
  - In a loop, the function reads each frame from the video.
  - For each frame, it calls the `tracker.process_frame(frame)` method from the `SportsTrackerWithDeepSort` class to perform detection and tracking.
  - After processing, the frame is written to the output video file.

### 6. **Completion**
- **Purpose**: Finalizes video processing.
- **Key Features**:
  - Closes the progress bar, video capture, and video writer objects.
  - Prints a completion message indicating the end of video processing.

### How It Works:
1. **Input Video**: The video is opened using OpenCV.
2. **Processing**: Each frame is processed by the `SportsTrackerWithDeepSort` class, which performs object detection, feature extraction, and tracking using YOLOv5 and DeepSORT.
3. **Output Video**: The processed frames, with bounding boxes and labels overlaid, are saved to the output video file.
4. **Progress Reporting**: A progress bar is shown to track the processing status.
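
The read, process, write loop can be exercised without a real video. The sketch below wraps the loop in `try`/`finally` so that both resources are released even if a frame fails to process. `FakeCapture` and `FakeWriter` are hypothetical stand-ins that mimic only the `read()`, `write()`, and `release()` calls of `cv2.VideoCapture` and `cv2.VideoWriter`, and strings stand in for frames.

```python
def run_pipeline(cap, out, process_frame):
    """Read frames from `cap`, process them, write to `out`; always release both."""
    frames_written = 0
    try:
        while True:
            success, frame = cap.read()
            if not success:
                break
            out.write(process_frame(frame))
            frames_written += 1
    finally:
        cap.release()
        out.release()
    return frames_written

class FakeCapture:
    """Stand-in for a video reader: returns (True, frame) until frames run out."""
    def __init__(self, frames):
        self._frames = list(frames)
        self.released = False
    def read(self):
        if self._frames:
            return True, self._frames.pop(0)
        return False, None
    def release(self):
        self.released = True

class FakeWriter:
    """Stand-in for a video writer: collects written frames in a list."""
    def __init__(self):
        self.frames = []
        self.released = False
    def write(self, frame):
        self.frames.append(frame)
    def release(self):
        self.released = True

cap, out = FakeCapture(["f0", "f1", "f2"]), FakeWriter()
count = run_pipeline(cap, out, process_frame=str.upper)
```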

### Requirements:
- **OpenCV**: For video reading, writing, and frame manipulation.
- **tqdm**: For showing progress during the video processing.
- **SportsTrackerWithDeepSort**: For performing object detection and tracking using YOLO and DeepSORT.



In [13]:
def process_video_with_deep_sort(video_path, output_path):
    """Process video using Deep SORT and save the output."""
    tracker = SportsTrackerWithDeepSort()

    cap = cv2.VideoCapture(video_path)
    if not cap.isOpened():
        raise ValueError("Error: Could not open video.")

    # Video properties
    frame_width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
    frame_height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
    fps = cap.get(cv2.CAP_PROP_FPS)
    total_frames = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))

    fourcc = cv2.VideoWriter_fourcc(*'mp4v')
    out = cv2.VideoWriter(output_path, fourcc, fps, (frame_width, frame_height))

    pbar = tqdm(total=total_frames, desc="Processing Frames")

    try:
        while True:
            success, frame = cap.read()
            if not success:
                break
            # Process frame
            processed_frame = tracker.process_frame(frame)
            # Save processed frame
            out.write(processed_frame)
            pbar.update(1)
    finally:
        # Cleanup runs even if processing a frame raises an exception
        pbar.close()
        cap.release()
        out.release()
    print("Video processing completed.")


## Video Processing with DeepSORT

This script processes a video using DeepSORT for object tracking, ensuring that the input video exists and the output directory is set up properly.

### Script Overview

1. **Define Video Paths**:
    - **`video_path`**: Specifies the path to the input video file (`Football match.f136.mp4`).
    - **`output_path`**: Specifies the path to save the processed output video (`output task2.mp4`).

2. **Check if Input Video Exists**:
    - Uses `os.path.exists()` to check if the input video file exists at the given path.
    - If the file does not exist, a `FileNotFoundError` is raised with an appropriate error message.

3. **Check if Output Directory Exists**:
    - **`output_dir`**: Extracts the directory part of the output path using `os.path.dirname()`.
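    - If the output path includes a directory that does not exist, it is created with `os.makedirs()` so that the processed video can be saved. For a bare filename such as `output task2.mp4`, `os.path.dirname()` returns an empty string and this step is skipped.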

4. **Process the Video**:
    - The `process_video_with_deep_sort(video_path, output_path)` function is called to process the video with DeepSORT tracking.
    - If the processing is successful, a success message is printed with the path to the output video.
    - If an exception occurs during video processing (e.g., an error with reading the video, tracking, or writing the output), it is caught and an error message is printed. The exception is not re-raised, so no traceback is shown.
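
Steps 2 and 3 above can also be expressed as a small reusable helper built on `pathlib`. This is a sketch only, not the notebook's code; `prepare_paths` is a hypothetical name, and the demonstration uses an empty placeholder file in a temporary directory rather than a real video.

```python
import tempfile
from pathlib import Path


def prepare_paths(video_path, output_path):
    """Validate the input file and create the output directory if needed."""
    video = Path(video_path)
    output = Path(output_path)
    if not video.is_file():
        raise FileNotFoundError(f"The input video '{video}' does not exist.")
    # parents=True creates intermediate directories; exist_ok=True tolerates reruns
    output.parent.mkdir(parents=True, exist_ok=True)
    return video, output


# Demonstration in a temporary directory with a placeholder file
with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "input.mp4"
    src.write_bytes(b"")  # empty placeholder, not a real video
    video, output = prepare_paths(src, Path(tmp) / "results" / "tracked.mp4")
    print(output.parent.is_dir())  # True
```

Returning the validated paths lets the caller pass them straight on to the processing function, for example as `str(video)` and `str(output)`.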


In [25]:
import os

if __name__ == "__main__":
    # Define video paths
    video_path = 'Football match.f136.mp4'
    output_path = 'output task2.mp4'

    # Check if the input video exists
    if not os.path.exists(video_path):
        raise FileNotFoundError(f"Error: The input video '{video_path}' does not exist.")

    # Check if the output directory exists; if not, create it
    output_dir = os.path.dirname(output_path)
    if output_dir:
        os.makedirs(output_dir, exist_ok=True)

    # Process the video
    try:
        process_video_with_deep_sort(video_path, output_path)
        print(f"Video processing completed. Output saved to {output_path}")
    except Exception as e:
        print(f"Error occurred during video processing: {e}")

PRO TIP 💡 Replace 'model=yolov5m.pt' with new 'model=yolov5mu.pt'.
YOLOv5 'u' models are trained with https://github.com/ultralytics/ultralytics and feature improved performance vs standard YOLOv5 models trained with https://github.com/ultralytics/yolov5.



Processing Frames:   0%|          | 0/1074 [00:00<?, ?it/s]


0: 384x640 6 persons, 24.8ms
Speed: 2.0ms preprocess, 24.8ms inference, 1.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   0%|          | 1/1074 [00:00<16:56,  1.06it/s]


0: 384x640 6 persons, 1 sports ball, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 1.5ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 6 persons, 1 sports ball, 24.1ms
Speed: 2.1ms preprocess, 24.1ms inference, 1.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   0%|          | 3/1074 [00:01<05:22,  3.32it/s]


0: 384x640 7 persons, 1 sports ball, 24.1ms
Speed: 1.8ms preprocess, 24.1ms inference, 1.4ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 6 persons, 1 sports ball, 24.1ms
Speed: 1.9ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   0%|          | 5/1074 [00:01<03:17,  5.42it/s]


0: 384x640 7 persons, 20.3ms
Speed: 2.1ms preprocess, 20.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 20.3ms
Speed: 2.8ms preprocess, 20.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   1%|          | 7/1074 [00:01<02:25,  7.34it/s]


0: 384x640 5 persons, 1 sports ball, 20.2ms
Speed: 1.8ms preprocess, 20.2ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 1 sports ball, 20.2ms
Speed: 1.8ms preprocess, 20.2ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   1%|          | 9/1074 [00:01<01:57,  9.08it/s]


0: 384x640 5 persons, 1 sports ball, 20.2ms
Speed: 1.9ms preprocess, 20.2ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 1 sports ball, 20.8ms
Speed: 2.0ms preprocess, 20.8ms inference, 4.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   1%|          | 11/1074 [00:01<01:45, 10.07it/s]


0: 384x640 4 persons, 1 sports ball, 22.1ms
Speed: 2.2ms preprocess, 22.1ms inference, 3.0ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 2 sports balls, 20.6ms
Speed: 2.1ms preprocess, 20.6ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   1%|          | 13/1074 [00:01<01:37, 10.89it/s]


0: 384x640 3 persons, 19.5ms
Speed: 1.9ms preprocess, 19.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 19.6ms
Speed: 1.9ms preprocess, 19.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   1%|▏         | 15/1074 [00:01<01:27, 12.11it/s]


0: 384x640 3 persons, 19.6ms
Speed: 1.9ms preprocess, 19.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 19.6ms
Speed: 2.1ms preprocess, 19.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   2%|▏         | 17/1074 [00:02<01:24, 12.58it/s]


0: 384x640 3 persons, 19.6ms
Speed: 1.8ms preprocess, 19.6ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 19.6ms
Speed: 2.1ms preprocess, 19.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   2%|▏         | 19/1074 [00:02<01:18, 13.52it/s]


0: 384x640 3 persons, 19.6ms
Speed: 2.1ms preprocess, 19.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 19.9ms
Speed: 2.0ms preprocess, 19.9ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   2%|▏         | 21/1074 [00:02<01:15, 13.89it/s]


0: 384x640 4 persons, 19.5ms
Speed: 2.0ms preprocess, 19.5ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 6 persons, 19.5ms
Speed: 1.8ms preprocess, 19.5ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   2%|▏         | 23/1074 [00:02<01:13, 14.30it/s]


0: 384x640 6 persons, 19.6ms
Speed: 1.9ms preprocess, 19.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 19.5ms
Speed: 2.0ms preprocess, 19.5ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   2%|▏         | 25/1074 [00:02<01:13, 14.32it/s]


0: 384x640 6 persons, 1 sports ball, 19.6ms
Speed: 2.0ms preprocess, 19.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 19.7ms
Speed: 6.5ms preprocess, 19.7ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   3%|▎         | 27/1074 [00:02<01:14, 14.01it/s]


0: 384x640 4 persons, 19.8ms
Speed: 2.1ms preprocess, 19.8ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 19.5ms
Speed: 2.0ms preprocess, 19.5ms inference, 4.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   3%|▎         | 29/1074 [00:02<01:14, 13.98it/s]


0: 384x640 5 persons, 19.5ms
Speed: 1.9ms preprocess, 19.5ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 19.5ms
Speed: 1.9ms preprocess, 19.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   3%|▎         | 31/1074 [00:03<01:13, 14.13it/s]


0: 384x640 6 persons, 19.5ms
Speed: 1.8ms preprocess, 19.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 19.6ms
Speed: 2.0ms preprocess, 19.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   3%|▎         | 33/1074 [00:03<01:12, 14.28it/s]


0: 384x640 4 persons, 19.6ms
Speed: 2.1ms preprocess, 19.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 19.5ms
Speed: 3.1ms preprocess, 19.5ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   3%|▎         | 35/1074 [00:03<01:12, 14.33it/s]


0: 384x640 5 persons, 19.5ms
Speed: 1.9ms preprocess, 19.5ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 19.5ms
Speed: 1.9ms preprocess, 19.5ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   3%|▎         | 37/1074 [00:03<01:09, 14.91it/s]


0: 384x640 4 persons, 20.6ms
Speed: 1.9ms preprocess, 20.6ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 20.6ms
Speed: 1.9ms preprocess, 20.6ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   4%|▎         | 39/1074 [00:03<01:09, 14.91it/s]


0: 384x640 4 persons, 22.2ms
Speed: 2.0ms preprocess, 22.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 29.4ms
Speed: 2.4ms preprocess, 29.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   4%|▍         | 41/1074 [00:03<01:14, 13.79it/s]


0: 384x640 5 persons, 21.0ms
Speed: 2.6ms preprocess, 21.0ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 21.6ms
Speed: 2.8ms preprocess, 21.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   4%|▍         | 43/1074 [00:03<01:15, 13.74it/s]


0: 384x640 4 persons, 31.1ms
Speed: 5.7ms preprocess, 31.1ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 21.4ms
Speed: 2.4ms preprocess, 21.4ms inference, 5.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   4%|▍         | 45/1074 [00:04<01:18, 13.07it/s]


0: 384x640 3 persons, 28.4ms
Speed: 2.2ms preprocess, 28.4ms inference, 5.5ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 22.8ms
Speed: 2.1ms preprocess, 22.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   4%|▍         | 47/1074 [00:04<01:17, 13.27it/s]


0: 384x640 3 persons, 21.7ms
Speed: 2.1ms preprocess, 21.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 24.8ms
Speed: 2.0ms preprocess, 24.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   5%|▍         | 49/1074 [00:04<01:17, 13.30it/s]


0: 384x640 3 persons, 22.3ms
Speed: 2.0ms preprocess, 22.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 26.1ms
Speed: 1.8ms preprocess, 26.1ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   5%|▍         | 51/1074 [00:04<01:20, 12.77it/s]


0: 384x640 3 persons, 28.8ms
Speed: 2.0ms preprocess, 28.8ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 39.1ms
Speed: 2.3ms preprocess, 39.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   5%|▍         | 53/1074 [00:04<01:32, 11.03it/s]


0: 384x640 3 persons, 35.8ms
Speed: 2.0ms preprocess, 35.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 29.5ms
Speed: 2.7ms preprocess, 29.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   5%|▌         | 55/1074 [00:05<01:50,  9.26it/s]


0: 384x640 3 persons, 30.8ms
Speed: 2.2ms preprocess, 30.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 31.9ms
Speed: 2.2ms preprocess, 31.9ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   5%|▌         | 57/1074 [00:05<01:53,  8.99it/s]


0: 384x640 1 person, 24.2ms
Speed: 2.0ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 24.2ms
Speed: 2.0ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   5%|▌         | 59/1074 [00:05<01:39, 10.22it/s]


0: 384x640 1 person, 24.1ms
Speed: 1.9ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 24.1ms
Speed: 2.4ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   6%|▌         | 61/1074 [00:05<01:29, 11.37it/s]


0: 384x640 1 person, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 25.0ms
Speed: 2.2ms preprocess, 25.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   6%|▌         | 63/1074 [00:05<01:25, 11.89it/s]


0: 384x640 1 person, 26.7ms
Speed: 4.1ms preprocess, 26.7ms inference, 2.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 24.1ms
Speed: 2.0ms preprocess, 24.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   6%|▌         | 65/1074 [00:05<01:20, 12.46it/s]


0: 384x640 1 person, 24.5ms
Speed: 2.0ms preprocess, 24.5ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 24.1ms
Speed: 2.0ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   6%|▌         | 67/1074 [00:06<01:17, 12.91it/s]


0: 384x640 1 person, 24.1ms
Speed: 2.2ms preprocess, 24.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 24.5ms
Speed: 2.3ms preprocess, 24.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   6%|▋         | 69/1074 [00:06<01:16, 13.17it/s]


0: 384x640 1 person, 24.1ms
Speed: 2.8ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 23.9ms
Speed: 1.9ms preprocess, 23.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   7%|▋         | 71/1074 [00:06<01:11, 13.98it/s]


0: 384x640 1 person, 23.6ms
Speed: 2.0ms preprocess, 23.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 42.2ms
Speed: 2.2ms preprocess, 42.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   7%|▋         | 73/1074 [00:06<01:14, 13.36it/s]


0: 384x640 1 person, 33.4ms
Speed: 2.2ms preprocess, 33.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 25.1ms
Speed: 3.7ms preprocess, 25.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   7%|▋         | 75/1074 [00:06<01:18, 12.70it/s]


0: 384x640 1 person, 39.7ms
Speed: 8.8ms preprocess, 39.7ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 36.1ms
Speed: 4.2ms preprocess, 36.1ms inference, 5.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   7%|▋         | 77/1074 [00:06<01:27, 11.35it/s]


0: 384x640 1 person, 27.9ms
Speed: 2.0ms preprocess, 27.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 30.3ms
Speed: 3.4ms preprocess, 30.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   7%|▋         | 79/1074 [00:07<01:28, 11.28it/s]


0: 384x640 1 person, 25.3ms
Speed: 2.2ms preprocess, 25.3ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 23.7ms
Speed: 2.9ms preprocess, 23.7ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   8%|▊         | 81/1074 [00:07<01:25, 11.58it/s]


0: 384x640 1 person, 28.7ms
Speed: 2.3ms preprocess, 28.7ms inference, 2.6ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 23.6ms
Speed: 1.9ms preprocess, 23.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   8%|▊         | 83/1074 [00:07<01:23, 11.82it/s]


0: 384x640 1 person, 23.6ms
Speed: 2.9ms preprocess, 23.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 23.6ms
Speed: 2.5ms preprocess, 23.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   8%|▊         | 85/1074 [00:07<01:20, 12.35it/s]


0: 384x640 1 person, 25.6ms
Speed: 5.9ms preprocess, 25.6ms inference, 3.2ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 25.9ms
Speed: 2.2ms preprocess, 25.9ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   8%|▊         | 87/1074 [00:07<01:22, 11.94it/s]


0: 384x640 1 person, 36.6ms
Speed: 4.1ms preprocess, 36.6ms inference, 4.2ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 31.8ms
Speed: 5.3ms preprocess, 31.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   8%|▊         | 89/1074 [00:07<01:29, 11.03it/s]


0: 384x640 1 person, 25.9ms
Speed: 5.5ms preprocess, 25.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 32.8ms
Speed: 2.3ms preprocess, 32.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   8%|▊         | 91/1074 [00:08<01:32, 10.68it/s]


0: 384x640 1 person, 28.5ms
Speed: 2.2ms preprocess, 28.5ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 31.6ms
Speed: 5.2ms preprocess, 31.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   9%|▊         | 93/1074 [00:08<01:30, 10.84it/s]


0: 384x640 1 person, 33.9ms
Speed: 2.0ms preprocess, 33.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 29.7ms
Speed: 4.1ms preprocess, 29.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   9%|▉         | 95/1074 [00:08<01:30, 10.79it/s]


0: 384x640 1 person, 32.3ms
Speed: 3.9ms preprocess, 32.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 34.5ms
Speed: 5.1ms preprocess, 34.5ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   9%|▉         | 97/1074 [00:08<01:33, 10.44it/s]


0: 384x640 1 person, 24.2ms
Speed: 2.5ms preprocess, 24.2ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 25.7ms
Speed: 2.1ms preprocess, 25.7ms inference, 7.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   9%|▉         | 99/1074 [00:08<01:30, 10.83it/s]


0: 384x640 1 person, 32.2ms
Speed: 4.8ms preprocess, 32.2ms inference, 4.4ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 31.1ms
Speed: 2.9ms preprocess, 31.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:   9%|▉         | 101/1074 [00:09<01:30, 10.76it/s]


0: 384x640 1 person, 29.9ms
Speed: 3.1ms preprocess, 29.9ms inference, 4.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 25.8ms
Speed: 3.2ms preprocess, 25.8ms inference, 6.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  10%|▉         | 103/1074 [00:09<01:28, 10.99it/s]


0: 384x640 1 person, 37.2ms
Speed: 3.0ms preprocess, 37.2ms inference, 8.4ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 26.2ms
Speed: 2.1ms preprocess, 26.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  10%|▉         | 105/1074 [00:09<01:28, 10.93it/s]


0: 384x640 1 person, 24.2ms
Speed: 2.0ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 24.2ms
Speed: 4.4ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  10%|▉         | 107/1074 [00:09<01:22, 11.77it/s]


0: 384x640 1 person, 24.2ms
Speed: 3.5ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 24.1ms
Speed: 2.1ms preprocess, 24.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  10%|█         | 109/1074 [00:09<01:16, 12.63it/s]


0: 384x640 1 person, 26.0ms
Speed: 2.8ms preprocess, 26.0ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  10%|█         | 111/1074 [00:09<01:12, 13.25it/s]


0: 384x640 1 person, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 24.7ms
Speed: 2.3ms preprocess, 24.7ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  11%|█         | 113/1074 [00:09<01:09, 13.92it/s]


0: 384x640 1 person, 23.2ms
Speed: 2.6ms preprocess, 23.2ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 25.5ms
Speed: 2.3ms preprocess, 25.5ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  11%|█         | 115/1074 [00:10<01:08, 14.10it/s]


0: 384x640 1 person, 23.2ms
Speed: 2.3ms preprocess, 23.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 23.2ms
Speed: 2.6ms preprocess, 23.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  11%|█         | 117/1074 [00:10<01:04, 14.83it/s]


0: 384x640 1 person, 23.1ms
Speed: 2.2ms preprocess, 23.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 23.2ms
Speed: 2.7ms preprocess, 23.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  11%|█         | 119/1074 [00:10<01:02, 15.40it/s]


0: 384x640 1 person, 20.6ms
Speed: 2.5ms preprocess, 20.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 1 person, 20.6ms
Speed: 2.6ms preprocess, 20.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  11%|█▏        | 121/1074 [00:10<00:59, 15.89it/s]


0: 384x640 5 persons, 1 sports ball, 20.6ms
Speed: 2.9ms preprocess, 20.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 6 persons, 1 sports ball, 20.6ms
Speed: 2.2ms preprocess, 20.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  11%|█▏        | 123/1074 [00:10<01:07, 14.00it/s]


0: 384x640 5 persons, 1 sports ball, 20.6ms
Speed: 2.3ms preprocess, 20.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 1 sports ball, 20.6ms
Speed: 3.0ms preprocess, 20.6ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  12%|█▏        | 125/1074 [00:10<01:12, 13.08it/s]


0: 384x640 5 persons, 1 sports ball, 24.8ms
Speed: 5.4ms preprocess, 24.8ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 1 sports ball, 20.6ms
Speed: 2.1ms preprocess, 20.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  12%|█▏        | 127/1074 [00:10<01:13, 12.90it/s]


0: 384x640 5 persons, 1 sports ball, 20.6ms
Speed: 2.2ms preprocess, 20.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 1 sports ball, 29.4ms
Speed: 2.0ms preprocess, 29.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  12%|█▏        | 129/1074 [00:11<01:15, 12.51it/s]


0: 384x640 5 persons, 2 sports balls, 20.6ms
Speed: 1.9ms preprocess, 20.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 1 sports ball, 20.6ms
Speed: 4.8ms preprocess, 20.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  12%|█▏        | 131/1074 [00:11<01:14, 12.67it/s]


0: 384x640 4 persons, 1 sports ball, 20.6ms
Speed: 1.9ms preprocess, 20.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 20.6ms
Speed: 5.4ms preprocess, 20.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  12%|█▏        | 133/1074 [00:11<01:12, 13.07it/s]


0: 384x640 4 persons, 1 sports ball, 20.6ms
Speed: 2.3ms preprocess, 20.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 1 sports ball, 20.6ms
Speed: 2.0ms preprocess, 20.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  13%|█▎        | 135/1074 [00:11<01:11, 13.09it/s]


0: 384x640 4 persons, 2 sports balls, 20.6ms
Speed: 1.8ms preprocess, 20.6ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 20.7ms
Speed: 1.8ms preprocess, 20.7ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  13%|█▎        | 137/1074 [00:11<01:11, 13.18it/s]


0: 384x640 3 persons, 1 sports ball, 23.7ms
Speed: 2.4ms preprocess, 23.7ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 20.6ms
Speed: 2.7ms preprocess, 20.6ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  13%|█▎        | 139/1074 [00:11<01:09, 13.39it/s]


0: 384x640 3 persons, 1 sports ball, 20.7ms
Speed: 2.4ms preprocess, 20.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 20.9ms
Speed: 2.5ms preprocess, 20.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  13%|█▎        | 141/1074 [00:11<01:07, 13.91it/s]


0: 384x640 3 persons, 1 sports ball, 20.6ms
Speed: 2.1ms preprocess, 20.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 20.6ms
Speed: 2.6ms preprocess, 20.6ms inference, 7.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  13%|█▎        | 143/1074 [00:12<01:06, 14.02it/s]


0: 384x640 3 persons, 1 sports ball, 21.7ms
Speed: 1.9ms preprocess, 21.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 20.9ms
Speed: 2.3ms preprocess, 20.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  14%|█▎        | 145/1074 [00:12<01:05, 14.21it/s]


0: 384x640 3 persons, 1 sports ball, 22.5ms
Speed: 2.0ms preprocess, 22.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 1 sports ball, 31.9ms
Speed: 2.2ms preprocess, 31.9ms inference, 4.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  14%|█▎        | 147/1074 [00:12<01:06, 13.87it/s]


0: 384x640 4 persons, 1 sports ball, 21.0ms
Speed: 6.3ms preprocess, 21.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 21.4ms
Speed: 1.9ms preprocess, 21.4ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  14%|█▍        | 149/1074 [00:12<01:09, 13.30it/s]


0: 384x640 3 persons, 1 sports ball, 21.4ms
Speed: 2.0ms preprocess, 21.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 21.3ms
Speed: 2.7ms preprocess, 21.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  14%|█▍        | 151/1074 [00:12<01:08, 13.45it/s]


0: 384x640 3 persons, 1 sports ball, 22.2ms
Speed: 3.3ms preprocess, 22.2ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 3 persons, 1 sports ball, 21.8ms
Speed: 4.4ms preprocess, 21.8ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  14%|█▍        | 153/1074 [00:12<01:13, 12.50it/s]


0: 384x640 3 persons, 1 sports ball, 22.2ms
Speed: 5.2ms preprocess, 22.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 2 persons, 1 sports ball, 22.2ms
Speed: 3.4ms preprocess, 22.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  14%|█▍        | 155/1074 [00:13<01:10, 12.95it/s]


0: 384x640 13 persons, 1 sports ball, 22.2ms
Speed: 3.3ms preprocess, 22.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 14 persons, 1 sports ball, 28.4ms
Speed: 3.3ms preprocess, 28.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  15%|█▍        | 157/1074 [00:13<01:25, 10.69it/s]


0: 384x640 14 persons, 26.0ms
Speed: 3.4ms preprocess, 26.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 14 persons, 1 sports ball, 23.2ms
Speed: 8.8ms preprocess, 23.2ms inference, 6.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  15%|█▍        | 159/1074 [00:13<01:35,  9.53it/s]


0: 384x640 14 persons, 1 sports ball, 35.7ms
Speed: 1.9ms preprocess, 35.7ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 14 persons, 34.9ms
Speed: 7.2ms preprocess, 34.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  15%|█▍        | 161/1074 [00:13<01:47,  8.46it/s]


0: 384x640 14 persons, 23.4ms
Speed: 2.5ms preprocess, 23.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  15%|█▌        | 162/1074 [00:14<02:00,  7.59it/s]


0: 384x640 14 persons, 23.7ms
Speed: 2.1ms preprocess, 23.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  15%|█▌        | 163/1074 [00:14<02:19,  6.51it/s]


0: 384x640 14 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  15%|█▌        | 164/1074 [00:14<02:10,  6.96it/s]


0: 384x640 14 persons, 24.1ms
Speed: 2.3ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  15%|█▌        | 165/1074 [00:14<02:02,  7.42it/s]


0: 384x640 14 persons, 23.2ms
Speed: 2.3ms preprocess, 23.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  15%|█▌        | 166/1074 [00:14<01:56,  7.80it/s]


0: 384x640 14 persons, 1 sports ball, 38.7ms
Speed: 4.1ms preprocess, 38.7ms inference, 5.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  16%|█▌        | 167/1074 [00:14<02:29,  6.09it/s]


0: 384x640 15 persons, 1 sports ball, 23.1ms
Speed: 2.6ms preprocess, 23.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  16%|█▌        | 168/1074 [00:14<02:14,  6.74it/s]


0: 384x640 14 persons, 1 sports ball, 23.2ms
Speed: 2.4ms preprocess, 23.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  16%|█▌        | 169/1074 [00:15<02:21,  6.41it/s]


0: 384x640 14 persons, 22.7ms
Speed: 2.3ms preprocess, 22.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  16%|█▌        | 170/1074 [00:15<02:10,  6.94it/s]


0: 384x640 16 persons, 28.6ms
Speed: 5.6ms preprocess, 28.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  16%|█▌        | 171/1074 [00:15<02:34,  5.84it/s]


0: 384x640 16 persons, 22.7ms
Speed: 6.2ms preprocess, 22.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  16%|█▌        | 172/1074 [00:15<02:19,  6.48it/s]


0: 384x640 17 persons, 30.8ms
Speed: 2.0ms preprocess, 30.8ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  16%|█▌        | 173/1074 [00:15<02:45,  5.44it/s]


0: 384x640 16 persons, 22.7ms
Speed: 2.6ms preprocess, 22.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 16 persons, 22.6ms
Speed: 5.2ms preprocess, 22.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  16%|█▋        | 175/1074 [00:16<02:11,  6.83it/s]


0: 384x640 16 persons, 24.6ms
Speed: 5.3ms preprocess, 24.6ms inference, 7.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  16%|█▋        | 176/1074 [00:16<02:03,  7.26it/s]


0: 384x640 15 persons, 22.7ms
Speed: 2.1ms preprocess, 22.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 16 persons, 22.6ms
Speed: 2.4ms preprocess, 22.6ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  17%|█▋        | 178/1074 [00:16<01:50,  8.13it/s]


0: 384x640 14 persons, 22.7ms
Speed: 3.9ms preprocess, 22.7ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  17%|█▋        | 179/1074 [00:16<01:47,  8.32it/s]


0: 384x640 15 persons, 22.7ms
Speed: 2.8ms preprocess, 22.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  17%|█▋        | 180/1074 [00:16<01:45,  8.47it/s]


0: 384x640 14 persons, 34.8ms
Speed: 3.8ms preprocess, 34.8ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  17%|█▋        | 181/1074 [00:16<01:47,  8.32it/s]


0: 384x640 16 persons, 27.9ms
Speed: 2.5ms preprocess, 27.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  17%|█▋        | 182/1074 [00:16<01:44,  8.53it/s]


0: 384x640 12 persons, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 13 persons, 24.2ms
Speed: 3.3ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  17%|█▋        | 184/1074 [00:17<01:33,  9.54it/s]


0: 384x640 12 persons, 24.1ms
Speed: 2.6ms preprocess, 24.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 12 persons, 23.6ms
Speed: 2.4ms preprocess, 23.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  17%|█▋        | 186/1074 [00:17<01:26, 10.22it/s]


0: 384x640 11 persons, 1 sports ball, 23.7ms
Speed: 2.0ms preprocess, 23.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 12 persons, 23.6ms
Speed: 1.9ms preprocess, 23.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  18%|█▊        | 188/1074 [00:17<01:24, 10.42it/s]


0: 384x640 11 persons, 23.6ms
Speed: 5.3ms preprocess, 23.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 12 persons, 23.4ms
Speed: 2.0ms preprocess, 23.4ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  18%|█▊        | 190/1074 [00:17<01:20, 11.01it/s]


0: 384x640 10 persons, 22.7ms
Speed: 2.4ms preprocess, 22.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 13 persons, 29.1ms
Speed: 2.8ms preprocess, 29.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  18%|█▊        | 192/1074 [00:17<01:18, 11.24it/s]


0: 384x640 13 persons, 26.8ms
Speed: 3.9ms preprocess, 26.8ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 11 persons, 22.2ms
Speed: 1.8ms preprocess, 22.2ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  18%|█▊        | 194/1074 [00:17<01:20, 10.89it/s]


0: 384x640 14 persons, 22.8ms
Speed: 2.0ms preprocess, 22.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 13 persons, 22.2ms
Speed: 3.8ms preprocess, 22.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  18%|█▊        | 196/1074 [00:18<01:19, 11.01it/s]


0: 384x640 14 persons, 22.3ms
Speed: 4.2ms preprocess, 22.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 14 persons, 22.2ms
Speed: 2.4ms preprocess, 22.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  18%|█▊        | 198/1074 [00:18<01:19, 10.98it/s]


0: 384x640 14 persons, 25.9ms
Speed: 4.7ms preprocess, 25.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 15 persons, 22.2ms
Speed: 1.9ms preprocess, 22.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  19%|█▊        | 200/1074 [00:18<01:20, 10.82it/s]


0: 384x640 14 persons, 22.2ms
Speed: 2.0ms preprocess, 22.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 13 persons, 22.3ms
Speed: 3.3ms preprocess, 22.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  19%|█▉        | 202/1074 [00:18<01:28,  9.87it/s]


0: 384x640 12 persons, 22.9ms
Speed: 3.5ms preprocess, 22.9ms inference, 3.2ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 8 persons, 1 sports ball, 22.2ms
Speed: 2.0ms preprocess, 22.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  19%|█▉        | 204/1074 [00:18<01:30,  9.61it/s]


0: 384x640 8 persons, 24.2ms
Speed: 4.0ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 9 persons, 1 sports ball, 23.4ms
Speed: 2.3ms preprocess, 23.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  19%|█▉        | 206/1074 [00:19<01:30,  9.61it/s]


0: 384x640 8 persons, 1 sports ball, 23.8ms
Speed: 5.3ms preprocess, 23.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  19%|█▉        | 207/1074 [00:19<01:31,  9.51it/s]


0: 384x640 8 persons, 1 sports ball, 22.2ms
Speed: 2.6ms preprocess, 22.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  19%|█▉        | 208/1074 [00:19<01:31,  9.48it/s]


0: 384x640 8 persons, 1 sports ball, 29.6ms
Speed: 2.1ms preprocess, 29.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  19%|█▉        | 209/1074 [00:19<01:30,  9.55it/s]


0: 384x640 8 persons, 1 sports ball, 29.2ms
Speed: 2.7ms preprocess, 29.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|█▉        | 210/1074 [00:19<01:43,  8.35it/s]


0: 384x640 7 persons, 1 sports ball, 38.3ms
Speed: 6.4ms preprocess, 38.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|█▉        | 211/1074 [00:19<01:54,  7.53it/s]


0: 384x640 7 persons, 42.7ms
Speed: 5.1ms preprocess, 42.7ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|█▉        | 212/1074 [00:19<01:56,  7.38it/s]


0: 384x640 8 persons, 1 sports ball, 29.6ms
Speed: 5.3ms preprocess, 29.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|█▉        | 213/1074 [00:20<01:53,  7.61it/s]


0: 384x640 9 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 5.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|█▉        | 214/1074 [00:20<02:14,  6.38it/s]


0: 384x640 8 persons, 27.3ms
Speed: 2.0ms preprocess, 27.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|██        | 215/1074 [00:20<02:28,  5.79it/s]


0: 384x640 8 persons, 29.1ms
Speed: 2.3ms preprocess, 29.1ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|██        | 216/1074 [00:20<02:26,  5.84it/s]


0: 384x640 8 persons, 1 sports ball, 35.1ms
Speed: 4.4ms preprocess, 35.1ms inference, 6.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|██        | 217/1074 [00:20<02:16,  6.28it/s]


0: 384x640 8 persons, 32.4ms
Speed: 4.1ms preprocess, 32.4ms inference, 6.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|██        | 218/1074 [00:21<02:39,  5.38it/s]


0: 384x640 7 persons, 32.6ms
Speed: 4.6ms preprocess, 32.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|██        | 219/1074 [00:21<02:39,  5.37it/s]


0: 384x640 8 persons, 1 sports ball, 30.8ms
Speed: 2.2ms preprocess, 30.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  20%|██        | 220/1074 [00:21<02:23,  5.94it/s]


0: 384x640 8 persons, 29.1ms
Speed: 2.4ms preprocess, 29.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██        | 221/1074 [00:21<02:14,  6.33it/s]


0: 384x640 9 persons, 33.5ms
Speed: 2.4ms preprocess, 33.5ms inference, 5.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██        | 222/1074 [00:21<02:21,  6.03it/s]


0: 384x640 8 persons, 35.3ms
Speed: 5.6ms preprocess, 35.3ms inference, 4.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██        | 223/1074 [00:21<02:11,  6.45it/s]


0: 384x640 8 persons, 24.8ms
Speed: 2.1ms preprocess, 24.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██        | 224/1074 [00:21<02:04,  6.85it/s]


0: 384x640 8 persons, 40.8ms
Speed: 4.2ms preprocess, 40.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██        | 225/1074 [00:22<02:38,  5.37it/s]


0: 384x640 9 persons, 28.2ms
Speed: 2.8ms preprocess, 28.2ms inference, 7.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██        | 226/1074 [00:22<02:44,  5.16it/s]


0: 384x640 8 persons, 24.7ms
Speed: 2.1ms preprocess, 24.7ms inference, 4.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██        | 227/1074 [00:22<02:25,  5.80it/s]


0: 384x640 9 persons, 25.3ms
Speed: 5.4ms preprocess, 25.3ms inference, 7.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██        | 228/1074 [00:22<02:18,  6.12it/s]


0: 384x640 9 persons, 24.2ms
Speed: 2.9ms preprocess, 24.2ms inference, 6.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██▏       | 229/1074 [00:22<02:07,  6.64it/s]


0: 384x640 8 persons, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 4.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  21%|██▏       | 230/1074 [00:22<02:07,  6.64it/s]


0: 384x640 9 persons, 35.1ms
Speed: 4.4ms preprocess, 35.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 231/1074 [00:23<02:01,  6.93it/s]


0: 384x640 10 persons, 27.8ms
Speed: 2.5ms preprocess, 27.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 232/1074 [00:23<02:01,  6.91it/s]


0: 384x640 9 persons, 24.2ms
Speed: 6.2ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 233/1074 [00:23<01:57,  7.15it/s]


0: 384x640 7 persons, 27.7ms
Speed: 2.9ms preprocess, 27.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 234/1074 [00:23<01:57,  7.15it/s]


0: 384x640 7 persons, 28.0ms
Speed: 2.1ms preprocess, 28.0ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 235/1074 [00:23<01:55,  7.28it/s]


0: 384x640 8 persons, 31.7ms
Speed: 2.4ms preprocess, 31.7ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 236/1074 [00:23<01:57,  7.11it/s]


0: 384x640 8 persons, 24.2ms
Speed: 5.4ms preprocess, 24.2ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 237/1074 [00:24<02:20,  5.96it/s]


0: 384x640 8 persons, 30.1ms
Speed: 10.9ms preprocess, 30.1ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 238/1074 [00:24<02:13,  6.24it/s]


0: 384x640 8 persons, 29.8ms
Speed: 3.4ms preprocess, 29.8ms inference, 11.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 239/1074 [00:24<02:11,  6.33it/s]


0: 384x640 8 persons, 25.4ms
Speed: 9.6ms preprocess, 25.4ms inference, 6.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 240/1074 [00:24<02:08,  6.50it/s]


0: 384x640 8 persons, 40.0ms
Speed: 3.0ms preprocess, 40.0ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  22%|██▏       | 241/1074 [00:24<02:22,  5.84it/s]


0: 384x640 8 persons, 27.5ms
Speed: 2.9ms preprocess, 27.5ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 242/1074 [00:24<02:11,  6.34it/s]


0: 384x640 8 persons, 31.4ms
Speed: 2.9ms preprocess, 31.4ms inference, 3.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 243/1074 [00:24<02:04,  6.67it/s]


0: 384x640 8 persons, 33.9ms
Speed: 2.2ms preprocess, 33.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 244/1074 [00:25<01:56,  7.15it/s]


0: 384x640 8 persons, 28.1ms
Speed: 2.0ms preprocess, 28.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 245/1074 [00:25<01:49,  7.57it/s]


0: 384x640 8 persons, 29.2ms
Speed: 2.0ms preprocess, 29.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 246/1074 [00:25<01:44,  7.91it/s]


0: 384x640 8 persons, 29.0ms
Speed: 2.1ms preprocess, 29.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 247/1074 [00:25<01:42,  8.11it/s]


0: 384x640 8 persons, 1 sports ball, 28.5ms
Speed: 2.2ms preprocess, 28.5ms inference, 6.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 248/1074 [00:25<01:41,  8.12it/s]


0: 384x640 8 persons, 1 sports ball, 28.6ms
Speed: 3.2ms preprocess, 28.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 249/1074 [00:25<01:42,  8.06it/s]


0: 384x640 8 persons, 54.3ms
Speed: 7.1ms preprocess, 54.3ms inference, 9.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 250/1074 [00:25<02:04,  6.61it/s]


0: 384x640 8 persons, 1 sports ball, 25.3ms
Speed: 4.3ms preprocess, 25.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 251/1074 [00:25<01:56,  7.09it/s]


0: 384x640 8 persons, 24.1ms
Speed: 2.0ms preprocess, 24.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  23%|██▎       | 252/1074 [00:26<01:49,  7.50it/s]


0: 384x640 8 persons, 1 sports ball, 28.5ms
Speed: 4.4ms preprocess, 28.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▎       | 253/1074 [00:26<01:48,  7.55it/s]


0: 384x640 8 persons, 1 sports ball, 25.7ms
Speed: 1.9ms preprocess, 25.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▎       | 254/1074 [00:26<01:57,  6.95it/s]


0: 384x640 8 persons, 1 sports ball, 28.0ms
Speed: 10.3ms preprocess, 28.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▎       | 255/1074 [00:26<01:50,  7.38it/s]


0: 384x640 8 persons, 1 sports ball, 24.7ms
Speed: 2.2ms preprocess, 24.7ms inference, 13.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▍       | 256/1074 [00:26<02:03,  6.64it/s]


0: 384x640 8 persons, 1 sports ball, 34.9ms
Speed: 3.2ms preprocess, 34.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▍       | 257/1074 [00:26<01:59,  6.86it/s]


0: 384x640 8 persons, 1 sports ball, 27.0ms
Speed: 2.0ms preprocess, 27.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▍       | 258/1074 [00:26<01:54,  7.12it/s]


0: 384x640 8 persons, 1 sports ball, 33.8ms
Speed: 2.0ms preprocess, 33.8ms inference, 5.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▍       | 259/1074 [00:27<01:48,  7.53it/s]


0: 384x640 8 persons, 1 sports ball, 24.1ms
Speed: 3.4ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▍       | 260/1074 [00:27<01:42,  7.95it/s]


0: 384x640 8 persons, 1 sports ball, 29.1ms
Speed: 2.0ms preprocess, 29.1ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▍       | 261/1074 [00:27<01:40,  8.10it/s]


0: 384x640 8 persons, 1 sports ball, 28.2ms
Speed: 2.2ms preprocess, 28.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▍       | 262/1074 [00:27<01:34,  8.57it/s]


0: 384x640 8 persons, 1 sports ball, 32.2ms
Speed: 2.0ms preprocess, 32.2ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  24%|██▍       | 263/1074 [00:27<01:36,  8.40it/s]


0: 384x640 8 persons, 1 sports ball, 25.1ms
Speed: 4.3ms preprocess, 25.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▍       | 264/1074 [00:27<01:40,  8.07it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 3.9ms preprocess, 24.2ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▍       | 265/1074 [00:27<01:46,  7.57it/s]


0: 384x640 8 persons, 1 sports ball, 27.6ms
Speed: 5.1ms preprocess, 27.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▍       | 266/1074 [00:27<01:46,  7.56it/s]


0: 384x640 8 persons, 26.8ms
Speed: 5.0ms preprocess, 26.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▍       | 267/1074 [00:28<01:40,  8.04it/s]


0: 384x640 8 persons, 1 sports ball, 27.1ms
Speed: 2.9ms preprocess, 27.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▍       | 268/1074 [00:28<01:37,  8.23it/s]


0: 384x640 8 persons, 1 sports ball, 26.3ms
Speed: 4.1ms preprocess, 26.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▌       | 269/1074 [00:28<01:36,  8.34it/s]


0: 384x640 8 persons, 1 sports ball, 26.3ms
Speed: 4.1ms preprocess, 26.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▌       | 270/1074 [00:28<01:35,  8.44it/s]


0: 384x640 8 persons, 1 sports ball, 24.5ms
Speed: 3.9ms preprocess, 24.5ms inference, 6.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▌       | 271/1074 [00:28<01:34,  8.51it/s]


0: 384x640 8 persons, 1 sports ball, 25.7ms
Speed: 5.4ms preprocess, 25.7ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▌       | 272/1074 [00:28<01:40,  8.02it/s]


0: 384x640 8 persons, 1 sports ball, 43.3ms
Speed: 2.0ms preprocess, 43.3ms inference, 6.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  25%|██▌       | 273/1074 [00:28<01:58,  6.76it/s]


0: 384x640 8 persons, 1 sports ball, 28.5ms
Speed: 7.8ms preprocess, 28.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  26%|██▌       | 274/1074 [00:28<01:53,  7.06it/s]


0: 384x640 8 persons, 1 sports ball, 25.2ms
Speed: 4.0ms preprocess, 25.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  26%|██▌       | 275/1074 [00:29<01:45,  7.57it/s]


0: 384x640 8 persons, 25.6ms
Speed: 3.9ms preprocess, 25.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  26%|██▌       | 276/1074 [00:29<01:40,  7.94it/s]


0: 384x640 8 persons, 1 sports ball, 35.2ms
Speed: 3.4ms preprocess, 35.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  26%|██▌       | 277/1074 [00:29<01:41,  7.89it/s]


0: 384x640 8 persons, 25.6ms
Speed: 3.5ms preprocess, 25.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  26%|██▌       | 278/1074 [00:29<01:37,  8.19it/s]


0: 384x640 7 persons, 27.2ms
Speed: 4.7ms preprocess, 27.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  26%|██▌       | 279/1074 [00:29<01:34,  8.44it/s]


0: 384x640 7 persons, 35.1ms
Speed: 3.4ms preprocess, 35.1ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  26%|██▌       | 280/1074 [00:29<01:55,  6.90it/s]


0: 384x640 7 persons, 24.2ms
Speed: 2.7ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 7 persons, 26.2ms
Speed: 1.8ms preprocess, 26.2ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  26%|██▋       | 282/1074 [00:30<02:02,  6.45it/s]


0: 384x640 7 persons, 24.2ms
Speed: 2.8ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 9 persons, 1 sports ball, 28.1ms
Speed: 7.3ms preprocess, 28.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  26%|██▋       | 284/1074 [00:30<01:47,  7.36it/s]


0: 384x640 7 persons, 32.2ms
Speed: 5.4ms preprocess, 32.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  27%|██▋       | 285/1074 [00:30<01:43,  7.62it/s]


0: 384x640 7 persons, 1 sports ball, 24.6ms
Speed: 2.1ms preprocess, 24.6ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  27%|██▋       | 286/1074 [00:30<01:40,  7.84it/s]


0: 384x640 7 persons, 1 sports ball, 33.5ms
Speed: 2.6ms preprocess, 33.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  27%|██▋       | 287/1074 [00:30<01:40,  7.84it/s]


0: 384x640 7 persons, 1 sports ball, 24.2ms
Speed: 4.1ms preprocess, 24.2ms inference, 6.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  27%|██▋       | 288/1074 [00:30<02:05,  6.29it/s]


0: 384x640 7 persons, 1 sports ball, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 7 persons, 1 sports ball, 24.2ms
Speed: 2.3ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  27%|██▋       | 290/1074 [00:31<02:04,  6.31it/s]


0: 384x640 7 persons, 1 sports ball, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 7 persons, 1 sports ball, 27.8ms
Speed: 2.2ms preprocess, 27.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  27%|██▋       | 292/1074 [00:31<01:56,  6.71it/s]


0: 384x640 7 persons, 1 sports ball, 26.2ms
Speed: 3.5ms preprocess, 26.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  27%|██▋       | 293/1074 [00:31<01:49,  7.16it/s]


0: 384x640 7 persons, 1 sports ball, 40.1ms
Speed: 2.8ms preprocess, 40.1ms inference, 7.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  27%|██▋       | 294/1074 [00:31<01:48,  7.19it/s]


0: 384x640 7 persons, 27.6ms
Speed: 2.3ms preprocess, 27.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  27%|██▋       | 295/1074 [00:31<01:54,  6.82it/s]


0: 384x640 8 persons, 1 sports ball, 28.7ms
Speed: 4.2ms preprocess, 28.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  28%|██▊       | 296/1074 [00:32<02:01,  6.43it/s]


0: 384x640 7 persons, 30.0ms
Speed: 2.8ms preprocess, 30.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  28%|██▊       | 297/1074 [00:32<01:52,  6.91it/s]


0: 384x640 7 persons, 26.2ms
Speed: 2.0ms preprocess, 26.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  28%|██▊       | 298/1074 [00:32<01:45,  7.37it/s]


0: 384x640 7 persons, 1 sports ball, 26.3ms
Speed: 2.0ms preprocess, 26.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  28%|██▊       | 299/1074 [00:32<02:05,  6.17it/s]


0: 384x640 7 persons, 25.2ms
Speed: 3.8ms preprocess, 25.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 7 persons, 24.9ms
Speed: 3.2ms preprocess, 24.9ms inference, 3.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  28%|██▊       | 301/1074 [00:32<01:43,  7.48it/s]


0: 384x640 6 persons, 27.5ms
Speed: 2.1ms preprocess, 27.5ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 7 persons, 24.2ms
Speed: 2.0ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  28%|██▊       | 303/1074 [00:32<01:31,  8.42it/s]


0: 384x640 6 persons, 25.7ms
Speed: 2.1ms preprocess, 25.7ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  28%|██▊       | 304/1074 [00:33<01:30,  8.55it/s]


0: 384x640 7 persons, 29.4ms
Speed: 2.2ms preprocess, 29.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 7 persons, 30.0ms
Speed: 2.6ms preprocess, 30.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  28%|██▊       | 306/1074 [00:33<01:23,  9.18it/s]


0: 384x640 7 persons, 24.8ms
Speed: 2.2ms preprocess, 24.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  29%|██▊       | 307/1074 [00:33<01:36,  7.99it/s]


0: 384x640 7 persons, 1 sports ball, 24.2ms
Speed: 4.5ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  29%|██▊       | 308/1074 [00:33<01:32,  8.30it/s]


0: 384x640 10 persons, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  29%|██▉       | 309/1074 [00:33<01:30,  8.42it/s]


0: 384x640 6 persons, 1 sports ball, 32.6ms
Speed: 2.7ms preprocess, 32.6ms inference, 4.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  29%|██▉       | 310/1074 [00:33<01:40,  7.63it/s]


0: 384x640 7 persons, 1 sports ball, 29.5ms
Speed: 2.2ms preprocess, 29.5ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  29%|██▉       | 311/1074 [00:33<01:37,  7.83it/s]


0: 384x640 7 persons, 1 sports ball, 26.4ms
Speed: 6.0ms preprocess, 26.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  29%|██▉       | 312/1074 [00:34<01:33,  8.16it/s]


0: 384x640 6 persons, 1 sports ball, 25.6ms
Speed: 3.5ms preprocess, 25.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  29%|██▉       | 313/1074 [00:34<01:29,  8.49it/s]


0: 384x640 5 persons, 1 sports ball, 27.9ms
Speed: 2.1ms preprocess, 27.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 6 persons, 1 sports ball, 24.1ms
Speed: 4.8ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  29%|██▉       | 315/1074 [00:34<01:20,  9.40it/s]


0: 384x640 6 persons, 1 sports ball, 26.0ms
Speed: 3.6ms preprocess, 26.0ms inference, 3.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  29%|██▉       | 316/1074 [00:34<01:21,  9.35it/s]


0: 384x640 6 persons, 1 sports ball, 32.2ms
Speed: 9.2ms preprocess, 32.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|██▉       | 317/1074 [00:34<01:26,  8.74it/s]


0: 384x640 6 persons, 30.6ms
Speed: 4.1ms preprocess, 30.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|██▉       | 318/1074 [00:34<01:46,  7.08it/s]


0: 384x640 6 persons, 28.6ms
Speed: 5.9ms preprocess, 28.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 6 persons, 25.0ms
Speed: 7.8ms preprocess, 25.0ms inference, 3.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|██▉       | 320/1074 [00:35<01:46,  7.09it/s]


0: 384x640 7 persons, 38.5ms
Speed: 2.3ms preprocess, 38.5ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|██▉       | 321/1074 [00:35<02:02,  6.13it/s]


0: 384x640 7 persons, 33.6ms
Speed: 6.3ms preprocess, 33.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|██▉       | 322/1074 [00:35<01:54,  6.57it/s]


0: 384x640 8 persons, 36.9ms
Speed: 2.1ms preprocess, 36.9ms inference, 4.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|███       | 323/1074 [00:35<02:08,  5.83it/s]


0: 384x640 8 persons, 1 sports ball, 35.2ms
Speed: 7.5ms preprocess, 35.2ms inference, 4.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|███       | 324/1074 [00:35<02:18,  5.41it/s]


0: 384x640 9 persons, 32.8ms
Speed: 2.1ms preprocess, 32.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|███       | 325/1074 [00:36<02:24,  5.17it/s]


0: 384x640 8 persons, 43.7ms
Speed: 2.1ms preprocess, 43.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|███       | 326/1074 [00:36<02:17,  5.43it/s]


0: 384x640 9 persons, 1 sports ball, 41.7ms
Speed: 2.1ms preprocess, 41.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  30%|███       | 327/1074 [00:36<02:32,  4.89it/s]


0: 384x640 9 persons, 1 sports ball, 31.4ms
Speed: 2.3ms preprocess, 31.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███       | 328/1074 [00:36<02:31,  4.94it/s]


0: 384x640 5 persons, 1 sports ball, 41.1ms
Speed: 7.4ms preprocess, 41.1ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███       | 329/1074 [00:36<02:21,  5.27it/s]


0: 384x640 5 persons, 1 sports ball, 26.6ms
Speed: 5.8ms preprocess, 26.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 1 sports ball, 25.8ms
Speed: 2.0ms preprocess, 25.8ms inference, 4.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███       | 331/1074 [00:37<01:46,  6.97it/s]


0: 384x640 5 persons, 1 sports ball, 24.2ms
Speed: 6.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███       | 332/1074 [00:37<01:39,  7.49it/s]


0: 384x640 5 persons, 1 sports ball, 27.0ms
Speed: 4.9ms preprocess, 27.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███       | 333/1074 [00:37<01:47,  6.86it/s]


0: 384x640 5 persons, 59.4ms
Speed: 2.1ms preprocess, 59.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███       | 334/1074 [00:37<01:44,  7.05it/s]


0: 384x640 5 persons, 24.6ms
Speed: 4.9ms preprocess, 24.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███       | 335/1074 [00:37<01:39,  7.45it/s]


0: 384x640 5 persons, 37.9ms
Speed: 3.0ms preprocess, 37.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███▏      | 336/1074 [00:37<01:39,  7.41it/s]


0: 384x640 5 persons, 28.8ms
Speed: 2.1ms preprocess, 28.8ms inference, 4.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███▏      | 337/1074 [00:37<01:37,  7.59it/s]


0: 384x640 5 persons, 37.5ms
Speed: 4.6ms preprocess, 37.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  31%|███▏      | 338/1074 [00:38<01:57,  6.26it/s]


0: 384x640 5 persons, 30.7ms
Speed: 2.2ms preprocess, 30.7ms inference, 3.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 339/1074 [00:38<01:57,  6.26it/s]


0: 384x640 5 persons, 32.4ms
Speed: 2.2ms preprocess, 32.4ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 340/1074 [00:38<01:51,  6.58it/s]


0: 384x640 5 persons, 1 sports ball, 30.6ms
Speed: 2.2ms preprocess, 30.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 341/1074 [00:38<01:47,  6.84it/s]


0: 384x640 5 persons, 28.4ms
Speed: 8.0ms preprocess, 28.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 342/1074 [00:38<01:48,  6.74it/s]


0: 384x640 5 persons, 27.4ms
Speed: 2.2ms preprocess, 27.4ms inference, 4.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 343/1074 [00:38<01:44,  6.98it/s]


0: 384x640 5 persons, 32.4ms
Speed: 2.9ms preprocess, 32.4ms inference, 6.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 344/1074 [00:38<02:00,  6.08it/s]


0: 384x640 5 persons, 33.2ms
Speed: 2.1ms preprocess, 33.2ms inference, 4.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 345/1074 [00:39<01:53,  6.41it/s]


0: 384x640 5 persons, 1 sports ball, 38.6ms
Speed: 2.4ms preprocess, 38.6ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 346/1074 [00:39<01:49,  6.63it/s]


0: 384x640 6 persons, 1 sports ball, 32.4ms
Speed: 4.7ms preprocess, 32.4ms inference, 5.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 347/1074 [00:39<01:57,  6.21it/s]


0: 384x640 5 persons, 1 sports ball, 40.3ms
Speed: 3.6ms preprocess, 40.3ms inference, 7.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 348/1074 [00:39<02:16,  5.33it/s]


0: 384x640 5 persons, 1 sports ball, 26.9ms
Speed: 2.0ms preprocess, 26.9ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  32%|███▏      | 349/1074 [00:39<02:04,  5.84it/s]


0: 384x640 5 persons, 1 sports ball, 25.4ms
Speed: 9.1ms preprocess, 25.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  33%|███▎      | 350/1074 [00:40<02:17,  5.25it/s]


0: 384x640 5 persons, 1 sports ball, 38.4ms
Speed: 12.9ms preprocess, 38.4ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  33%|███▎      | 351/1074 [00:40<02:08,  5.63it/s]


0: 384x640 5 persons, 1 sports ball, 36.4ms
Speed: 2.1ms preprocess, 36.4ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  33%|███▎      | 352/1074 [00:40<01:52,  6.42it/s]


0: 384x640 6 persons, 42.2ms
Speed: 2.2ms preprocess, 42.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  33%|███▎      | 353/1074 [00:40<02:08,  5.62it/s]


0: 384x640 6 persons, 35.6ms
Speed: 2.3ms preprocess, 35.6ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  33%|███▎      | 354/1074 [00:40<01:58,  6.10it/s]


0: 384x640 5 persons, 27.2ms
Speed: 7.5ms preprocess, 27.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  33%|███▎      | 355/1074 [00:40<01:49,  6.57it/s]


0: 384x640 5 persons, 30.0ms
Speed: 2.0ms preprocess, 30.0ms inference, 4.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  33%|███▎      | 356/1074 [00:40<01:52,  6.37it/s]


0: 384x640 5 persons, 30.8ms
Speed: 5.4ms preprocess, 30.8ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  33%|███▎      | 357/1074 [00:41<01:44,  6.88it/s]


0: 384x640 5 persons, 24.1ms
Speed: 2.2ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 5 persons, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  33%|███▎      | 359/1074 [00:41<01:25,  8.40it/s]


0: 384x640 6 persons, 24.5ms
Speed: 2.2ms preprocess, 24.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 7 persons, 24.1ms
Speed: 5.6ms preprocess, 24.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  34%|███▎      | 361/1074 [00:41<01:14,  9.61it/s]


0: 384x640 6 persons, 1 sports ball, 30.9ms
Speed: 5.1ms preprocess, 30.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  34%|███▎      | 362/1074 [00:41<01:17,  9.24it/s]


0: 384x640 6 persons, 24.5ms
Speed: 6.4ms preprocess, 24.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  34%|███▍      | 363/1074 [00:41<01:25,  8.36it/s]


0: 384x640 7 persons, 35.6ms
Speed: 7.8ms preprocess, 35.6ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  34%|███▍      | 364/1074 [00:41<01:25,  8.33it/s]


0: 384x640 6 persons, 23.5ms
Speed: 2.3ms preprocess, 23.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 6 persons, 25.7ms
Speed: 6.0ms preprocess, 25.7ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  34%|███▍      | 366/1074 [00:42<01:21,  8.69it/s]


0: 384x640 6 persons, 25.2ms
Speed: 8.3ms preprocess, 25.2ms inference, 3.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  34%|███▍      | 367/1074 [00:42<01:19,  8.86it/s]


0: 384x640 6 persons, 28.1ms
Speed: 2.6ms preprocess, 28.1ms inference, 3.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  34%|███▍      | 368/1074 [00:42<01:18,  9.00it/s]


0: 384x640 6 persons, 26.4ms
Speed: 3.9ms preprocess, 26.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  34%|███▍      | 369/1074 [00:42<01:17,  9.09it/s]


0: 384x640 6 persons, 25.8ms
Speed: 2.6ms preprocess, 25.8ms inference, 6.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  34%|███▍      | 370/1074 [00:42<01:17,  9.08it/s]


0: 384x640 7 persons, 31.0ms
Speed: 9.3ms preprocess, 31.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  35%|███▍      | 371/1074 [00:42<01:15,  9.32it/s]


0: 384x640 8 persons, 1 sports ball, 23.2ms
Speed: 2.1ms preprocess, 23.2ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 8 persons, 29.6ms
Speed: 5.6ms preprocess, 29.6ms inference, 11.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  35%|███▍      | 373/1074 [00:42<01:19,  8.83it/s]


0: 384x640 8 persons, 1 sports ball, 23.2ms
Speed: 2.6ms preprocess, 23.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  35%|███▍      | 374/1074 [00:42<01:20,  8.71it/s]


0: 384x640 8 persons, 1 sports ball, 23.2ms
Speed: 5.6ms preprocess, 23.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  35%|███▍      | 375/1074 [00:43<01:20,  8.66it/s]


0: 384x640 9 persons, 1 sports ball, 23.1ms
Speed: 5.9ms preprocess, 23.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  35%|███▌      | 376/1074 [00:43<01:35,  7.30it/s]


0: 384x640 8 persons, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 2.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  35%|███▌      | 377/1074 [00:43<01:31,  7.59it/s]


0: 384x640 8 persons, 28.6ms
Speed: 5.8ms preprocess, 28.6ms inference, 6.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  35%|███▌      | 378/1074 [00:43<01:44,  6.66it/s]


0: 384x640 8 persons, 26.0ms
Speed: 2.3ms preprocess, 26.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 8 persons, 32.2ms
Speed: 8.5ms preprocess, 32.2ms inference, 4.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  35%|███▌      | 380/1074 [00:43<01:36,  7.17it/s]


0: 384x640 8 persons, 1 sports ball, 28.8ms
Speed: 11.3ms preprocess, 28.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  35%|███▌      | 381/1074 [00:43<01:32,  7.50it/s]


0: 384x640 9 persons, 1 sports ball, 24.2ms
Speed: 8.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▌      | 382/1074 [00:44<01:30,  7.67it/s]


0: 384x640 8 persons, 36.3ms
Speed: 2.0ms preprocess, 36.3ms inference, 5.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▌      | 383/1074 [00:44<01:45,  6.52it/s]


0: 384x640 9 persons, 29.8ms
Speed: 2.4ms preprocess, 29.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▌      | 384/1074 [00:44<01:35,  7.20it/s]


0: 384x640 8 persons, 1 sports ball, 29.0ms
Speed: 4.7ms preprocess, 29.0ms inference, 5.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▌      | 385/1074 [00:44<01:35,  7.25it/s]


0: 384x640 8 persons, 1 sports ball, 24.3ms
Speed: 4.9ms preprocess, 24.3ms inference, 10.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▌      | 386/1074 [00:44<01:47,  6.42it/s]


0: 384x640 8 persons, 1 sports ball, 34.5ms
Speed: 5.3ms preprocess, 34.5ms inference, 5.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▌      | 387/1074 [00:44<01:51,  6.17it/s]


0: 384x640 8 persons, 1 sports ball, 32.7ms
Speed: 3.1ms preprocess, 32.7ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▌      | 388/1074 [00:44<01:43,  6.65it/s]


0: 384x640 8 persons, 25.4ms
Speed: 2.3ms preprocess, 25.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▌      | 389/1074 [00:45<01:38,  6.95it/s]


0: 384x640 8 persons, 1 sports ball, 24.5ms
Speed: 2.2ms preprocess, 24.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▋      | 390/1074 [00:45<01:36,  7.10it/s]


0: 384x640 8 persons, 1 sports ball, 25.6ms
Speed: 2.0ms preprocess, 25.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▋      | 391/1074 [00:45<01:30,  7.57it/s]


0: 384x640 8 persons, 1 sports ball, 38.9ms
Speed: 2.0ms preprocess, 38.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  36%|███▋      | 392/1074 [00:45<01:29,  7.59it/s]


0: 384x640 8 persons, 1 sports ball, 25.7ms
Speed: 6.8ms preprocess, 25.7ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 393/1074 [00:45<01:31,  7.44it/s]


0: 384x640 8 persons, 1 sports ball, 48.1ms
Speed: 2.1ms preprocess, 48.1ms inference, 4.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 394/1074 [00:45<01:42,  6.64it/s]


0: 384x640 8 persons, 1 sports ball, 29.5ms
Speed: 3.7ms preprocess, 29.5ms inference, 3.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 395/1074 [00:45<01:33,  7.28it/s]


0: 384x640 8 persons, 1 sports ball, 24.1ms
Speed: 2.2ms preprocess, 24.1ms inference, 4.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 396/1074 [00:46<01:26,  7.85it/s]


0: 384x640 8 persons, 1 sports ball, 24.6ms
Speed: 5.3ms preprocess, 24.6ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 397/1074 [00:46<01:21,  8.32it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 4.8ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 398/1074 [00:46<01:18,  8.56it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 8.3ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 399/1074 [00:46<01:15,  8.91it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 2.6ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 400/1074 [00:46<01:13,  9.14it/s]


0: 384x640 8 persons, 1 sports ball, 26.4ms
Speed: 3.8ms preprocess, 26.4ms inference, 7.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 401/1074 [00:46<01:31,  7.36it/s]


0: 384x640 8 persons, 1 sports ball, 30.7ms
Speed: 2.6ms preprocess, 30.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  37%|███▋      | 402/1074 [00:46<01:29,  7.49it/s]


0: 384x640 8 persons, 1 sports ball, 28.9ms
Speed: 2.2ms preprocess, 28.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 403/1074 [00:46<01:27,  7.64it/s]


0: 384x640 8 persons, 1 sports ball, 24.4ms
Speed: 7.2ms preprocess, 24.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 404/1074 [00:47<01:26,  7.78it/s]


0: 384x640 9 persons, 1 sports ball, 29.5ms
Speed: 2.2ms preprocess, 29.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 405/1074 [00:47<01:24,  7.87it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 6.9ms preprocess, 24.2ms inference, 6.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 406/1074 [00:47<01:22,  8.05it/s]


0: 384x640 8 persons, 1 sports ball, 27.8ms
Speed: 7.0ms preprocess, 27.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 407/1074 [00:47<01:21,  8.17it/s]


0: 384x640 8 persons, 1 sports ball, 26.9ms
Speed: 7.2ms preprocess, 26.9ms inference, 1.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 408/1074 [00:47<01:33,  7.12it/s]


0: 384x640 8 persons, 26.3ms
Speed: 7.5ms preprocess, 26.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 409/1074 [00:47<01:43,  6.41it/s]


0: 384x640 7 persons, 50.8ms
Speed: 2.9ms preprocess, 50.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 410/1074 [00:47<01:50,  6.00it/s]


0: 384x640 8 persons, 29.7ms
Speed: 4.1ms preprocess, 29.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 411/1074 [00:48<01:40,  6.61it/s]


0: 384x640 7 persons, 26.7ms
Speed: 3.2ms preprocess, 26.7ms inference, 6.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 412/1074 [00:48<01:31,  7.21it/s]


0: 384x640 7 persons, 32.0ms
Speed: 2.0ms preprocess, 32.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  38%|███▊      | 413/1074 [00:48<01:26,  7.64it/s]


0: 384x640 7 persons, 28.6ms
Speed: 2.0ms preprocess, 28.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▊      | 414/1074 [00:48<01:24,  7.83it/s]


0: 384x640 7 persons, 29.9ms
Speed: 3.5ms preprocess, 29.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▊      | 415/1074 [00:48<01:23,  7.85it/s]


0: 384x640 7 persons, 27.1ms
Speed: 4.2ms preprocess, 27.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▊      | 416/1074 [00:48<01:22,  7.96it/s]


0: 384x640 7 persons, 33.7ms
Speed: 4.9ms preprocess, 33.7ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▉      | 417/1074 [00:48<01:31,  7.16it/s]


0: 384x640 7 persons, 30.0ms
Speed: 3.2ms preprocess, 30.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▉      | 418/1074 [00:48<01:26,  7.63it/s]


0: 384x640 7 persons, 26.3ms
Speed: 7.5ms preprocess, 26.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▉      | 419/1074 [00:49<01:20,  8.15it/s]


0: 384x640 7 persons, 24.2ms
Speed: 2.6ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▉      | 420/1074 [00:49<01:26,  7.57it/s]


0: 384x640 7 persons, 2 sports balls, 26.7ms
Speed: 3.2ms preprocess, 26.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▉      | 421/1074 [00:49<01:20,  8.08it/s]


0: 384x640 7 persons, 1 sports ball, 24.1ms
Speed: 1.9ms preprocess, 24.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▉      | 422/1074 [00:49<01:22,  7.90it/s]


0: 384x640 7 persons, 1 sports ball, 27.4ms
Speed: 3.5ms preprocess, 27.4ms inference, 5.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▉      | 423/1074 [00:49<01:23,  7.78it/s]


0: 384x640 7 persons, 2 sports balls, 26.0ms
Speed: 4.9ms preprocess, 26.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  39%|███▉      | 424/1074 [00:49<01:23,  7.77it/s]


0: 384x640 7 persons, 2 sports balls, 36.7ms
Speed: 4.1ms preprocess, 36.7ms inference, 10.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|███▉      | 425/1074 [00:49<01:32,  7.02it/s]


0: 384x640 8 persons, 1 sports ball, 27.7ms
Speed: 3.4ms preprocess, 27.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|███▉      | 426/1074 [00:49<01:32,  7.04it/s]


0: 384x640 9 persons, 1 sports ball, 27.9ms
Speed: 2.2ms preprocess, 27.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|███▉      | 427/1074 [00:50<01:28,  7.33it/s]


0: 384x640 8 persons, 1 sports ball, 25.3ms
Speed: 5.2ms preprocess, 25.3ms inference, 4.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|███▉      | 428/1074 [00:50<01:25,  7.56it/s]


0: 384x640 7 persons, 1 sports ball, 30.3ms
Speed: 3.2ms preprocess, 30.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|███▉      | 429/1074 [00:50<01:21,  7.87it/s]


0: 384x640 7 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|████      | 430/1074 [00:50<01:22,  7.78it/s]


0: 384x640 7 persons, 1 sports ball, 26.1ms
Speed: 6.2ms preprocess, 26.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|████      | 431/1074 [00:50<01:22,  7.82it/s]


0: 384x640 7 persons, 1 sports ball, 24.9ms
Speed: 2.1ms preprocess, 24.9ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|████      | 432/1074 [00:50<01:24,  7.62it/s]


0: 384x640 7 persons, 1 sports ball, 32.2ms
Speed: 4.6ms preprocess, 32.2ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|████      | 433/1074 [00:50<01:22,  7.81it/s]


0: 384x640 7 persons, 1 sports ball, 26.1ms
Speed: 2.3ms preprocess, 26.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  40%|████      | 434/1074 [00:51<01:35,  6.69it/s]


0: 384x640 7 persons, 2 sports balls, 29.3ms
Speed: 2.4ms preprocess, 29.3ms inference, 3.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████      | 435/1074 [00:51<01:38,  6.51it/s]


0: 384x640 7 persons, 1 sports ball, 37.9ms
Speed: 2.2ms preprocess, 37.9ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████      | 436/1074 [00:51<01:35,  6.71it/s]


0: 384x640 7 persons, 1 sports ball, 30.1ms
Speed: 3.1ms preprocess, 30.1ms inference, 10.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████      | 437/1074 [00:51<01:43,  6.15it/s]


0: 384x640 7 persons, 1 sports ball, 43.5ms
Speed: 5.4ms preprocess, 43.5ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████      | 438/1074 [00:51<01:49,  5.81it/s]


0: 384x640 7 persons, 33.9ms
Speed: 4.7ms preprocess, 33.9ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████      | 439/1074 [00:51<01:49,  5.81it/s]


0: 384x640 7 persons, 1 sports ball, 51.9ms
Speed: 2.2ms preprocess, 51.9ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████      | 440/1074 [00:52<01:56,  5.45it/s]


0: 384x640 7 persons, 31.9ms
Speed: 5.4ms preprocess, 31.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████      | 441/1074 [00:52<01:55,  5.49it/s]


0: 384x640 7 persons, 45.1ms
Speed: 9.9ms preprocess, 45.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████      | 442/1074 [00:52<01:59,  5.29it/s]


0: 384x640 7 persons, 1 sports ball, 45.6ms
Speed: 10.1ms preprocess, 45.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████      | 443/1074 [00:52<01:53,  5.56it/s]


0: 384x640 7 persons, 1 sports ball, 37.8ms
Speed: 4.2ms preprocess, 37.8ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████▏     | 444/1074 [00:52<02:06,  5.00it/s]


0: 384x640 7 persons, 31.6ms
Speed: 3.3ms preprocess, 31.6ms inference, 2.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  41%|████▏     | 445/1074 [00:53<01:52,  5.60it/s]


0: 384x640 7 persons, 1 sports ball, 25.3ms
Speed: 2.1ms preprocess, 25.3ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 446/1074 [00:53<01:59,  5.26it/s]


0: 384x640 8 persons, 1 sports ball, 37.9ms
Speed: 7.5ms preprocess, 37.9ms inference, 9.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 447/1074 [00:53<01:51,  5.64it/s]


0: 384x640 7 persons, 1 sports ball, 30.2ms
Speed: 2.1ms preprocess, 30.2ms inference, 3.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 448/1074 [00:53<01:48,  5.75it/s]


0: 384x640 7 persons, 30.0ms
Speed: 8.1ms preprocess, 30.0ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 449/1074 [00:53<01:41,  6.15it/s]


0: 384x640 8 persons, 1 sports ball, 24.4ms
Speed: 4.2ms preprocess, 24.4ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 450/1074 [00:53<01:32,  6.75it/s]


0: 384x640 6 persons, 27.2ms
Speed: 3.8ms preprocess, 27.2ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 451/1074 [00:53<01:23,  7.45it/s]


0: 384x640 6 persons, 1 sports ball, 27.9ms
Speed: 8.5ms preprocess, 27.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 452/1074 [00:54<01:23,  7.46it/s]


0: 384x640 6 persons, 44.4ms
Speed: 2.0ms preprocess, 44.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 453/1074 [00:54<01:27,  7.10it/s]


0: 384x640 5 persons, 1 sports ball, 42.8ms
Speed: 8.4ms preprocess, 42.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 454/1074 [00:54<01:38,  6.29it/s]


0: 384x640 5 persons, 1 sports ball, 32.7ms
Speed: 3.1ms preprocess, 32.7ms inference, 5.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 455/1074 [00:54<01:36,  6.38it/s]


0: 384x640 5 persons, 1 sports ball, 42.2ms
Speed: 5.9ms preprocess, 42.2ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  42%|████▏     | 456/1074 [00:54<01:39,  6.23it/s]


0: 384x640 6 persons, 1 sports ball, 33.6ms
Speed: 4.0ms preprocess, 33.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 457/1074 [00:54<01:37,  6.30it/s]


0: 384x640 6 persons, 45.9ms
Speed: 3.0ms preprocess, 45.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 458/1074 [00:55<01:53,  5.42it/s]


0: 384x640 7 persons, 1 sports ball, 33.0ms
Speed: 2.2ms preprocess, 33.0ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 459/1074 [00:55<01:56,  5.28it/s]


0: 384x640 7 persons, 1 sports ball, 30.6ms
Speed: 3.0ms preprocess, 30.6ms inference, 7.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 460/1074 [00:55<02:01,  5.04it/s]


0: 384x640 5 persons, 1 sports ball, 46.5ms
Speed: 2.1ms preprocess, 46.5ms inference, 5.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 461/1074 [00:55<01:52,  5.45it/s]


0: 384x640 6 persons, 1 sports ball, 33.2ms
Speed: 2.2ms preprocess, 33.2ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 462/1074 [00:55<01:42,  5.97it/s]


0: 384x640 6 persons, 1 sports ball, 34.6ms
Speed: 5.8ms preprocess, 34.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 463/1074 [00:56<01:49,  5.57it/s]


0: 384x640 11 persons, 36.7ms
Speed: 13.3ms preprocess, 36.7ms inference, 5.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 464/1074 [00:56<01:58,  5.13it/s]


0: 384x640 11 persons, 43.5ms
Speed: 2.2ms preprocess, 43.5ms inference, 9.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 465/1074 [00:56<02:01,  4.99it/s]


0: 384x640 10 persons, 1 sports ball, 41.9ms
Speed: 5.8ms preprocess, 41.9ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 466/1074 [00:56<02:00,  5.05it/s]


0: 384x640 10 persons, 1 sports ball, 30.2ms
Speed: 3.3ms preprocess, 30.2ms inference, 6.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  43%|████▎     | 467/1074 [00:56<01:49,  5.53it/s]


0: 384x640 10 persons, 1 sports ball, 45.2ms
Speed: 12.9ms preprocess, 45.2ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▎     | 468/1074 [00:57<01:56,  5.21it/s]


0: 384x640 10 persons, 1 sports ball, 36.0ms
Speed: 2.1ms preprocess, 36.0ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▎     | 469/1074 [00:57<01:56,  5.18it/s]


0: 384x640 10 persons, 1 sports ball, 45.7ms
Speed: 2.1ms preprocess, 45.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▍     | 470/1074 [00:57<01:51,  5.42it/s]


0: 384x640 10 persons, 1 sports ball, 26.0ms
Speed: 8.8ms preprocess, 26.0ms inference, 5.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▍     | 471/1074 [00:57<01:53,  5.32it/s]


0: 384x640 10 persons, 1 sports ball, 24.8ms
Speed: 6.8ms preprocess, 24.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▍     | 472/1074 [00:57<02:00,  5.00it/s]


0: 384x640 11 persons, 1 sports ball, 26.9ms
Speed: 2.3ms preprocess, 26.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▍     | 473/1074 [00:57<01:47,  5.59it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▍     | 474/1074 [00:58<01:36,  6.21it/s]


0: 384x640 10 persons, 27.9ms
Speed: 3.7ms preprocess, 27.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▍     | 475/1074 [00:58<01:33,  6.44it/s]


0: 384x640 10 persons, 33.7ms
Speed: 4.1ms preprocess, 33.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▍     | 476/1074 [00:58<01:28,  6.75it/s]


0: 384x640 10 persons, 28.3ms
Speed: 7.4ms preprocess, 28.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  44%|████▍     | 477/1074 [00:58<01:38,  6.09it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 3.4ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▍     | 478/1074 [00:58<01:32,  6.43it/s]


0: 384x640 10 persons, 27.8ms
Speed: 3.4ms preprocess, 27.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▍     | 479/1074 [00:58<01:28,  6.75it/s]


0: 384x640 10 persons, 30.1ms
Speed: 3.6ms preprocess, 30.1ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▍     | 480/1074 [00:58<01:25,  6.99it/s]


0: 384x640 10 persons, 26.3ms
Speed: 8.2ms preprocess, 26.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▍     | 481/1074 [00:59<01:21,  7.27it/s]


0: 384x640 12 persons, 1 sports ball, 29.0ms
Speed: 2.0ms preprocess, 29.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▍     | 482/1074 [00:59<01:29,  6.61it/s]


0: 384x640 10 persons, 1 sports ball, 33.6ms
Speed: 5.9ms preprocess, 33.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▍     | 483/1074 [00:59<01:24,  6.98it/s]


0: 384x640 11 persons, 1 sports ball, 28.4ms
Speed: 1.8ms preprocess, 28.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▌     | 484/1074 [00:59<01:25,  6.94it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.0ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▌     | 485/1074 [00:59<01:26,  6.80it/s]


0: 384x640 10 persons, 34.1ms
Speed: 7.4ms preprocess, 34.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▌     | 486/1074 [00:59<01:39,  5.91it/s]


0: 384x640 10 persons, 24.2ms
Speed: 12.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▌     | 487/1074 [01:00<01:31,  6.42it/s]


0: 384x640 10 persons, 26.8ms
Speed: 2.6ms preprocess, 26.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  45%|████▌     | 488/1074 [01:00<01:25,  6.87it/s]


0: 384x640 10 persons, 1 sports ball, 25.9ms
Speed: 3.9ms preprocess, 25.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▌     | 489/1074 [01:00<01:22,  7.06it/s]


0: 384x640 11 persons, 1 sports ball, 24.5ms
Speed: 5.3ms preprocess, 24.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▌     | 490/1074 [01:00<01:31,  6.35it/s]


0: 384x640 12 persons, 1 sports ball, 30.8ms
Speed: 2.4ms preprocess, 30.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▌     | 491/1074 [01:00<01:38,  5.93it/s]


0: 384x640 10 persons, 1 sports ball, 45.5ms
Speed: 2.2ms preprocess, 45.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▌     | 492/1074 [01:00<01:30,  6.43it/s]


0: 384x640 10 persons, 1 sports ball, 24.7ms
Speed: 2.4ms preprocess, 24.7ms inference, 6.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▌     | 493/1074 [01:00<01:25,  6.78it/s]


0: 384x640 10 persons, 2 sports balls, 28.1ms
Speed: 5.3ms preprocess, 28.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▌     | 494/1074 [01:01<01:21,  7.09it/s]


0: 384x640 10 persons, 2 sports balls, 26.2ms
Speed: 2.6ms preprocess, 26.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▌     | 495/1074 [01:01<01:29,  6.45it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 7.4ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▌     | 496/1074 [01:01<01:24,  6.86it/s]


0: 384x640 10 persons, 1 sports ball, 34.7ms
Speed: 1.9ms preprocess, 34.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▋     | 497/1074 [01:01<01:23,  6.88it/s]


0: 384x640 11 persons, 1 sports ball, 24.4ms
Speed: 4.6ms preprocess, 24.4ms inference, 4.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▋     | 498/1074 [01:01<01:20,  7.17it/s]


0: 384x640 10 persons, 1 sports ball, 25.5ms
Speed: 10.5ms preprocess, 25.5ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  46%|████▋     | 499/1074 [01:01<01:33,  6.15it/s]


0: 384x640 9 persons, 2 sports balls, 29.2ms
Speed: 2.0ms preprocess, 29.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 500/1074 [01:01<01:27,  6.59it/s]


0: 384x640 9 persons, 1 sports ball, 25.5ms
Speed: 2.0ms preprocess, 25.5ms inference, 5.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 501/1074 [01:02<01:25,  6.71it/s]


0: 384x640 10 persons, 1 sports ball, 28.7ms
Speed: 8.3ms preprocess, 28.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 502/1074 [01:02<01:22,  6.91it/s]


0: 384x640 10 persons, 1 sports ball, 28.2ms
Speed: 2.2ms preprocess, 28.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 503/1074 [01:02<01:21,  7.04it/s]


0: 384x640 10 persons, 1 sports ball, 26.1ms
Speed: 3.1ms preprocess, 26.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 504/1074 [01:02<01:31,  6.21it/s]


0: 384x640 10 persons, 1 sports ball, 35.6ms
Speed: 3.3ms preprocess, 35.6ms inference, 2.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 505/1074 [01:02<01:39,  5.71it/s]


0: 384x640 10 persons, 1 sports ball, 33.0ms
Speed: 2.1ms preprocess, 33.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 506/1074 [01:02<01:30,  6.28it/s]


0: 384x640 10 persons, 1 sports ball, 36.1ms
Speed: 2.4ms preprocess, 36.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 507/1074 [01:03<01:32,  6.14it/s]


0: 384x640 11 persons, 1 sports ball, 26.4ms
Speed: 8.6ms preprocess, 26.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 508/1074 [01:03<01:25,  6.58it/s]


0: 384x640 9 persons, 1 sports ball, 26.5ms
Speed: 3.1ms preprocess, 26.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 509/1074 [01:03<01:32,  6.11it/s]


0: 384x640 10 persons, 1 sports ball, 26.6ms
Speed: 5.4ms preprocess, 26.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  47%|████▋     | 510/1074 [01:03<01:25,  6.60it/s]


0: 384x640 8 persons, 1 sports ball, 27.7ms
Speed: 1.9ms preprocess, 27.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 511/1074 [01:03<01:30,  6.26it/s]


0: 384x640 8 persons, 1 sports ball, 39.3ms
Speed: 9.9ms preprocess, 39.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 512/1074 [01:03<01:27,  6.44it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 2.8ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 513/1074 [01:04<01:24,  6.62it/s]


0: 384x640 10 persons, 2 sports balls, 25.2ms
Speed: 3.2ms preprocess, 25.2ms inference, 6.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 514/1074 [01:04<01:31,  6.14it/s]


0: 384x640 8 persons, 1 sports ball, 33.1ms
Speed: 3.3ms preprocess, 33.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 515/1074 [01:04<01:34,  5.90it/s]


0: 384x640 8 persons, 1 sports ball, 31.0ms
Speed: 2.1ms preprocess, 31.0ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 516/1074 [01:04<01:28,  6.33it/s]


0: 384x640 9 persons, 1 sports ball, 30.5ms
Speed: 3.3ms preprocess, 30.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 517/1074 [01:04<01:31,  6.07it/s]


0: 384x640 9 persons, 1 sports ball, 39.3ms
Speed: 10.8ms preprocess, 39.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 518/1074 [01:04<01:38,  5.67it/s]


0: 384x640 10 persons, 1 sports ball, 28.4ms
Speed: 2.3ms preprocess, 28.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 519/1074 [01:05<01:29,  6.22it/s]


0: 384x640 10 persons, 2 sports balls, 26.8ms
Speed: 6.3ms preprocess, 26.8ms inference, 9.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  48%|████▊     | 520/1074 [01:05<01:37,  5.68it/s]


0: 384x640 9 persons, 2 sports balls, 31.2ms
Speed: 2.1ms preprocess, 31.2ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▊     | 521/1074 [01:05<01:38,  5.60it/s]


0: 384x640 9 persons, 1 sports ball, 29.3ms
Speed: 8.0ms preprocess, 29.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▊     | 522/1074 [01:05<01:31,  6.00it/s]


0: 384x640 9 persons, 1 sports ball, 36.6ms
Speed: 6.9ms preprocess, 36.6ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▊     | 523/1074 [01:05<01:34,  5.80it/s]


0: 384x640 9 persons, 1 sports ball, 24.2ms
Speed: 2.3ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▉     | 524/1074 [01:05<01:26,  6.36it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▉     | 525/1074 [01:06<01:33,  5.84it/s]


0: 384x640 9 persons, 1 sports ball, 27.6ms
Speed: 2.2ms preprocess, 27.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▉     | 526/1074 [01:06<01:25,  6.39it/s]


0: 384x640 8 persons, 1 sports ball, 27.5ms
Speed: 2.0ms preprocess, 27.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▉     | 527/1074 [01:06<01:20,  6.79it/s]


0: 384x640 8 persons, 1 sports ball, 35.5ms
Speed: 2.0ms preprocess, 35.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▉     | 528/1074 [01:06<01:17,  7.02it/s]


0: 384x640 8 persons, 1 sports ball, 24.8ms
Speed: 2.4ms preprocess, 24.8ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▉     | 529/1074 [01:06<01:27,  6.22it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 2.3ms preprocess, 24.2ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▉     | 530/1074 [01:06<01:18,  6.95it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 2.5ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  49%|████▉     | 531/1074 [01:06<01:13,  7.41it/s]


0: 384x640 8 persons, 1 sports ball, 27.3ms
Speed: 2.1ms preprocess, 27.3ms inference, 8.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|████▉     | 532/1074 [01:07<01:22,  6.55it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|████▉     | 533/1074 [01:07<01:15,  7.14it/s]


0: 384x640 8 persons, 26.3ms
Speed: 3.5ms preprocess, 26.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|████▉     | 534/1074 [01:07<01:12,  7.40it/s]


0: 384x640 8 persons, 1 sports ball, 25.2ms
Speed: 3.7ms preprocess, 25.2ms inference, 6.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|████▉     | 535/1074 [01:07<01:10,  7.59it/s]


0: 384x640 8 persons, 1 sports ball, 29.6ms
Speed: 2.7ms preprocess, 29.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|████▉     | 536/1074 [01:07<01:14,  7.23it/s]


0: 384x640 8 persons, 1 sports ball, 47.8ms
Speed: 7.7ms preprocess, 47.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|█████     | 537/1074 [01:07<01:30,  5.95it/s]


0: 384x640 8 persons, 1 sports ball, 29.6ms
Speed: 2.3ms preprocess, 29.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|█████     | 538/1074 [01:07<01:23,  6.41it/s]


0: 384x640 8 persons, 1 sports ball, 51.8ms
Speed: 2.2ms preprocess, 51.8ms inference, 2.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|█████     | 539/1074 [01:08<01:34,  5.69it/s]


0: 384x640 8 persons, 1 sports ball, 29.6ms
Speed: 18.9ms preprocess, 29.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|█████     | 540/1074 [01:08<01:30,  5.92it/s]


0: 384x640 8 persons, 1 sports ball, 28.7ms
Speed: 8.1ms preprocess, 28.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|█████     | 541/1074 [01:08<01:26,  6.16it/s]


0: 384x640 8 persons, 1 sports ball, 30.9ms
Speed: 9.4ms preprocess, 30.9ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  50%|█████     | 542/1074 [01:08<01:27,  6.10it/s]


0: 384x640 8 persons, 1 sports ball, 39.5ms
Speed: 6.7ms preprocess, 39.5ms inference, 7.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████     | 543/1074 [01:08<01:40,  5.30it/s]


0: 384x640 8 persons, 1 sports ball, 28.9ms
Speed: 4.3ms preprocess, 28.9ms inference, 3.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████     | 544/1074 [01:09<01:40,  5.30it/s]


0: 384x640 8 persons, 1 sports ball, 39.6ms
Speed: 8.4ms preprocess, 39.6ms inference, 14.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████     | 545/1074 [01:09<01:49,  4.83it/s]


0: 384x640 8 persons, 1 sports ball, 31.1ms
Speed: 7.1ms preprocess, 31.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████     | 546/1074 [01:09<01:40,  5.27it/s]


0: 384x640 8 persons, 1 sports ball, 33.1ms
Speed: 6.0ms preprocess, 33.1ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████     | 547/1074 [01:09<01:36,  5.43it/s]


0: 384x640 8 persons, 1 sports ball, 34.5ms
Speed: 7.8ms preprocess, 34.5ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████     | 548/1074 [01:09<01:34,  5.59it/s]


0: 384x640 8 persons, 1 sports ball, 26.6ms
Speed: 4.5ms preprocess, 26.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████     | 549/1074 [01:09<01:28,  5.96it/s]


0: 384x640 8 persons, 1 sports ball, 25.6ms
Speed: 10.6ms preprocess, 25.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████     | 550/1074 [01:10<01:26,  6.06it/s]


0: 384x640 8 persons, 1 sports ball, 31.1ms
Speed: 6.3ms preprocess, 31.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████▏    | 551/1074 [01:10<01:24,  6.18it/s]


0: 384x640 8 persons, 1 sports ball, 27.7ms
Speed: 8.3ms preprocess, 27.7ms inference, 3.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████▏    | 552/1074 [01:10<01:23,  6.22it/s]


0: 384x640 8 persons, 1 sports ball, 28.4ms
Speed: 8.5ms preprocess, 28.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  51%|█████▏    | 553/1074 [01:10<01:23,  6.21it/s]


0: 384x640 8 persons, 1 sports ball, 54.6ms
Speed: 2.2ms preprocess, 54.6ms inference, 9.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 554/1074 [01:10<01:27,  5.95it/s]


0: 384x640 8 persons, 1 sports ball, 35.6ms
Speed: 3.2ms preprocess, 35.6ms inference, 3.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 555/1074 [01:10<01:24,  6.16it/s]


0: 384x640 8 persons, 24.2ms
Speed: 2.4ms preprocess, 24.2ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 556/1074 [01:11<01:21,  6.36it/s]


0: 384x640 9 persons, 48.4ms
Speed: 6.0ms preprocess, 48.4ms inference, 7.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 557/1074 [01:11<01:32,  5.58it/s]


0: 384x640 7 persons, 1 sports ball, 29.7ms
Speed: 4.6ms preprocess, 29.7ms inference, 3.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 558/1074 [01:11<01:36,  5.37it/s]


0: 384x640 7 persons, 1 sports ball, 56.6ms
Speed: 3.5ms preprocess, 56.6ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 559/1074 [01:11<01:36,  5.32it/s]


0: 384x640 7 persons, 2 sports balls, 25.6ms
Speed: 2.8ms preprocess, 25.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 560/1074 [01:11<01:31,  5.61it/s]


0: 384x640 7 persons, 1 sports ball, 64.4ms
Speed: 2.3ms preprocess, 64.4ms inference, 6.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 561/1074 [01:12<01:29,  5.76it/s]


0: 384x640 8 persons, 1 sports ball, 34.3ms
Speed: 3.9ms preprocess, 34.3ms inference, 7.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 562/1074 [01:12<01:34,  5.43it/s]


0: 384x640 9 persons, 1 sports ball, 32.1ms
Speed: 5.5ms preprocess, 32.1ms inference, 9.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  52%|█████▏    | 563/1074 [01:12<01:38,  5.17it/s]


0: 384x640 9 persons, 1 sports ball, 29.3ms
Speed: 2.9ms preprocess, 29.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 564/1074 [01:12<01:46,  4.79it/s]


0: 384x640 10 persons, 1 sports ball, 42.0ms
Speed: 3.9ms preprocess, 42.0ms inference, 2.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 565/1074 [01:12<01:50,  4.59it/s]


0: 384x640 8 persons, 1 sports ball, 42.9ms
Speed: 2.4ms preprocess, 42.9ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 566/1074 [01:13<01:51,  4.54it/s]


0: 384x640 7 persons, 41.7ms
Speed: 2.3ms preprocess, 41.7ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 567/1074 [01:13<01:39,  5.08it/s]


0: 384x640 7 persons, 1 sports ball, 28.8ms
Speed: 2.0ms preprocess, 28.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 568/1074 [01:13<01:36,  5.23it/s]


0: 384x640 8 persons, 1 sports ball, 30.9ms
Speed: 2.1ms preprocess, 30.9ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 569/1074 [01:13<01:39,  5.10it/s]


0: 384x640 7 persons, 1 sports ball, 30.1ms
Speed: 7.1ms preprocess, 30.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 570/1074 [01:13<01:28,  5.67it/s]


0: 384x640 7 persons, 1 sports ball, 25.4ms
Speed: 2.1ms preprocess, 25.4ms inference, 9.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 571/1074 [01:13<01:21,  6.16it/s]


0: 384x640 7 persons, 1 sports ball, 27.1ms
Speed: 5.2ms preprocess, 27.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 572/1074 [01:14<01:18,  6.39it/s]


0: 384x640 7 persons, 1 sports ball, 43.6ms
Speed: 2.0ms preprocess, 43.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 573/1074 [01:14<01:15,  6.62it/s]


0: 384x640 7 persons, 1 sports ball, 32.3ms
Speed: 2.6ms preprocess, 32.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  53%|█████▎    | 574/1074 [01:14<01:17,  6.44it/s]


0: 384x640 6 persons, 1 sports ball, 25.4ms
Speed: 10.3ms preprocess, 25.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▎    | 575/1074 [01:14<01:10,  7.12it/s]


0: 384x640 6 persons, 1 sports ball, 25.9ms
Speed: 9.1ms preprocess, 25.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▎    | 576/1074 [01:14<01:06,  7.53it/s]


0: 384x640 7 persons, 30.1ms
Speed: 2.2ms preprocess, 30.1ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▎    | 577/1074 [01:14<01:06,  7.43it/s]


0: 384x640 7 persons, 1 sports ball, 27.0ms
Speed: 3.5ms preprocess, 27.0ms inference, 3.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▍    | 578/1074 [01:14<01:16,  6.49it/s]


0: 384x640 7 persons, 1 sports ball, 28.6ms
Speed: 2.1ms preprocess, 28.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▍    | 579/1074 [01:15<01:12,  6.80it/s]


0: 384x640 6 persons, 1 sports ball, 28.6ms
Speed: 2.0ms preprocess, 28.6ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▍    | 580/1074 [01:15<01:09,  7.09it/s]


0: 384x640 6 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 5.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▍    | 581/1074 [01:15<01:13,  6.72it/s]


0: 384x640 7 persons, 30.9ms
Speed: 5.5ms preprocess, 30.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▍    | 582/1074 [01:15<01:11,  6.89it/s]


0: 384x640 6 persons, 1 sports ball, 28.9ms
Speed: 3.6ms preprocess, 28.9ms inference, 6.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▍    | 583/1074 [01:15<01:09,  7.04it/s]


0: 384x640 6 persons, 1 sports ball, 33.0ms
Speed: 2.4ms preprocess, 33.0ms inference, 4.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▍    | 584/1074 [01:15<01:17,  6.34it/s]


0: 384x640 8 persons, 1 sports ball, 31.6ms
Speed: 2.5ms preprocess, 31.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  54%|█████▍    | 585/1074 [01:15<01:10,  6.95it/s]


0: 384x640 7 persons, 1 sports ball, 26.6ms
Speed: 3.1ms preprocess, 26.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▍    | 586/1074 [01:16<01:06,  7.36it/s]


0: 384x640 7 persons, 1 sports ball, 24.2ms
Speed: 2.9ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▍    | 587/1074 [01:16<01:10,  6.87it/s]


0: 384x640 7 persons, 1 sports ball, 35.8ms
Speed: 2.1ms preprocess, 35.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▍    | 588/1074 [01:16<01:06,  7.27it/s]


0: 384x640 7 persons, 2 sports balls, 24.2ms
Speed: 2.5ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▍    | 589/1074 [01:16<01:19,  6.10it/s]


0: 384x640 7 persons, 1 sports ball, 26.2ms
Speed: 2.5ms preprocess, 26.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▍    | 590/1074 [01:16<01:12,  6.71it/s]


0: 384x640 7 persons, 1 sports ball, 47.9ms
Speed: 7.8ms preprocess, 47.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▌    | 591/1074 [01:16<01:23,  5.77it/s]


0: 384x640 6 persons, 1 sports ball, 24.2ms
Speed: 9.3ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▌    | 592/1074 [01:17<01:16,  6.29it/s]


0: 384x640 8 persons, 1 sports ball, 24.7ms
Speed: 10.1ms preprocess, 24.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▌    | 593/1074 [01:17<01:20,  6.01it/s]


0: 384x640 8 persons, 1 sports ball, 24.4ms
Speed: 6.8ms preprocess, 24.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▌    | 594/1074 [01:17<01:14,  6.46it/s]


0: 384x640 7 persons, 1 sports ball, 30.3ms
Speed: 3.1ms preprocess, 30.3ms inference, 8.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▌    | 595/1074 [01:17<01:16,  6.26it/s]


0: 384x640 8 persons, 1 sports ball, 26.8ms
Speed: 2.8ms preprocess, 26.8ms inference, 9.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  55%|█████▌    | 596/1074 [01:17<01:09,  6.84it/s]


0: 384x640 7 persons, 1 sports ball, 35.2ms
Speed: 2.5ms preprocess, 35.2ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▌    | 597/1074 [01:17<01:08,  6.95it/s]


0: 384x640 7 persons, 1 sports ball, 35.8ms
Speed: 4.2ms preprocess, 35.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▌    | 598/1074 [01:17<01:19,  5.95it/s]


0: 384x640 7 persons, 2 sports balls, 24.7ms
Speed: 2.3ms preprocess, 24.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▌    | 599/1074 [01:18<01:12,  6.55it/s]


0: 384x640 7 persons, 1 sports ball, 27.9ms
Speed: 2.1ms preprocess, 27.9ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▌    | 600/1074 [01:18<01:10,  6.74it/s]


0: 384x640 7 persons, 1 sports ball, 27.6ms
Speed: 2.6ms preprocess, 27.6ms inference, 6.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▌    | 601/1074 [01:18<01:06,  7.06it/s]


0: 384x640 7 persons, 1 sports ball, 24.3ms
Speed: 2.1ms preprocess, 24.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▌    | 602/1074 [01:18<01:03,  7.38it/s]


0: 384x640 9 persons, 1 sports ball, 27.2ms
Speed: 3.8ms preprocess, 27.2ms inference, 2.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▌    | 603/1074 [01:18<01:03,  7.45it/s]


0: 384x640 9 persons, 1 sports ball, 32.2ms
Speed: 4.3ms preprocess, 32.2ms inference, 4.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▌    | 604/1074 [01:18<01:12,  6.50it/s]


0: 384x640 9 persons, 1 sports ball, 35.5ms
Speed: 4.2ms preprocess, 35.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▋    | 605/1074 [01:18<01:10,  6.65it/s]


0: 384x640 8 persons, 1 sports ball, 26.0ms
Speed: 4.0ms preprocess, 26.0ms inference, 5.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  56%|█████▋    | 606/1074 [01:19<01:12,  6.48it/s]


0: 384x640 8 persons, 1 sports ball, 39.9ms
Speed: 2.0ms preprocess, 39.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 607/1074 [01:19<01:10,  6.59it/s]


0: 384x640 9 persons, 1 sports ball, 36.4ms
Speed: 2.2ms preprocess, 36.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 608/1074 [01:19<01:19,  5.86it/s]


0: 384x640 8 persons, 1 sports ball, 29.3ms
Speed: 4.6ms preprocess, 29.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 609/1074 [01:19<01:14,  6.28it/s]


0: 384x640 8 persons, 1 sports ball, 49.3ms
Speed: 5.4ms preprocess, 49.3ms inference, 5.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 610/1074 [01:19<01:13,  6.34it/s]


0: 384x640 8 persons, 42.8ms
Speed: 4.8ms preprocess, 42.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 611/1074 [01:19<01:19,  5.82it/s]


0: 384x640 9 persons, 2 sports balls, 24.3ms
Speed: 2.2ms preprocess, 24.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 612/1074 [01:20<01:11,  6.42it/s]


0: 384x640 8 persons, 1 sports ball, 24.9ms
Speed: 2.0ms preprocess, 24.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 613/1074 [01:20<01:08,  6.77it/s]


0: 384x640 8 persons, 1 sports ball, 42.2ms
Speed: 3.4ms preprocess, 42.2ms inference, 4.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 614/1074 [01:20<01:16,  6.00it/s]


0: 384x640 8 persons, 1 sports ball, 27.6ms
Speed: 2.7ms preprocess, 27.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 615/1074 [01:20<01:10,  6.53it/s]


0: 384x640 8 persons, 1 sports ball, 25.3ms
Speed: 2.0ms preprocess, 25.3ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 616/1074 [01:20<01:16,  5.98it/s]


0: 384x640 8 persons, 1 sports ball, 37.3ms
Speed: 3.8ms preprocess, 37.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  57%|█████▋    | 617/1074 [01:20<01:22,  5.53it/s]


0: 384x640 8 persons, 1 sports ball, 28.1ms
Speed: 3.2ms preprocess, 28.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 618/1074 [01:21<01:19,  5.76it/s]


0: 384x640 8 persons, 1 sports ball, 39.7ms
Speed: 2.2ms preprocess, 39.7ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 619/1074 [01:21<01:13,  6.17it/s]


0: 384x640 8 persons, 1 sports ball, 37.5ms
Speed: 2.3ms preprocess, 37.5ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 620/1074 [01:21<01:15,  6.02it/s]


0: 384x640 8 persons, 1 sports ball, 27.7ms
Speed: 2.3ms preprocess, 27.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 621/1074 [01:21<01:10,  6.46it/s]


0: 384x640 8 persons, 1 sports ball, 29.3ms
Speed: 2.0ms preprocess, 29.3ms inference, 6.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 622/1074 [01:21<01:13,  6.15it/s]


0: 384x640 8 persons, 1 sports ball, 47.3ms
Speed: 2.0ms preprocess, 47.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 623/1074 [01:21<01:17,  5.81it/s]


0: 384x640 8 persons, 1 sports ball, 30.6ms
Speed: 2.1ms preprocess, 30.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 624/1074 [01:22<01:11,  6.33it/s]


0: 384x640 8 persons, 1 sports ball, 36.4ms
Speed: 2.2ms preprocess, 36.4ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 625/1074 [01:22<01:17,  5.77it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 10.7ms preprocess, 24.2ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 626/1074 [01:22<01:21,  5.47it/s]


0: 384x640 8 persons, 1 sports ball, 31.6ms
Speed: 2.2ms preprocess, 31.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 627/1074 [01:22<01:13,  6.06it/s]


0: 384x640 8 persons, 1 sports ball, 24.4ms
Speed: 3.2ms preprocess, 24.4ms inference, 3.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  58%|█████▊    | 628/1074 [01:22<01:10,  6.29it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 9.4ms preprocess, 24.2ms inference, 7.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▊    | 629/1074 [01:22<01:16,  5.83it/s]


0: 384x640 8 persons, 1 sports ball, 25.8ms
Speed: 4.7ms preprocess, 25.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▊    | 630/1074 [01:23<01:10,  6.32it/s]


0: 384x640 8 persons, 1 sports ball, 32.2ms
Speed: 2.7ms preprocess, 32.2ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▉    | 631/1074 [01:23<01:18,  5.65it/s]


0: 384x640 8 persons, 1 sports ball, 32.6ms
Speed: 8.2ms preprocess, 32.6ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▉    | 632/1074 [01:23<01:22,  5.33it/s]


0: 384x640 8 persons, 1 sports ball, 29.7ms
Speed: 2.1ms preprocess, 29.7ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▉    | 633/1074 [01:23<01:16,  5.77it/s]


0: 384x640 8 persons, 1 sports ball, 43.1ms
Speed: 2.2ms preprocess, 43.1ms inference, 4.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▉    | 634/1074 [01:23<01:16,  5.76it/s]


0: 384x640 8 persons, 1 sports ball, 45.1ms
Speed: 3.8ms preprocess, 45.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▉    | 635/1074 [01:24<01:18,  5.59it/s]


0: 384x640 8 persons, 1 sports ball, 47.2ms
Speed: 3.4ms preprocess, 47.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▉    | 636/1074 [01:24<01:13,  5.93it/s]


0: 384x640 8 persons, 1 sports ball, 27.0ms
Speed: 7.7ms preprocess, 27.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▉    | 637/1074 [01:24<01:14,  5.85it/s]


0: 384x640 8 persons, 1 sports ball, 45.9ms
Speed: 2.2ms preprocess, 45.9ms inference, 6.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▉    | 638/1074 [01:24<01:15,  5.74it/s]


0: 384x640 8 persons, 1 sports ball, 46.6ms
Speed: 2.2ms preprocess, 46.6ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  59%|█████▉    | 639/1074 [01:24<01:17,  5.62it/s]


0: 384x640 8 persons, 1 sports ball, 43.0ms
Speed: 4.2ms preprocess, 43.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|█████▉    | 640/1074 [01:24<01:14,  5.83it/s]


0: 384x640 8 persons, 1 sports ball, 28.1ms
Speed: 4.6ms preprocess, 28.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|█████▉    | 641/1074 [01:25<01:10,  6.16it/s]


0: 384x640 8 persons, 1 sports ball, 34.8ms
Speed: 6.8ms preprocess, 34.8ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|█████▉    | 642/1074 [01:25<01:13,  5.89it/s]


0: 384x640 8 persons, 1 sports ball, 48.3ms
Speed: 2.0ms preprocess, 48.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|█████▉    | 643/1074 [01:25<01:20,  5.34it/s]


0: 384x640 8 persons, 1 sports ball, 39.7ms
Speed: 2.7ms preprocess, 39.7ms inference, 9.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|█████▉    | 644/1074 [01:25<01:16,  5.59it/s]


0: 384x640 8 persons, 1 sports ball, 37.1ms
Speed: 2.1ms preprocess, 37.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|██████    | 645/1074 [01:25<01:20,  5.31it/s]


0: 384x640 8 persons, 1 sports ball, 34.1ms
Speed: 4.3ms preprocess, 34.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|██████    | 646/1074 [01:25<01:18,  5.44it/s]


0: 384x640 8 persons, 1 sports ball, 33.9ms
Speed: 9.6ms preprocess, 33.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|██████    | 647/1074 [01:26<01:15,  5.63it/s]


0: 384x640 8 persons, 1 sports ball, 43.0ms
Speed: 1.9ms preprocess, 43.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|██████    | 648/1074 [01:26<01:14,  5.73it/s]


0: 384x640 8 persons, 2 sports balls, 28.6ms
Speed: 6.7ms preprocess, 28.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  60%|██████    | 649/1074 [01:26<01:11,  5.93it/s]


0: 384x640 8 persons, 1 sports ball, 35.9ms
Speed: 2.2ms preprocess, 35.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████    | 650/1074 [01:26<01:10,  6.00it/s]


0: 384x640 8 persons, 1 sports ball, 36.6ms
Speed: 3.1ms preprocess, 36.6ms inference, 3.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████    | 651/1074 [01:26<01:08,  6.15it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 2.3ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████    | 652/1074 [01:26<01:12,  5.84it/s]


0: 384x640 8 persons, 2 sports balls, 45.6ms
Speed: 2.2ms preprocess, 45.6ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████    | 653/1074 [01:27<01:17,  5.44it/s]


0: 384x640 8 persons, 1 sports ball, 43.4ms
Speed: 3.1ms preprocess, 43.4ms inference, 9.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████    | 654/1074 [01:27<01:20,  5.23it/s]


0: 384x640 8 persons, 1 sports ball, 30.6ms
Speed: 2.1ms preprocess, 30.6ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████    | 655/1074 [01:27<01:13,  5.69it/s]


0: 384x640 8 persons, 1 sports ball, 29.4ms
Speed: 2.3ms preprocess, 29.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████    | 656/1074 [01:27<01:11,  5.83it/s]


0: 384x640 8 persons, 1 sports ball, 66.4ms
Speed: 5.0ms preprocess, 66.4ms inference, 3.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████    | 657/1074 [01:27<01:14,  5.60it/s]


0: 384x640 8 persons, 1 sports ball, 39.6ms
Speed: 9.7ms preprocess, 39.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████▏   | 658/1074 [01:28<01:19,  5.21it/s]


0: 384x640 9 persons, 1 sports ball, 42.3ms
Speed: 6.2ms preprocess, 42.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████▏   | 659/1074 [01:28<01:16,  5.45it/s]


0: 384x640 9 persons, 1 sports ball, 32.7ms
Speed: 2.0ms preprocess, 32.7ms inference, 9.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  61%|██████▏   | 660/1074 [01:28<01:23,  4.94it/s]


0: 384x640 9 persons, 2 sports balls, 32.5ms
Speed: 14.6ms preprocess, 32.5ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 661/1074 [01:28<01:27,  4.70it/s]


0: 384x640 10 persons, 1 sports ball, 35.5ms
Speed: 7.8ms preprocess, 35.5ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 662/1074 [01:28<01:23,  4.94it/s]


0: 384x640 10 persons, 1 sports ball, 48.0ms
Speed: 7.7ms preprocess, 48.0ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 663/1074 [01:29<01:27,  4.70it/s]


0: 384x640 10 persons, 2 sports balls, 32.8ms
Speed: 5.1ms preprocess, 32.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 664/1074 [01:29<01:29,  4.56it/s]


0: 384x640 10 persons, 1 sports ball, 37.3ms
Speed: 6.0ms preprocess, 37.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 665/1074 [01:29<01:22,  4.95it/s]


0: 384x640 10 persons, 2 sports balls, 29.9ms
Speed: 2.7ms preprocess, 29.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 666/1074 [01:29<01:20,  5.06it/s]


0: 384x640 10 persons, 2 sports balls, 41.8ms
Speed: 2.0ms preprocess, 41.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 667/1074 [01:29<01:23,  4.90it/s]


0: 384x640 10 persons, 1 sports ball, 29.5ms
Speed: 2.0ms preprocess, 29.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 668/1074 [01:30<01:24,  4.79it/s]


0: 384x640 11 persons, 1 sports ball, 27.3ms
Speed: 2.3ms preprocess, 27.3ms inference, 4.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 669/1074 [01:30<01:13,  5.51it/s]


0: 384x640 11 persons, 1 sports ball, 31.9ms
Speed: 2.3ms preprocess, 31.9ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 670/1074 [01:30<01:18,  5.14it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 6.3ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  62%|██████▏   | 671/1074 [01:30<01:10,  5.71it/s]


0: 384x640 10 persons, 36.5ms
Speed: 2.8ms preprocess, 36.5ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 672/1074 [01:30<01:11,  5.63it/s]


0: 384x640 10 persons, 30.4ms
Speed: 2.6ms preprocess, 30.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 673/1074 [01:30<01:06,  6.06it/s]


0: 384x640 11 persons, 1 sports ball, 30.6ms
Speed: 2.0ms preprocess, 30.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 674/1074 [01:31<01:07,  5.94it/s]


0: 384x640 10 persons, 1 sports ball, 31.8ms
Speed: 2.2ms preprocess, 31.8ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 675/1074 [01:31<01:02,  6.40it/s]


0: 384x640 10 persons, 1 sports ball, 27.1ms
Speed: 2.5ms preprocess, 27.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 676/1074 [01:31<01:07,  5.87it/s]


0: 384x640 9 persons, 1 sports ball, 36.0ms
Speed: 2.1ms preprocess, 36.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 677/1074 [01:31<01:10,  5.65it/s]


0: 384x640 9 persons, 1 sports ball, 35.0ms
Speed: 10.3ms preprocess, 35.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 678/1074 [01:31<01:09,  5.66it/s]


0: 384x640 9 persons, 1 sports ball, 32.2ms
Speed: 2.1ms preprocess, 32.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 679/1074 [01:32<01:13,  5.36it/s]


0: 384x640 10 persons, 1 sports ball, 26.7ms
Speed: 9.5ms preprocess, 26.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 680/1074 [01:32<01:11,  5.54it/s]


0: 384x640 9 persons, 2 sports balls, 32.4ms
Speed: 2.3ms preprocess, 32.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  63%|██████▎   | 681/1074 [01:32<01:14,  5.27it/s]


0: 384x640 9 persons, 2 sports balls, 30.3ms
Speed: 4.2ms preprocess, 30.3ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▎   | 682/1074 [01:32<01:08,  5.70it/s]


0: 384x640 10 persons, 2 sports balls, 29.5ms
Speed: 6.8ms preprocess, 29.5ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▎   | 683/1074 [01:32<01:08,  5.72it/s]


0: 384x640 9 persons, 2 sports balls, 32.3ms
Speed: 11.6ms preprocess, 32.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▎   | 684/1074 [01:33<01:17,  5.05it/s]


0: 384x640 8 persons, 1 sports ball, 36.0ms
Speed: 2.1ms preprocess, 36.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▍   | 685/1074 [01:33<01:16,  5.07it/s]


0: 384x640 9 persons, 1 sports ball, 39.8ms
Speed: 2.1ms preprocess, 39.8ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▍   | 686/1074 [01:33<01:08,  5.69it/s]


0: 384x640 9 persons, 2 sports balls, 26.0ms
Speed: 15.5ms preprocess, 26.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▍   | 687/1074 [01:33<01:12,  5.32it/s]


0: 384x640 9 persons, 24.4ms
Speed: 4.9ms preprocess, 24.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▍   | 688/1074 [01:33<01:08,  5.65it/s]


0: 384x640 9 persons, 27.7ms
Speed: 15.7ms preprocess, 27.7ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▍   | 689/1074 [01:33<01:07,  5.74it/s]


0: 384x640 9 persons, 1 sports ball, 29.7ms
Speed: 2.0ms preprocess, 29.7ms inference, 3.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▍   | 690/1074 [01:34<01:10,  5.41it/s]


0: 384x640 9 persons, 2 sports balls, 35.8ms
Speed: 2.1ms preprocess, 35.8ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▍   | 691/1074 [01:34<01:14,  5.15it/s]


0: 384x640 9 persons, 2 sports balls, 31.2ms
Speed: 2.1ms preprocess, 31.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  64%|██████▍   | 692/1074 [01:34<01:04,  5.89it/s]


0: 384x640 9 persons, 2 sports balls, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▍   | 693/1074 [01:34<01:01,  6.21it/s]


0: 384x640 10 persons, 2 sports balls, 38.3ms
Speed: 2.1ms preprocess, 38.3ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▍   | 694/1074 [01:34<01:06,  5.73it/s]


0: 384x640 9 persons, 1 sports ball, 35.6ms
Speed: 7.1ms preprocess, 35.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▍   | 695/1074 [01:34<01:02,  6.06it/s]


0: 384x640 9 persons, 2 sports balls, 29.4ms
Speed: 6.9ms preprocess, 29.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▍   | 696/1074 [01:35<00:59,  6.34it/s]


0: 384x640 10 persons, 2 sports balls, 31.8ms
Speed: 2.2ms preprocess, 31.8ms inference, 6.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▍   | 697/1074 [01:35<01:07,  5.57it/s]


0: 384x640 9 persons, 1 sports ball, 27.8ms
Speed: 2.0ms preprocess, 27.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▍   | 698/1074 [01:35<01:12,  5.18it/s]


0: 384x640 9 persons, 1 sports ball, 24.2ms
Speed: 2.4ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▌   | 699/1074 [01:35<01:12,  5.18it/s]


0: 384x640 9 persons, 1 sports ball, 37.5ms
Speed: 2.0ms preprocess, 37.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▌   | 700/1074 [01:35<01:06,  5.64it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.0ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▌   | 701/1074 [01:36<01:08,  5.43it/s]


0: 384x640 9 persons, 1 sports ball, 24.2ms
Speed: 2.0ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▌   | 702/1074 [01:36<01:10,  5.24it/s]


0: 384x640 8 persons, 1 sports ball, 29.2ms
Speed: 3.0ms preprocess, 29.2ms inference, 4.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  65%|██████▌   | 703/1074 [01:36<01:06,  5.59it/s]


0: 384x640 8 persons, 1 sports ball, 24.4ms
Speed: 2.1ms preprocess, 24.4ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▌   | 704/1074 [01:36<01:01,  6.06it/s]


0: 384x640 8 persons, 1 sports ball, 31.7ms
Speed: 5.5ms preprocess, 31.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▌   | 705/1074 [01:36<00:58,  6.27it/s]


0: 384x640 8 persons, 1 sports ball, 24.8ms
Speed: 2.1ms preprocess, 24.8ms inference, 4.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▌   | 706/1074 [01:36<00:55,  6.58it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▌   | 707/1074 [01:36<00:58,  6.30it/s]


0: 384x640 8 persons, 1 sports ball, 28.6ms
Speed: 8.4ms preprocess, 28.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▌   | 708/1074 [01:37<00:53,  6.78it/s]


0: 384x640 8 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▌   | 709/1074 [01:37<00:52,  6.90it/s]


0: 384x640 8 persons, 1 sports ball, 37.1ms
Speed: 7.7ms preprocess, 37.1ms inference, 8.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▌   | 710/1074 [01:37<01:02,  5.87it/s]


0: 384x640 8 persons, 1 sports ball, 32.2ms
Speed: 7.5ms preprocess, 32.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▌   | 711/1074 [01:37<01:06,  5.44it/s]


0: 384x640 8 persons, 1 sports ball, 38.0ms
Speed: 7.6ms preprocess, 38.0ms inference, 3.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▋   | 712/1074 [01:37<01:00,  5.94it/s]


0: 384x640 8 persons, 1 sports ball, 33.9ms
Speed: 2.2ms preprocess, 33.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▋   | 713/1074 [01:37<01:01,  5.87it/s]


0: 384x640 8 persons, 1 sports ball, 37.5ms
Speed: 2.1ms preprocess, 37.5ms inference, 8.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  66%|██████▋   | 714/1074 [01:38<00:57,  6.29it/s]


0: 384x640 7 persons, 1 sports ball, 30.2ms
Speed: 3.6ms preprocess, 30.2ms inference, 5.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 715/1074 [01:38<01:04,  5.61it/s]


0: 384x640 7 persons, 1 sports ball, 30.4ms
Speed: 5.1ms preprocess, 30.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 716/1074 [01:38<00:57,  6.19it/s]


0: 384x640 7 persons, 1 sports ball, 30.2ms
Speed: 10.0ms preprocess, 30.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 717/1074 [01:38<01:02,  5.72it/s]


0: 384x640 7 persons, 1 sports ball, 34.8ms
Speed: 2.2ms preprocess, 34.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 718/1074 [01:38<01:05,  5.41it/s]


0: 384x640 7 persons, 1 sports ball, 26.2ms
Speed: 11.3ms preprocess, 26.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 719/1074 [01:38<00:59,  5.98it/s]


0: 384x640 7 persons, 1 sports ball, 28.5ms
Speed: 5.1ms preprocess, 28.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 720/1074 [01:39<00:56,  6.24it/s]


0: 384x640 7 persons, 1 sports ball, 32.5ms
Speed: 3.8ms preprocess, 32.5ms inference, 2.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 721/1074 [01:39<01:03,  5.54it/s]


0: 384x640 7 persons, 1 sports ball, 32.2ms
Speed: 3.5ms preprocess, 32.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 722/1074 [01:39<01:02,  5.61it/s]


0: 384x640 7 persons, 1 sports ball, 45.0ms
Speed: 2.0ms preprocess, 45.0ms inference, 6.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 723/1074 [01:39<01:09,  5.03it/s]


0: 384x640 7 persons, 1 sports ball, 32.6ms
Speed: 5.2ms preprocess, 32.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  67%|██████▋   | 724/1074 [01:40<01:15,  4.64it/s]


0: 384x640 7 persons, 2 sports balls, 26.2ms
Speed: 6.6ms preprocess, 26.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 725/1074 [01:40<01:12,  4.83it/s]


0: 384x640 7 persons, 1 sports ball, 48.5ms
Speed: 8.6ms preprocess, 48.5ms inference, 2.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 726/1074 [01:40<01:16,  4.57it/s]


0: 384x640 8 persons, 39.6ms
Speed: 2.2ms preprocess, 39.6ms inference, 6.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 727/1074 [01:40<01:14,  4.65it/s]


0: 384x640 7 persons, 1 sports ball, 47.7ms
Speed: 2.0ms preprocess, 47.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 728/1074 [01:40<01:06,  5.17it/s]


0: 384x640 9 persons, 1 sports ball, 31.5ms
Speed: 2.2ms preprocess, 31.5ms inference, 4.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 729/1074 [01:40<01:02,  5.52it/s]


0: 384x640 8 persons, 1 sports ball, 34.7ms
Speed: 7.0ms preprocess, 34.7ms inference, 6.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 730/1074 [01:41<01:01,  5.56it/s]


0: 384x640 8 persons, 1 sports ball, 34.0ms
Speed: 2.4ms preprocess, 34.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 731/1074 [01:41<01:06,  5.14it/s]


0: 384x640 8 persons, 1 sports ball, 39.0ms
Speed: 7.7ms preprocess, 39.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 732/1074 [01:41<01:07,  5.07it/s]


0: 384x640 8 persons, 1 sports ball, 39.0ms
Speed: 4.3ms preprocess, 39.0ms inference, 5.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 733/1074 [01:41<01:08,  4.99it/s]


0: 384x640 8 persons, 1 sports ball, 36.1ms
Speed: 8.4ms preprocess, 36.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 734/1074 [01:41<01:03,  5.32it/s]


0: 384x640 8 persons, 1 sports ball, 40.8ms
Speed: 3.5ms preprocess, 40.8ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  68%|██████▊   | 735/1074 [01:42<01:06,  5.06it/s]


0: 384x640 8 persons, 1 sports ball, 44.4ms
Speed: 2.1ms preprocess, 44.4ms inference, 7.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▊   | 736/1074 [01:42<01:10,  4.76it/s]


0: 384x640 8 persons, 1 sports ball, 48.5ms
Speed: 2.2ms preprocess, 48.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▊   | 737/1074 [01:42<01:14,  4.50it/s]


0: 384x640 8 persons, 1 sports ball, 38.6ms
Speed: 4.5ms preprocess, 38.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▊   | 738/1074 [01:42<01:16,  4.40it/s]


0: 384x640 9 persons, 1 sports ball, 53.4ms
Speed: 2.3ms preprocess, 53.4ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▉   | 739/1074 [01:43<01:18,  4.29it/s]


0: 384x640 9 persons, 1 sports ball, 38.6ms
Speed: 2.3ms preprocess, 38.6ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▉   | 740/1074 [01:43<01:15,  4.42it/s]


0: 384x640 9 persons, 1 sports ball, 34.2ms
Speed: 6.5ms preprocess, 34.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▉   | 741/1074 [01:43<01:13,  4.55it/s]


0: 384x640 8 persons, 1 sports ball, 36.0ms
Speed: 2.1ms preprocess, 36.0ms inference, 3.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▉   | 742/1074 [01:43<01:09,  4.77it/s]


0: 384x640 10 persons, 1 sports ball, 53.4ms
Speed: 4.3ms preprocess, 53.4ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▉   | 743/1074 [01:44<01:13,  4.50it/s]


0: 384x640 9 persons, 1 sports ball, 30.1ms
Speed: 2.9ms preprocess, 30.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▉   | 744/1074 [01:44<01:17,  4.25it/s]


0: 384x640 11 persons, 1 sports ball, 49.1ms
Speed: 2.2ms preprocess, 49.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▉   | 745/1074 [01:44<01:17,  4.22it/s]


0: 384x640 10 persons, 1 sports ball, 37.8ms
Speed: 2.2ms preprocess, 37.8ms inference, 6.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  69%|██████▉   | 746/1074 [01:44<01:13,  4.47it/s]


0: 384x640 10 persons, 1 sports ball, 29.1ms
Speed: 2.1ms preprocess, 29.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|██████▉   | 747/1074 [01:44<01:07,  4.82it/s]


0: 384x640 9 persons, 1 sports ball, 36.8ms
Speed: 6.3ms preprocess, 36.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|██████▉   | 748/1074 [01:45<01:05,  5.00it/s]


0: 384x640 10 persons, 1 sports ball, 45.7ms
Speed: 10.7ms preprocess, 45.7ms inference, 6.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|██████▉   | 749/1074 [01:45<01:07,  4.81it/s]


0: 384x640 9 persons, 1 sports ball, 34.9ms
Speed: 4.8ms preprocess, 34.9ms inference, 6.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|██████▉   | 750/1074 [01:45<01:11,  4.55it/s]


0: 384x640 9 persons, 1 sports ball, 46.6ms
Speed: 3.6ms preprocess, 46.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|██████▉   | 751/1074 [01:45<01:11,  4.55it/s]


0: 384x640 10 persons, 1 sports ball, 42.5ms
Speed: 2.2ms preprocess, 42.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|███████   | 752/1074 [01:45<01:10,  4.54it/s]


0: 384x640 9 persons, 1 sports ball, 28.1ms
Speed: 2.5ms preprocess, 28.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|███████   | 753/1074 [01:46<01:09,  4.61it/s]


0: 384x640 9 persons, 1 sports ball, 26.0ms
Speed: 2.3ms preprocess, 26.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|███████   | 754/1074 [01:46<01:01,  5.23it/s]


0: 384x640 11 persons, 2 sports balls, 31.8ms
Speed: 2.6ms preprocess, 31.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|███████   | 755/1074 [01:46<01:03,  4.98it/s]


0: 384x640 11 persons, 25.8ms
Speed: 2.0ms preprocess, 25.8ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|███████   | 756/1074 [01:46<01:02,  5.10it/s]


0: 384x640 10 persons, 1 sports ball, 36.1ms
Speed: 6.9ms preprocess, 36.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  70%|███████   | 757/1074 [01:46<01:04,  4.90it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████   | 758/1074 [01:47<01:04,  4.89it/s]


0: 384x640 12 persons, 1 sports ball, 30.3ms
Speed: 2.1ms preprocess, 30.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████   | 759/1074 [01:47<00:56,  5.59it/s]


0: 384x640 10 persons, 1 sports ball, 35.7ms
Speed: 2.1ms preprocess, 35.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████   | 760/1074 [01:47<00:55,  5.67it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.5ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████   | 761/1074 [01:47<00:50,  6.15it/s]


0: 384x640 11 persons, 1 sports ball, 30.9ms
Speed: 2.3ms preprocess, 30.9ms inference, 2.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████   | 762/1074 [01:47<00:51,  6.10it/s]


0: 384x640 11 persons, 51.2ms
Speed: 2.3ms preprocess, 51.2ms inference, 7.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████   | 763/1074 [01:47<00:50,  6.20it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.6ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████   | 764/1074 [01:48<00:52,  5.89it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 12.2ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████   | 765/1074 [01:48<00:50,  6.14it/s]


0: 384x640 10 persons, 1 sports ball, 35.3ms
Speed: 2.3ms preprocess, 35.3ms inference, 6.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████▏  | 766/1074 [01:48<00:55,  5.50it/s]


0: 384x640 10 persons, 31.0ms
Speed: 4.4ms preprocess, 31.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  71%|███████▏  | 767/1074 [01:48<00:50,  6.06it/s]


0: 384x640 10 persons, 27.5ms
Speed: 2.2ms preprocess, 27.5ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 768/1074 [01:48<00:53,  5.75it/s]


0: 384x640 14 persons, 1 sports ball, 32.4ms
Speed: 2.4ms preprocess, 32.4ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 769/1074 [01:48<00:50,  6.05it/s]


0: 384x640 16 persons, 29.1ms
Speed: 2.7ms preprocess, 29.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 770/1074 [01:49<00:48,  6.27it/s]


0: 384x640 15 persons, 1 sports ball, 28.7ms
Speed: 3.3ms preprocess, 28.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 771/1074 [01:49<00:54,  5.55it/s]


0: 384x640 15 persons, 26.1ms
Speed: 2.1ms preprocess, 26.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 772/1074 [01:49<00:58,  5.17it/s]


0: 384x640 15 persons, 24.4ms
Speed: 6.6ms preprocess, 24.4ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 773/1074 [01:49<00:57,  5.27it/s]


0: 384x640 15 persons, 34.0ms
Speed: 9.4ms preprocess, 34.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 774/1074 [01:49<00:53,  5.57it/s]


0: 384x640 15 persons, 30.2ms
Speed: 3.7ms preprocess, 30.2ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 775/1074 [01:50<00:53,  5.59it/s]


0: 384x640 15 persons, 35.3ms
Speed: 2.2ms preprocess, 35.3ms inference, 3.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 776/1074 [01:50<00:49,  5.99it/s]


0: 384x640 17 persons, 1 sports ball, 33.0ms
Speed: 2.2ms preprocess, 33.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 777/1074 [01:50<00:57,  5.18it/s]


0: 384x640 15 persons, 1 sports ball, 24.2ms
Speed: 2.0ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  72%|███████▏  | 778/1074 [01:50<01:02,  4.77it/s]


0: 384x640 14 persons, 1 sports ball, 33.8ms
Speed: 3.1ms preprocess, 33.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 779/1074 [01:50<00:55,  5.30it/s]


0: 384x640 14 persons, 1 sports ball, 35.6ms
Speed: 3.3ms preprocess, 35.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 780/1074 [01:50<00:52,  5.55it/s]


0: 384x640 14 persons, 28.2ms
Speed: 2.2ms preprocess, 28.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 781/1074 [01:51<00:53,  5.47it/s]


0: 384x640 14 persons, 1 sports ball, 30.5ms
Speed: 2.0ms preprocess, 30.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 782/1074 [01:51<00:50,  5.80it/s]


0: 384x640 13 persons, 1 sports ball, 24.2ms
Speed: 2.7ms preprocess, 24.2ms inference, 8.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 783/1074 [01:51<00:48,  5.95it/s]


0: 384x640 13 persons, 1 sports ball, 24.8ms
Speed: 11.1ms preprocess, 24.8ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 784/1074 [01:51<00:46,  6.26it/s]


0: 384x640 13 persons, 1 sports ball, 37.3ms
Speed: 2.0ms preprocess, 37.3ms inference, 3.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 785/1074 [01:51<00:58,  4.95it/s]


0: 384x640 13 persons, 28.0ms
Speed: 2.1ms preprocess, 28.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 786/1074 [01:52<00:52,  5.46it/s]


0: 384x640 13 persons, 37.6ms
Speed: 2.2ms preprocess, 37.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 787/1074 [01:52<00:56,  5.06it/s]


0: 384x640 13 persons, 29.8ms
Speed: 3.9ms preprocess, 29.8ms inference, 5.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 788/1074 [01:52<00:51,  5.57it/s]


0: 384x640 13 persons, 35.1ms
Speed: 2.5ms preprocess, 35.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  73%|███████▎  | 789/1074 [01:52<00:53,  5.31it/s]


0: 384x640 13 persons, 1 sports ball, 35.8ms
Speed: 6.7ms preprocess, 35.8ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▎  | 790/1074 [01:52<00:49,  5.75it/s]


0: 384x640 13 persons, 1 sports ball, 25.1ms
Speed: 3.2ms preprocess, 25.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▎  | 791/1074 [01:52<00:52,  5.40it/s]


0: 384x640 13 persons, 1 sports ball, 47.9ms
Speed: 4.7ms preprocess, 47.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▎  | 792/1074 [01:53<00:49,  5.67it/s]


0: 384x640 13 persons, 1 sports ball, 35.9ms
Speed: 6.2ms preprocess, 35.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▍  | 793/1074 [01:53<00:55,  5.05it/s]


0: 384x640 13 persons, 1 sports ball, 25.2ms
Speed: 3.5ms preprocess, 25.2ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▍  | 794/1074 [01:53<00:56,  4.93it/s]


0: 384x640 13 persons, 1 sports ball, 27.7ms
Speed: 2.1ms preprocess, 27.7ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▍  | 795/1074 [01:53<00:55,  5.06it/s]


0: 384x640 13 persons, 1 sports ball, 36.7ms
Speed: 3.1ms preprocess, 36.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▍  | 796/1074 [01:54<00:57,  4.80it/s]


0: 384x640 13 persons, 1 sports ball, 30.3ms
Speed: 2.1ms preprocess, 30.3ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▍  | 797/1074 [01:54<00:51,  5.33it/s]


0: 384x640 13 persons, 1 sports ball, 40.9ms
Speed: 2.2ms preprocess, 40.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▍  | 798/1074 [01:54<00:48,  5.71it/s]


0: 384x640 13 persons, 1 sports ball, 34.7ms
Speed: 2.1ms preprocess, 34.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▍  | 799/1074 [01:54<00:46,  5.93it/s]


0: 384x640 13 persons, 1 sports ball, 29.9ms
Speed: 3.8ms preprocess, 29.9ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  74%|███████▍  | 800/1074 [01:54<00:50,  5.44it/s]


0: 384x640 13 persons, 1 sports ball, 37.3ms
Speed: 2.1ms preprocess, 37.3ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▍  | 801/1074 [01:54<00:55,  4.93it/s]


0: 384x640 12 persons, 1 sports ball, 24.2ms
Speed: 3.8ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▍  | 802/1074 [01:55<00:49,  5.48it/s]


0: 384x640 13 persons, 1 sports ball, 38.2ms
Speed: 2.0ms preprocess, 38.2ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▍  | 803/1074 [01:55<00:54,  4.94it/s]


0: 384x640 14 persons, 25.8ms
Speed: 3.9ms preprocess, 25.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▍  | 804/1074 [01:55<00:57,  4.70it/s]


0: 384x640 13 persons, 1 sports ball, 24.2ms
Speed: 2.3ms preprocess, 24.2ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▍  | 805/1074 [01:55<00:56,  4.79it/s]


0: 384x640 13 persons, 1 sports ball, 49.9ms
Speed: 2.3ms preprocess, 49.9ms inference, 6.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▌  | 806/1074 [01:56<01:05,  4.08it/s]


0: 384x640 13 persons, 32.1ms
Speed: 2.1ms preprocess, 32.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▌  | 807/1074 [01:56<01:06,  4.04it/s]


0: 384x640 13 persons, 28.5ms
Speed: 2.8ms preprocess, 28.5ms inference, 8.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▌  | 808/1074 [01:56<01:07,  3.95it/s]


0: 384x640 13 persons, 27.8ms
Speed: 2.1ms preprocess, 27.8ms inference, 4.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▌  | 809/1074 [01:56<01:00,  4.41it/s]


0: 384x640 14 persons, 1 sports ball, 36.9ms
Speed: 2.1ms preprocess, 36.9ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  75%|███████▌  | 810/1074 [01:56<00:54,  4.81it/s]


0: 384x640 14 persons, 1 sports ball, 54.7ms
Speed: 2.1ms preprocess, 54.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▌  | 811/1074 [01:57<00:57,  4.61it/s]


0: 384x640 14 persons, 1 sports ball, 39.7ms
Speed: 6.2ms preprocess, 39.7ms inference, 9.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▌  | 812/1074 [01:57<00:56,  4.68it/s]


0: 384x640 14 persons, 1 sports ball, 39.3ms
Speed: 1.9ms preprocess, 39.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▌  | 813/1074 [01:57<01:01,  4.24it/s]


0: 384x640 15 persons, 1 sports ball, 31.4ms
Speed: 2.2ms preprocess, 31.4ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▌  | 814/1074 [01:57<00:59,  4.35it/s]


0: 384x640 14 persons, 39.7ms
Speed: 6.7ms preprocess, 39.7ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▌  | 815/1074 [01:58<00:56,  4.55it/s]


0: 384x640 15 persons, 1 sports ball, 37.0ms
Speed: 3.8ms preprocess, 37.0ms inference, 6.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▌  | 816/1074 [01:58<00:55,  4.67it/s]


0: 384x640 14 persons, 1 sports ball, 46.5ms
Speed: 7.8ms preprocess, 46.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▌  | 817/1074 [01:58<00:53,  4.77it/s]


0: 384x640 14 persons, 1 sports ball, 34.5ms
Speed: 4.1ms preprocess, 34.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▌  | 818/1074 [01:58<00:53,  4.76it/s]


0: 384x640 14 persons, 1 sports ball, 52.4ms
Speed: 9.5ms preprocess, 52.4ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▋  | 819/1074 [01:58<01:01,  4.16it/s]


0: 384x640 15 persons, 1 sports ball, 26.4ms
Speed: 2.8ms preprocess, 26.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▋  | 820/1074 [01:59<01:03,  4.02it/s]


0: 384x640 14 persons, 1 sports ball, 41.0ms
Speed: 10.2ms preprocess, 41.0ms inference, 9.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  76%|███████▋  | 821/1074 [01:59<00:59,  4.25it/s]


0: 384x640 15 persons, 1 sports ball, 43.9ms
Speed: 4.3ms preprocess, 43.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 822/1074 [01:59<00:57,  4.42it/s]


0: 384x640 15 persons, 1 sports ball, 28.0ms
Speed: 4.2ms preprocess, 28.0ms inference, 10.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 823/1074 [01:59<00:55,  4.49it/s]


0: 384x640 13 persons, 1 sports ball, 47.6ms
Speed: 2.4ms preprocess, 47.6ms inference, 7.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 824/1074 [02:00<00:58,  4.25it/s]


0: 384x640 13 persons, 1 sports ball, 52.3ms
Speed: 2.6ms preprocess, 52.3ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 825/1074 [02:00<01:01,  4.02it/s]


0: 384x640 12 persons, 1 sports ball, 28.1ms
Speed: 2.1ms preprocess, 28.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 826/1074 [02:00<01:04,  3.87it/s]


0: 384x640 13 persons, 1 sports ball, 40.4ms
Speed: 2.2ms preprocess, 40.4ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 827/1074 [02:00<01:04,  3.83it/s]


0: 384x640 14 persons, 1 sports ball, 35.6ms
Speed: 2.1ms preprocess, 35.6ms inference, 5.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 828/1074 [02:01<01:03,  3.90it/s]


0: 384x640 12 persons, 1 sports ball, 35.6ms
Speed: 2.8ms preprocess, 35.6ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 829/1074 [02:01<01:02,  3.89it/s]


0: 384x640 13 persons, 1 sports ball, 43.4ms
Speed: 5.1ms preprocess, 43.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 830/1074 [02:01<00:58,  4.20it/s]


0: 384x640 12 persons, 1 sports ball, 34.4ms
Speed: 4.8ms preprocess, 34.4ms inference, 4.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 831/1074 [02:01<00:50,  4.77it/s]


0: 384x640 13 persons, 1 sports ball, 24.8ms
Speed: 2.6ms preprocess, 24.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  77%|███████▋  | 832/1074 [02:02<00:49,  4.85it/s]


0: 384x640 12 persons, 1 sports ball, 24.3ms
Speed: 9.3ms preprocess, 24.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 833/1074 [02:02<00:54,  4.45it/s]


0: 384x640 11 persons, 1 sports ball, 28.6ms
Speed: 11.3ms preprocess, 28.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 834/1074 [02:02<00:48,  4.97it/s]


0: 384x640 11 persons, 25.4ms
Speed: 2.6ms preprocess, 25.4ms inference, 4.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 835/1074 [02:02<00:45,  5.23it/s]


0: 384x640 12 persons, 1 sports ball, 54.2ms
Speed: 4.8ms preprocess, 54.2ms inference, 5.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 836/1074 [02:02<00:50,  4.70it/s]


0: 384x640 11 persons, 1 sports ball, 24.2ms
Speed: 10.3ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 837/1074 [02:03<00:46,  5.10it/s]


0: 384x640 11 persons, 25.2ms
Speed: 9.7ms preprocess, 25.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 838/1074 [02:03<00:42,  5.50it/s]


0: 384x640 13 persons, 28.8ms
Speed: 2.1ms preprocess, 28.8ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 839/1074 [02:03<00:39,  5.95it/s]


0: 384x640 11 persons, 35.0ms
Speed: 2.2ms preprocess, 35.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 840/1074 [02:03<00:39,  5.86it/s]


0: 384x640 11 persons, 24.4ms
Speed: 7.1ms preprocess, 24.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 841/1074 [02:03<00:43,  5.41it/s]


0: 384x640 11 persons, 1 sports ball, 36.4ms
Speed: 2.3ms preprocess, 36.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 842/1074 [02:03<00:44,  5.24it/s]


0: 384x640 11 persons, 1 sports ball, 27.4ms
Speed: 2.0ms preprocess, 27.4ms inference, 4.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  78%|███████▊  | 843/1074 [02:04<00:40,  5.65it/s]


0: 384x640 11 persons, 1 sports ball, 24.3ms
Speed: 7.1ms preprocess, 24.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▊  | 844/1074 [02:04<00:47,  4.89it/s]


0: 384x640 11 persons, 1 sports ball, 29.6ms
Speed: 2.4ms preprocess, 29.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▊  | 845/1074 [02:04<00:42,  5.38it/s]


0: 384x640 11 persons, 1 sports ball, 31.7ms
Speed: 2.3ms preprocess, 31.7ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▉  | 846/1074 [02:04<00:43,  5.21it/s]


0: 384x640 11 persons, 1 sports ball, 44.6ms
Speed: 2.1ms preprocess, 44.6ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▉  | 847/1074 [02:04<00:41,  5.43it/s]


0: 384x640 11 persons, 1 sports ball, 28.3ms
Speed: 1.9ms preprocess, 28.3ms inference, 6.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▉  | 848/1074 [02:05<00:47,  4.79it/s]


0: 384x640 11 persons, 1 sports ball, 25.6ms
Speed: 2.8ms preprocess, 25.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▉  | 849/1074 [02:05<00:42,  5.35it/s]


0: 384x640 12 persons, 1 sports ball, 28.8ms
Speed: 2.0ms preprocess, 28.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▉  | 850/1074 [02:05<00:38,  5.78it/s]


0: 384x640 11 persons, 1 sports ball, 24.1ms
Speed: 2.0ms preprocess, 24.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▉  | 851/1074 [02:05<00:43,  5.09it/s]


0: 384x640 13 persons, 24.2ms
Speed: 2.9ms preprocess, 24.2ms inference, 9.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▉  | 852/1074 [02:05<00:42,  5.27it/s]


0: 384x640 13 persons, 27.7ms
Speed: 2.8ms preprocess, 27.7ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  79%|███████▉  | 853/1074 [02:06<00:45,  4.86it/s]


0: 384x640 11 persons, 24.2ms
Speed: 2.4ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|███████▉  | 854/1074 [02:06<00:44,  5.00it/s]


0: 384x640 10 persons, 24.4ms
Speed: 2.0ms preprocess, 24.4ms inference, 5.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|███████▉  | 855/1074 [02:06<00:40,  5.39it/s]


0: 384x640 11 persons, 29.4ms
Speed: 8.8ms preprocess, 29.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|███████▉  | 856/1074 [02:06<00:40,  5.33it/s]


0: 384x640 10 persons, 30.9ms
Speed: 9.1ms preprocess, 30.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|███████▉  | 857/1074 [02:06<00:40,  5.29it/s]


0: 384x640 10 persons, 48.0ms
Speed: 2.0ms preprocess, 48.0ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|███████▉  | 858/1074 [02:07<00:45,  4.77it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.0ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|███████▉  | 859/1074 [02:07<00:39,  5.43it/s]


0: 384x640 10 persons, 1 sports ball, 34.1ms
Speed: 2.2ms preprocess, 34.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|████████  | 860/1074 [02:07<00:41,  5.22it/s]


0: 384x640 12 persons, 1 sports ball, 28.9ms
Speed: 6.1ms preprocess, 28.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|████████  | 861/1074 [02:07<00:37,  5.66it/s]


0: 384x640 10 persons, 1 sports ball, 37.9ms
Speed: 8.3ms preprocess, 37.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|████████  | 862/1074 [02:07<00:35,  5.92it/s]


0: 384x640 10 persons, 1 sports ball, 41.8ms
Speed: 2.7ms preprocess, 41.8ms inference, 4.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|████████  | 863/1074 [02:07<00:39,  5.40it/s]


0: 384x640 11 persons, 1 sports ball, 32.7ms
Speed: 2.7ms preprocess, 32.7ms inference, 3.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  80%|████████  | 864/1074 [02:07<00:35,  5.87it/s]


0: 384x640 11 persons, 1 sports ball, 32.1ms
Speed: 2.1ms preprocess, 32.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████  | 865/1074 [02:08<00:33,  6.23it/s]


0: 384x640 11 persons, 2 sports balls, 24.3ms
Speed: 2.1ms preprocess, 24.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████  | 866/1074 [02:08<00:31,  6.57it/s]


0: 384x640 10 persons, 1 sports ball, 25.3ms
Speed: 2.0ms preprocess, 25.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████  | 867/1074 [02:08<00:30,  6.73it/s]


0: 384x640 9 persons, 1 sports ball, 26.2ms
Speed: 2.0ms preprocess, 26.2ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████  | 868/1074 [02:08<00:35,  5.82it/s]


0: 384x640 9 persons, 1 sports ball, 26.9ms
Speed: 3.7ms preprocess, 26.9ms inference, 9.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████  | 869/1074 [02:08<00:33,  6.04it/s]


0: 384x640 9 persons, 1 sports ball, 41.4ms
Speed: 2.1ms preprocess, 41.4ms inference, 4.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████  | 870/1074 [02:08<00:33,  6.10it/s]


0: 384x640 9 persons, 1 sports ball, 46.3ms
Speed: 1.9ms preprocess, 46.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████  | 871/1074 [02:09<00:36,  5.61it/s]


0: 384x640 9 persons, 1 sports ball, 27.1ms
Speed: 4.3ms preprocess, 27.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████  | 872/1074 [02:09<00:32,  6.14it/s]


0: 384x640 9 persons, 1 sports ball, 27.7ms
Speed: 8.7ms preprocess, 27.7ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████▏ | 873/1074 [02:09<00:31,  6.32it/s]


0: 384x640 9 persons, 1 sports ball, 26.0ms
Speed: 1.9ms preprocess, 26.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████▏ | 874/1074 [02:09<00:36,  5.54it/s]


0: 384x640 8 persons, 1 sports ball, 28.8ms
Speed: 10.4ms preprocess, 28.8ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  81%|████████▏ | 875/1074 [02:09<00:41,  4.85it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 876/1074 [02:10<00:45,  4.39it/s]


0: 384x640 8 persons, 1 sports ball, 25.5ms
Speed: 2.4ms preprocess, 25.5ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 877/1074 [02:10<00:45,  4.29it/s]


0: 384x640 11 persons, 1 sports ball, 31.2ms
Speed: 2.4ms preprocess, 31.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 878/1074 [02:10<00:43,  4.48it/s]


0: 384x640 9 persons, 1 sports ball, 28.8ms
Speed: 2.3ms preprocess, 28.8ms inference, 4.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 879/1074 [02:10<00:41,  4.71it/s]


0: 384x640 9 persons, 1 sports ball, 37.5ms
Speed: 2.7ms preprocess, 37.5ms inference, 4.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 880/1074 [02:10<00:37,  5.18it/s]


0: 384x640 10 persons, 1 sports ball, 26.6ms
Speed: 3.7ms preprocess, 26.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 881/1074 [02:11<00:36,  5.22it/s]


0: 384x640 10 persons, 1 sports ball, 40.4ms
Speed: 6.4ms preprocess, 40.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 882/1074 [02:11<00:41,  4.67it/s]


0: 384x640 9 persons, 1 sports ball, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 883/1074 [02:11<00:36,  5.21it/s]


0: 384x640 9 persons, 1 sports ball, 28.9ms
Speed: 2.2ms preprocess, 28.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 884/1074 [02:11<00:36,  5.26it/s]


0: 384x640 9 persons, 1 sports ball, 33.2ms
Speed: 8.7ms preprocess, 33.2ms inference, 6.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 885/1074 [02:12<00:38,  4.90it/s]


0: 384x640 10 persons, 1 sports ball, 35.9ms
Speed: 3.2ms preprocess, 35.9ms inference, 3.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  82%|████████▏ | 886/1074 [02:12<00:37,  5.07it/s]


0: 384x640 9 persons, 1 sports ball, 31.4ms
Speed: 2.3ms preprocess, 31.4ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 887/1074 [02:12<00:40,  4.62it/s]


0: 384x640 9 persons, 1 sports ball, 25.6ms
Speed: 9.5ms preprocess, 25.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 888/1074 [02:12<00:41,  4.46it/s]


0: 384x640 9 persons, 1 sports ball, 33.2ms
Speed: 2.3ms preprocess, 33.2ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 889/1074 [02:12<00:39,  4.68it/s]


0: 384x640 9 persons, 46.9ms
Speed: 2.1ms preprocess, 46.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 890/1074 [02:13<00:37,  4.92it/s]


0: 384x640 9 persons, 30.8ms
Speed: 2.6ms preprocess, 30.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 891/1074 [02:13<00:37,  4.88it/s]


0: 384x640 9 persons, 45.0ms
Speed: 11.7ms preprocess, 45.0ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 892/1074 [02:13<00:36,  5.00it/s]


0: 384x640 8 persons, 41.3ms
Speed: 13.1ms preprocess, 41.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 893/1074 [02:13<00:39,  4.62it/s]


0: 384x640 8 persons, 33.6ms
Speed: 7.0ms preprocess, 33.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 894/1074 [02:13<00:40,  4.48it/s]


0: 384x640 9 persons, 1 sports ball, 38.3ms
Speed: 4.0ms preprocess, 38.3ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 895/1074 [02:14<00:39,  4.49it/s]


0: 384x640 8 persons, 47.8ms
Speed: 8.7ms preprocess, 47.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  83%|████████▎ | 896/1074 [02:14<00:37,  4.69it/s]


0: 384x640 8 persons, 33.3ms
Speed: 9.0ms preprocess, 33.3ms inference, 5.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▎ | 897/1074 [02:14<00:36,  4.88it/s]


0: 384x640 9 persons, 33.5ms
Speed: 9.2ms preprocess, 33.5ms inference, 7.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▎ | 898/1074 [02:14<00:38,  4.52it/s]


0: 384x640 11 persons, 26.5ms
Speed: 2.0ms preprocess, 26.5ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▎ | 899/1074 [02:14<00:35,  4.92it/s]


0: 384x640 8 persons, 35.6ms
Speed: 9.2ms preprocess, 35.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▍ | 900/1074 [02:15<00:34,  5.08it/s]


0: 384x640 8 persons, 27.5ms
Speed: 6.1ms preprocess, 27.5ms inference, 3.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▍ | 901/1074 [02:15<00:32,  5.39it/s]


0: 384x640 8 persons, 30.6ms
Speed: 3.6ms preprocess, 30.6ms inference, 5.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▍ | 902/1074 [02:15<00:31,  5.45it/s]


0: 384x640 9 persons, 39.1ms
Speed: 3.1ms preprocess, 39.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▍ | 903/1074 [02:15<00:31,  5.41it/s]


0: 384x640 9 persons, 52.6ms
Speed: 7.7ms preprocess, 52.6ms inference, 3.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▍ | 904/1074 [02:15<00:35,  4.84it/s]


0: 384x640 9 persons, 35.3ms
Speed: 2.1ms preprocess, 35.3ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▍ | 905/1074 [02:16<00:39,  4.24it/s]


0: 384x640 9 persons, 27.8ms
Speed: 2.1ms preprocess, 27.8ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▍ | 906/1074 [02:16<00:34,  4.81it/s]


0: 384x640 9 persons, 33.8ms
Speed: 3.6ms preprocess, 33.8ms inference, 5.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  84%|████████▍ | 907/1074 [02:16<00:35,  4.68it/s]


0: 384x640 9 persons, 40.7ms
Speed: 9.1ms preprocess, 40.7ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▍ | 908/1074 [02:16<00:33,  4.93it/s]


0: 384x640 9 persons, 30.6ms
Speed: 2.2ms preprocess, 30.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▍ | 909/1074 [02:16<00:32,  5.13it/s]


0: 384x640 10 persons, 32.1ms
Speed: 9.2ms preprocess, 32.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▍ | 910/1074 [02:17<00:36,  4.47it/s]


0: 384x640 9 persons, 32.9ms
Speed: 3.6ms preprocess, 32.9ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▍ | 911/1074 [02:17<00:36,  4.53it/s]


0: 384x640 9 persons, 27.4ms
Speed: 9.3ms preprocess, 27.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▍ | 912/1074 [02:17<00:34,  4.75it/s]


0: 384x640 11 persons, 74.2ms
Speed: 2.0ms preprocess, 74.2ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▌ | 913/1074 [02:17<00:38,  4.14it/s]


0: 384x640 10 persons, 30.8ms
Speed: 2.1ms preprocess, 30.8ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▌ | 914/1074 [02:18<00:39,  4.05it/s]


0: 384x640 10 persons, 33.5ms
Speed: 2.1ms preprocess, 33.5ms inference, 4.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▌ | 915/1074 [02:18<00:38,  4.14it/s]


0: 384x640 9 persons, 43.4ms
Speed: 10.3ms preprocess, 43.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▌ | 916/1074 [02:18<00:33,  4.70it/s]


0: 384x640 9 persons, 53.7ms
Speed: 2.3ms preprocess, 53.7ms inference, 7.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▌ | 917/1074 [02:18<00:37,  4.24it/s]


0: 384x640 12 persons, 1 sports ball, 28.0ms
Speed: 2.1ms preprocess, 28.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  85%|████████▌ | 918/1074 [02:19<00:32,  4.77it/s]


0: 384x640 10 persons, 28.6ms
Speed: 3.5ms preprocess, 28.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▌ | 919/1074 [02:19<00:32,  4.76it/s]


0: 384x640 10 persons, 29.7ms
Speed: 2.1ms preprocess, 29.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▌ | 920/1074 [02:19<00:28,  5.38it/s]


0: 384x640 10 persons, 35.6ms
Speed: 2.4ms preprocess, 35.6ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▌ | 921/1074 [02:19<00:32,  4.72it/s]


0: 384x640 10 persons, 34.6ms
Speed: 3.5ms preprocess, 34.6ms inference, 5.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▌ | 922/1074 [02:19<00:32,  4.70it/s]


0: 384x640 10 persons, 31.7ms
Speed: 2.1ms preprocess, 31.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▌ | 923/1074 [02:20<00:28,  5.25it/s]


0: 384x640 10 persons, 24.2ms
Speed: 13.5ms preprocess, 24.2ms inference, 9.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▌ | 924/1074 [02:20<00:31,  4.71it/s]


0: 384x640 10 persons, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▌ | 925/1074 [02:20<00:28,  5.23it/s]


0: 384x640 10 persons, 27.1ms
Speed: 4.3ms preprocess, 27.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▌ | 926/1074 [02:20<00:26,  5.54it/s]


0: 384x640 10 persons, 1 sports ball, 39.5ms
Speed: 4.6ms preprocess, 39.5ms inference, 3.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▋ | 927/1074 [02:20<00:29,  4.94it/s]


0: 384x640 11 persons, 1 sports ball, 30.4ms
Speed: 7.4ms preprocess, 30.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▋ | 928/1074 [02:20<00:27,  5.29it/s]


0: 384x640 10 persons, 1 sports ball, 47.5ms
Speed: 1.9ms preprocess, 47.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  86%|████████▋ | 929/1074 [02:21<00:31,  4.66it/s]


0: 384x640 11 persons, 1 sports ball, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 930/1074 [02:21<00:26,  5.35it/s]


0: 384x640 11 persons, 2 sports balls, 30.1ms
Speed: 6.1ms preprocess, 30.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 931/1074 [02:21<00:26,  5.47it/s]


0: 384x640 11 persons, 1 sports ball, 28.0ms
Speed: 2.0ms preprocess, 28.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 932/1074 [02:21<00:23,  5.95it/s]


0: 384x640 11 persons, 1 sports ball, 61.4ms
Speed: 5.4ms preprocess, 61.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 933/1074 [02:21<00:29,  4.84it/s]


0: 384x640 11 persons, 1 sports ball, 28.8ms
Speed: 2.2ms preprocess, 28.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 934/1074 [02:22<00:31,  4.50it/s]


0: 384x640 11 persons, 1 sports ball, 32.9ms
Speed: 2.1ms preprocess, 32.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 935/1074 [02:22<00:27,  5.06it/s]


0: 384x640 11 persons, 1 sports ball, 24.2ms
Speed: 2.6ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 936/1074 [02:22<00:28,  4.78it/s]


0: 384x640 11 persons, 1 sports ball, 46.3ms
Speed: 2.2ms preprocess, 46.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 937/1074 [02:22<00:31,  4.35it/s]


0: 384x640 10 persons, 1 sports ball, 34.0ms
Speed: 2.2ms preprocess, 34.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 938/1074 [02:23<00:27,  4.92it/s]


0: 384x640 11 persons, 1 sports ball, 30.5ms
Speed: 2.1ms preprocess, 30.5ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  87%|████████▋ | 939/1074 [02:23<00:28,  4.71it/s]


0: 384x640 11 persons, 1 sports ball, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 940/1074 [02:23<00:25,  5.22it/s]


0: 384x640 10 persons, 1 sports ball, 24.9ms
Speed: 3.7ms preprocess, 24.9ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 941/1074 [02:23<00:23,  5.69it/s]


0: 384x640 12 persons, 1 sports ball, 35.0ms
Speed: 1.9ms preprocess, 35.0ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 942/1074 [02:23<00:22,  5.76it/s]


0: 384x640 13 persons, 1 sports ball, 42.2ms
Speed: 3.4ms preprocess, 42.2ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 943/1074 [02:23<00:26,  4.95it/s]


0: 384x640 12 persons, 1 sports ball, 32.0ms
Speed: 12.1ms preprocess, 32.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 944/1074 [02:24<00:23,  5.44it/s]


0: 384x640 11 persons, 1 sports ball, 26.7ms
Speed: 2.0ms preprocess, 26.7ms inference, 5.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 945/1074 [02:24<00:25,  5.14it/s]


0: 384x640 11 persons, 1 sports ball, 25.2ms
Speed: 5.9ms preprocess, 25.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 946/1074 [02:24<00:25,  4.99it/s]


0: 384x640 11 persons, 1 sports ball, 24.2ms
Speed: 2.4ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 947/1074 [02:24<00:23,  5.29it/s]


0: 384x640 11 persons, 1 sports ball, 42.3ms
Speed: 8.8ms preprocess, 42.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 948/1074 [02:24<00:23,  5.44it/s]


0: 384x640 11 persons, 1 sports ball, 29.5ms
Speed: 2.0ms preprocess, 29.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 949/1074 [02:25<00:22,  5.54it/s]


0: 384x640 12 persons, 1 sports ball, 24.2ms
Speed: 1.9ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  88%|████████▊ | 950/1074 [02:25<00:21,  5.64it/s]


0: 384x640 11 persons, 1 sports ball, 36.0ms
Speed: 2.0ms preprocess, 36.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▊ | 951/1074 [02:25<00:21,  5.80it/s]


0: 384x640 12 persons, 1 sports ball, 44.8ms
Speed: 2.0ms preprocess, 44.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▊ | 952/1074 [02:25<00:20,  5.99it/s]


0: 384x640 11 persons, 1 sports ball, 32.4ms
Speed: 3.1ms preprocess, 32.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▊ | 953/1074 [02:25<00:20,  5.82it/s]


0: 384x640 12 persons, 2 sports balls, 30.6ms
Speed: 3.2ms preprocess, 30.6ms inference, 5.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▉ | 954/1074 [02:25<00:23,  5.13it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▉ | 955/1074 [02:26<00:20,  5.69it/s]


0: 384x640 11 persons, 1 sports ball, 27.4ms
Speed: 2.2ms preprocess, 27.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▉ | 956/1074 [02:26<00:23,  4.97it/s]


0: 384x640 11 persons, 1 sports ball, 25.0ms
Speed: 2.1ms preprocess, 25.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▉ | 957/1074 [02:26<00:22,  5.19it/s]


0: 384x640 12 persons, 30.9ms
Speed: 4.9ms preprocess, 30.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▉ | 958/1074 [02:26<00:21,  5.48it/s]


0: 384x640 11 persons, 1 sports ball, 24.8ms
Speed: 2.1ms preprocess, 24.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▉ | 959/1074 [02:26<00:20,  5.59it/s]


0: 384x640 11 persons, 24.2ms
Speed: 3.7ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▉ | 960/1074 [02:27<00:19,  5.90it/s]


0: 384x640 11 persons, 31.3ms
Speed: 3.3ms preprocess, 31.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  89%|████████▉ | 961/1074 [02:27<00:22,  4.97it/s]


0: 384x640 11 persons, 27.1ms
Speed: 2.2ms preprocess, 27.1ms inference, 6.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|████████▉ | 962/1074 [02:27<00:20,  5.43it/s]


0: 384x640 12 persons, 28.7ms
Speed: 2.2ms preprocess, 28.7ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|████████▉ | 963/1074 [02:27<00:19,  5.82it/s]


0: 384x640 11 persons, 28.8ms
Speed: 2.3ms preprocess, 28.8ms inference, 4.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|████████▉ | 964/1074 [02:27<00:18,  5.92it/s]


0: 384x640 11 persons, 35.5ms
Speed: 2.1ms preprocess, 35.5ms inference, 3.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|████████▉ | 965/1074 [02:27<00:18,  5.81it/s]


0: 384x640 12 persons, 25.9ms
Speed: 2.2ms preprocess, 25.9ms inference, 6.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|████████▉ | 966/1074 [02:28<00:17,  6.01it/s]


0: 384x640 12 persons, 28.8ms
Speed: 2.9ms preprocess, 28.8ms inference, 2.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|█████████ | 967/1074 [02:28<00:16,  6.37it/s]


0: 384x640 11 persons, 29.0ms
Speed: 1.9ms preprocess, 29.0ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|█████████ | 968/1074 [02:28<00:19,  5.57it/s]


0: 384x640 9 persons, 24.2ms
Speed: 2.2ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|█████████ | 969/1074 [02:28<00:18,  5.57it/s]


0: 384x640 10 persons, 53.3ms
Speed: 3.3ms preprocess, 53.3ms inference, 6.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|█████████ | 970/1074 [02:28<00:21,  4.84it/s]


0: 384x640 10 persons, 1 sports ball, 24.2ms
Speed: 2.8ms preprocess, 24.2ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  90%|█████████ | 971/1074 [02:29<00:19,  5.25it/s]


0: 384x640 10 persons, 41.1ms
Speed: 2.4ms preprocess, 41.1ms inference, 6.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████ | 972/1074 [02:29<00:20,  5.08it/s]


0: 384x640 9 persons, 44.8ms
Speed: 2.9ms preprocess, 44.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████ | 973/1074 [02:29<00:20,  5.03it/s]


0: 384x640 10 persons, 36.1ms
Speed: 10.0ms preprocess, 36.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████ | 974/1074 [02:29<00:20,  4.79it/s]


0: 384x640 10 persons, 1 sports ball, 44.9ms
Speed: 9.4ms preprocess, 44.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████ | 975/1074 [02:29<00:22,  4.36it/s]


0: 384x640 11 persons, 1 sports ball, 26.0ms
Speed: 4.7ms preprocess, 26.0ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████ | 976/1074 [02:30<00:20,  4.80it/s]


0: 384x640 10 persons, 24.3ms
Speed: 6.8ms preprocess, 24.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████ | 977/1074 [02:30<00:19,  5.00it/s]


0: 384x640 10 persons, 44.6ms
Speed: 2.1ms preprocess, 44.6ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████ | 978/1074 [02:30<00:21,  4.41it/s]


0: 384x640 10 persons, 48.8ms
Speed: 2.3ms preprocess, 48.8ms inference, 7.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████ | 979/1074 [02:30<00:22,  4.20it/s]


0: 384x640 11 persons, 52.9ms
Speed: 2.2ms preprocess, 52.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████ | 980/1074 [02:31<00:21,  4.36it/s]


0: 384x640 12 persons, 38.2ms
Speed: 8.8ms preprocess, 38.2ms inference, 7.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████▏| 981/1074 [02:31<00:24,  3.77it/s]


0: 384x640 12 persons, 29.7ms
Speed: 13.5ms preprocess, 29.7ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  91%|█████████▏| 982/1074 [02:31<00:24,  3.76it/s]


0: 384x640 11 persons, 34.4ms
Speed: 11.0ms preprocess, 34.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 983/1074 [02:31<00:21,  4.22it/s]


0: 384x640 11 persons, 33.6ms
Speed: 2.0ms preprocess, 33.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 984/1074 [02:32<00:23,  3.80it/s]


0: 384x640 11 persons, 25.3ms
Speed: 5.1ms preprocess, 25.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 985/1074 [02:32<00:24,  3.69it/s]


0: 384x640 11 persons, 37.5ms
Speed: 2.3ms preprocess, 37.5ms inference, 8.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 986/1074 [02:32<00:22,  3.86it/s]


0: 384x640 11 persons, 59.2ms
Speed: 7.2ms preprocess, 59.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 987/1074 [02:32<00:21,  4.08it/s]


0: 384x640 7 persons, 41.7ms
Speed: 2.1ms preprocess, 41.7ms inference, 5.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 988/1074 [02:33<00:21,  4.07it/s]


0: 384x640 6 persons, 24.4ms
Speed: 5.7ms preprocess, 24.4ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 989/1074 [02:33<00:18,  4.53it/s]


0: 384x640 6 persons, 43.9ms
Speed: 1.9ms preprocess, 43.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 990/1074 [02:33<00:19,  4.34it/s]


0: 384x640 6 persons, 40.9ms
Speed: 2.4ms preprocess, 40.9ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 991/1074 [02:33<00:17,  4.87it/s]


0: 384x640 6 persons, 30.3ms
Speed: 2.3ms preprocess, 30.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 992/1074 [02:33<00:15,  5.14it/s]


0: 384x640 7 persons, 28.1ms
Speed: 2.9ms preprocess, 28.1ms inference, 5.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  92%|█████████▏| 993/1074 [02:34<00:14,  5.41it/s]


0: 384x640 8 persons, 46.6ms
Speed: 2.2ms preprocess, 46.6ms inference, 7.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 994/1074 [02:34<00:14,  5.41it/s]


0: 384x640 9 persons, 36.4ms
Speed: 2.2ms preprocess, 36.4ms inference, 7.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 995/1074 [02:34<00:15,  4.97it/s]


0: 384x640 8 persons, 37.1ms
Speed: 3.2ms preprocess, 37.1ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 996/1074 [02:34<00:15,  5.05it/s]


0: 384x640 8 persons, 25.0ms
Speed: 2.3ms preprocess, 25.0ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 997/1074 [02:34<00:14,  5.23it/s]


0: 384x640 7 persons, 43.8ms
Speed: 6.1ms preprocess, 43.8ms inference, 3.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 998/1074 [02:35<00:16,  4.59it/s]


0: 384x640 7 persons, 38.1ms
Speed: 2.1ms preprocess, 38.1ms inference, 6.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 999/1074 [02:35<00:14,  5.00it/s]


0: 384x640 8 persons, 36.8ms
Speed: 2.2ms preprocess, 36.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 1000/1074 [02:35<00:13,  5.36it/s]


0: 384x640 8 persons, 27.1ms
Speed: 2.5ms preprocess, 27.1ms inference, 14.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 1001/1074 [02:35<00:13,  5.26it/s]


0: 384x640 8 persons, 24.3ms
Speed: 3.0ms preprocess, 24.3ms inference, 4.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 1002/1074 [02:35<00:14,  5.01it/s]


0: 384x640 9 persons, 33.3ms
Speed: 2.1ms preprocess, 33.3ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 1003/1074 [02:36<00:15,  4.63it/s]


0: 384x640 8 persons, 26.2ms
Speed: 2.1ms preprocess, 26.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  93%|█████████▎| 1004/1074 [02:36<00:13,  5.21it/s]


0: 384x640 6 persons, 25.4ms
Speed: 2.5ms preprocess, 25.4ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▎| 1005/1074 [02:36<00:13,  5.20it/s]


0: 384x640 7 persons, 38.6ms
Speed: 5.7ms preprocess, 38.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▎| 1006/1074 [02:36<00:13,  4.90it/s]


0: 384x640 6 persons, 29.3ms
Speed: 10.7ms preprocess, 29.3ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▍| 1007/1074 [02:36<00:14,  4.59it/s]


0: 384x640 6 persons, 29.2ms
Speed: 4.2ms preprocess, 29.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▍| 1008/1074 [02:37<00:12,  5.33it/s]


0: 384x640 7 persons, 45.3ms
Speed: 2.1ms preprocess, 45.3ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▍| 1009/1074 [02:37<00:13,  4.76it/s]


0: 384x640 7 persons, 24.2ms
Speed: 6.6ms preprocess, 24.2ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▍| 1010/1074 [02:37<00:14,  4.55it/s]


0: 384x640 7 persons, 38.6ms
Speed: 2.1ms preprocess, 38.6ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▍| 1011/1074 [02:37<00:12,  5.10it/s]


0: 384x640 7 persons, 41.0ms
Speed: 2.4ms preprocess, 41.0ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▍| 1012/1074 [02:37<00:13,  4.59it/s]


0: 384x640 7 persons, 27.7ms
Speed: 2.4ms preprocess, 27.7ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▍| 1013/1074 [02:38<00:11,  5.27it/s]


0: 384x640 9 persons, 24.8ms
Speed: 3.1ms preprocess, 24.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  94%|█████████▍| 1014/1074 [02:38<00:10,  5.74it/s]


0: 384x640 8 persons, 47.5ms
Speed: 2.1ms preprocess, 47.5ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▍| 1015/1074 [02:38<00:10,  5.48it/s]


0: 384x640 7 persons, 31.4ms
Speed: 5.3ms preprocess, 31.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▍| 1016/1074 [02:38<00:09,  5.89it/s]


0: 384x640 6 persons, 36.1ms
Speed: 2.0ms preprocess, 36.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▍| 1017/1074 [02:38<00:09,  6.07it/s]


0: 384x640 5 persons, 29.6ms
Speed: 2.2ms preprocess, 29.6ms inference, 4.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▍| 1018/1074 [02:38<00:09,  6.19it/s]


0: 384x640 6 persons, 24.2ms
Speed: 3.6ms preprocess, 24.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▍| 1019/1074 [02:39<00:08,  6.59it/s]


0: 384x640 7 persons, 31.8ms
Speed: 3.0ms preprocess, 31.8ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▍| 1020/1074 [02:39<00:07,  6.93it/s]


0: 384x640 7 persons, 34.1ms
Speed: 2.3ms preprocess, 34.1ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▌| 1021/1074 [02:39<00:07,  7.22it/s]


0: 384x640 7 persons, 43.9ms
Speed: 1.9ms preprocess, 43.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▌| 1022/1074 [02:39<00:08,  6.17it/s]


0: 384x640 6 persons, 36.8ms
Speed: 5.1ms preprocess, 36.8ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▌| 1023/1074 [02:39<00:08,  6.10it/s]


0: 384x640 5 persons, 39.7ms
Speed: 2.3ms preprocess, 39.7ms inference, 4.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▌| 1024/1074 [02:39<00:08,  6.14it/s]


0: 384x640 7 persons, 39.6ms
Speed: 9.9ms preprocess, 39.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  95%|█████████▌| 1025/1074 [02:40<00:08,  5.64it/s]


0: 384x640 7 persons, 30.9ms
Speed: 2.2ms preprocess, 30.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▌| 1026/1074 [02:40<00:07,  6.27it/s]


0: 384x640 7 persons, 31.2ms
Speed: 1.9ms preprocess, 31.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▌| 1027/1074 [02:40<00:07,  6.71it/s]


0: 384x640 7 persons, 30.9ms
Speed: 2.0ms preprocess, 30.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▌| 1028/1074 [02:40<00:06,  7.05it/s]


0: 384x640 8 persons, 28.1ms
Speed: 5.5ms preprocess, 28.1ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▌| 1029/1074 [02:40<00:06,  7.18it/s]


0: 384x640 7 persons, 45.9ms
Speed: 3.0ms preprocess, 45.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▌| 1030/1074 [02:40<00:06,  6.63it/s]


0: 384x640 7 persons, 48.4ms
Speed: 8.4ms preprocess, 48.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▌| 1031/1074 [02:40<00:06,  6.87it/s]


0: 384x640 6 persons, 31.0ms
Speed: 4.5ms preprocess, 31.0ms inference, 4.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▌| 1032/1074 [02:41<00:06,  6.35it/s]


0: 384x640 4 persons, 1 sports ball, 37.5ms
Speed: 2.1ms preprocess, 37.5ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▌| 1033/1074 [02:41<00:05,  6.94it/s]


0: 384x640 7 persons, 37.4ms
Speed: 3.2ms preprocess, 37.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▋| 1034/1074 [02:41<00:05,  7.26it/s]


0: 384x640 6 persons, 28.1ms
Speed: 2.2ms preprocess, 28.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▋| 1035/1074 [02:41<00:05,  7.30it/s]


0: 384x640 5 persons, 31.9ms
Speed: 7.5ms preprocess, 31.9ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  96%|█████████▋| 1036/1074 [02:41<00:05,  7.36it/s]


0: 384x640 6 persons, 31.9ms
Speed: 4.1ms preprocess, 31.9ms inference, 4.5ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1037/1074 [02:41<00:05,  7.17it/s]


0: 384x640 7 persons, 26.5ms
Speed: 3.3ms preprocess, 26.5ms inference, 3.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1038/1074 [02:41<00:05,  7.04it/s]


0: 384x640 6 persons, 30.7ms
Speed: 2.2ms preprocess, 30.7ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1039/1074 [02:41<00:05,  6.65it/s]


0: 384x640 7 persons, 40.1ms
Speed: 3.4ms preprocess, 40.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1040/1074 [02:42<00:05,  6.76it/s]


0: 384x640 6 persons, 33.7ms
Speed: 6.7ms preprocess, 33.7ms inference, 2.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1041/1074 [02:42<00:04,  7.10it/s]


0: 384x640 6 persons, 37.5ms
Speed: 2.0ms preprocess, 37.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1042/1074 [02:42<00:04,  7.21it/s]


0: 384x640 7 persons, 24.2ms
Speed: 16.9ms preprocess, 24.2ms inference, 6.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1043/1074 [02:42<00:04,  6.42it/s]


0: 384x640 6 persons, 38.2ms
Speed: 3.2ms preprocess, 38.2ms inference, 2.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1044/1074 [02:42<00:04,  6.38it/s]


0: 384x640 5 persons, 43.9ms
Speed: 10.3ms preprocess, 43.9ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1045/1074 [02:42<00:04,  6.72it/s]


0: 384x640 4 persons, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 7.3ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 4 persons, 33.2ms
Speed: 1.9ms preprocess, 33.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  97%|█████████▋| 1047/1074 [02:43<00:03,  7.81it/s]


0: 384x640 5 persons, 28.5ms
Speed: 1.9ms preprocess, 28.5ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 7 persons, 24.2ms
Speed: 2.1ms preprocess, 24.2ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  98%|█████████▊| 1049/1074 [02:43<00:03,  8.01it/s]


0: 384x640 7 persons, 37.9ms
Speed: 2.1ms preprocess, 37.9ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  98%|█████████▊| 1050/1074 [02:43<00:03,  7.45it/s]


0: 384x640 8 persons, 1 sports ball, 29.0ms
Speed: 4.7ms preprocess, 29.0ms inference, 3.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  98%|█████████▊| 1051/1074 [02:43<00:03,  6.96it/s]


0: 384x640 9 persons, 24.2ms
Speed: 2.6ms preprocess, 24.2ms inference, 2.4ms postprocess per image at shape (1, 3, 384, 640)

0: 384x640 7 persons, 32.8ms
Speed: 3.5ms preprocess, 32.8ms inference, 4.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  98%|█████████▊| 1053/1074 [02:43<00:02,  7.71it/s]


0: 384x640 7 persons, 31.8ms
Speed: 2.2ms preprocess, 31.8ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  98%|█████████▊| 1054/1074 [02:44<00:02,  7.26it/s]


0: 384x640 7 persons, 29.6ms
Speed: 2.3ms preprocess, 29.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  98%|█████████▊| 1055/1074 [02:44<00:02,  6.76it/s]


0: 384x640 7 persons, 32.5ms
Speed: 5.5ms preprocess, 32.5ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  98%|█████████▊| 1056/1074 [02:44<00:02,  6.49it/s]


0: 384x640 7 persons, 32.6ms
Speed: 2.3ms preprocess, 32.6ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  98%|█████████▊| 1057/1074 [02:44<00:02,  6.92it/s]


0: 384x640 6 persons, 33.7ms
Speed: 2.0ms preprocess, 33.7ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▊| 1058/1074 [02:44<00:02,  6.75it/s]


0: 384x640 6 persons, 25.7ms
Speed: 4.7ms preprocess, 25.7ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▊| 1059/1074 [02:44<00:02,  7.32it/s]


0: 384x640 7 persons, 28.6ms
Speed: 2.2ms preprocess, 28.6ms inference, 3.4ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▊| 1060/1074 [02:44<00:01,  7.18it/s]


0: 384x640 8 persons, 34.6ms
Speed: 4.2ms preprocess, 34.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▉| 1061/1074 [02:45<00:01,  7.04it/s]


0: 384x640 8 persons, 33.3ms
Speed: 2.2ms preprocess, 33.3ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▉| 1062/1074 [02:45<00:01,  6.37it/s]


0: 384x640 7 persons, 35.1ms
Speed: 2.3ms preprocess, 35.1ms inference, 1.7ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▉| 1063/1074 [02:45<00:01,  7.10it/s]


0: 384x640 7 persons, 28.0ms
Speed: 6.1ms preprocess, 28.0ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▉| 1064/1074 [02:45<00:01,  7.59it/s]


0: 384x640 8 persons, 26.1ms
Speed: 2.1ms preprocess, 26.1ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▉| 1065/1074 [02:45<00:01,  8.09it/s]


0: 384x640 7 persons, 35.2ms
Speed: 2.3ms preprocess, 35.2ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▉| 1066/1074 [02:45<00:01,  6.12it/s]


0: 384x640 6 persons, 42.7ms
Speed: 2.3ms preprocess, 42.7ms inference, 2.3ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▉| 1067/1074 [02:45<00:01,  6.24it/s]


0: 384x640 7 persons, 42.6ms
Speed: 2.2ms preprocess, 42.6ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames:  99%|█████████▉| 1068/1074 [02:46<00:00,  6.37it/s]


0: 384x640 5 persons, 38.6ms
Speed: 3.7ms preprocess, 38.6ms inference, 8.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames: 100%|█████████▉| 1069/1074 [02:46<00:00,  6.37it/s]


0: 384x640 7 persons, 43.6ms
Speed: 2.0ms preprocess, 43.6ms inference, 3.6ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames: 100%|█████████▉| 1070/1074 [02:46<00:00,  6.21it/s]


0: 384x640 7 persons, 38.7ms
Speed: 2.3ms preprocess, 38.7ms inference, 5.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames: 100%|█████████▉| 1071/1074 [02:46<00:00,  6.34it/s]


0: 384x640 7 persons, 37.7ms
Speed: 8.4ms preprocess, 37.7ms inference, 1.9ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames: 100%|█████████▉| 1072/1074 [02:46<00:00,  6.31it/s]


0: 384x640 6 persons, 31.9ms
Speed: 3.1ms preprocess, 31.9ms inference, 8.2ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames: 100%|█████████▉| 1073/1074 [02:46<00:00,  6.29it/s]


0: 384x640 1 person, 51.1ms
Speed: 2.0ms preprocess, 51.1ms inference, 2.1ms postprocess per image at shape (1, 3, 384, 640)


Processing Frames: 100%|██████████| 1074/1074 [02:47<00:00,  6.43it/s]


Video processing completed. Output saved to output task2.mp4


The output video, with the tracking results overlaid on the test video, is available here: [video](https://drive.google.com/file/d/1DmJCnJqWLsrWpFHVp5PgJ6-OB9djBH0f/view?usp=sharing)
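
The per-frame log above is long, so a short script can help quantify what it shows, for example how many frames contain a `sports ball` detection at all. The following is a minimal sketch, not part of the original implementation: the function name `summarize` and the `SAMPLE_LOG` string are hypothetical, and the parser assumes detector summary lines look exactly like the ones printed in this run (`0: 384x640 11 persons, 1 sports ball, 61.4ms`). The four sample lines are copied from the log above.

```python
import re
from collections import Counter

# Detector summary line, e.g. "0: 384x640 11 persons, 1 sports ball, 61.4ms".
# The greedy "dets" group backtracks to the last ", " before the timing field.
LINE_RE = re.compile(r"^\d+: \d+x\d+ (?P<dets>.*), (?P<ms>[\d.]+)ms$")
# One "<count> <class name>" entry, e.g. "11 persons" or "1 sports ball".
DET_RE = re.compile(r"^(\d+) (.+)$")


def summarize(log_text):
    """Count frames, per-class frame presence, and mean inference time."""
    frames = 0
    frames_with = Counter()  # class name -> number of frames containing it
    total_ms = 0.0
    for raw in log_text.splitlines():
        match = LINE_RE.match(raw.strip())
        if match is None:
            continue  # progress bars, "Speed:" lines, blank lines
        frames += 1
        total_ms += float(match.group("ms"))
        for part in match.group("dets").split(", "):
            det = DET_RE.match(part)
            if det is None:
                continue  # e.g. a frame reported without any detections
            count, name = int(det.group(1)), det.group(2)
            # Assumption: the logger appends "s" to the class name when count > 1.
            if count > 1 and name.endswith("s"):
                name = name[:-1]
            frames_with[name] += 1
    mean_ms = total_ms / frames if frames else 0.0
    return {
        "frames": frames,
        "frames_with": dict(frames_with),
        "mean_inference_ms": mean_ms,
    }


# Four summary lines copied from the run above.
SAMPLE_LOG = """
0: 384x640 11 persons, 1 sports ball, 61.4ms
Speed: 5.4ms preprocess, 61.4ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)
0: 384x640 12 persons, 2 sports balls, 30.6ms
0: 384x640 12 persons, 30.9ms
0: 384x640 1 person, 51.1ms
"""

print(summarize(SAMPLE_LOG))
```

Running the same function over the full captured log would give the share of frames in which the ball was detected, which is a useful sanity check before judging the ball track in the output video.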
