You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When forwarding, sub-backends are executed in series. If the result of a non-final sub-backend is empty, the execution is terminated and an exception is thrown. If the result of a non-final sub-backend is non-empty, the filter will be executed. The default is `swap`.
69
70
70
71
### Input Range
71
-
If the input data range of all sub-backends is [1,1], then the range of `Sequential` is [1,1]. Otherwise, it is the union of the input ranges of sub-backends whose input ranges are not [1,1]. **Currently, only one sub-backend is allowed to satisfy `max() != 1 && max() != UINT32_MAX` at most**. In `Sequential`, currently only one sub-backend with non-trivial input range is allowed at most, and this sub-backend can be regarded as the main functional implementer of the entire `Sequential`, while other backends are auxiliary or functional backends.
72
+
If the input data range of all sub-backends is [1,1], then the range of `Sequential` is [1,1]. Otherwise, it is the union of the input ranges of sub-backends whose input ranges are not [1,1]. **Currently, only one sub-backend is allowed to satisfy `max() != 1 && max() != UINT32_MAX` at most**. In `Sequential`, currently only one sub-backend with non-trivial input range is allowed at most, and this sub-backend can be regarded as the main functional implementer of the entire `Sequential`, while other backends are auxiliary or functional backends.
73
+
74
+
## Range
75
+
Set a new maximum and minimum range [target_min, target_max].
76
+
Use dynamic programming to determine whether all the integer 'target' in [target_min, target_max] can be represented as the sum
77
+
of multiple numbers within `Range::backend`'s [min, max] range, and provide a result with the largest possible values.
78
+
### Initialization
79
+
:::tip Parameters
80
+
-**range** - Target [min, max]. for example, `range="1,10"`.
81
+
-**Range::backend** - Proxy backend.
82
+
:::
83
+
Initialization fails if the new range cannot be represented by the range in the old range. For example, `target=10, [min, max]=[2,9]` => `10 = 8+2`. And the following situations will fail: `range=[10,10], [min, max]=[8,8]` => `10 != 8`.
Copy file name to clipboardExpand all lines: docs/tools/quantization.mdx
+1-1
Original file line number
Diff line number
Diff line change
@@ -11,7 +11,7 @@ The post-training quantization process of TensorRT mainly includes two steps:
11
11
- Prepare quantization data, about 500 pieces. We will save the data in PyTorch format in the backend.
12
12
- Launch the quantization process with the prepared data and generate the model.
13
13
14
-
For more details, please refer to the [example](https://github.com/torchpipe/torchpipe//examples/int8/README_en.md).
14
+
For more details, please refer to the [example](https://github.com/torchpipe/torchpipe/tree/develop/examples/int8).
15
15
16
16
## Training-based Quantization (based on pytorch_quantization)
17
17
In the [official notebook](https://github.com/NVIDIA/TensorRT/blob/release/8.6/quickstart/quantization_tutorial/qat-ptq-workflow.ipynb), NVIDIA summarizes how to improve quantization accuracy through training-based quantization in PyTorch.
0 commit comments