Skip to content

Output File Format

Andrés Solís Montero edited this page Jul 4, 2016 · 9 revisions

XML Format

			<?xml version="1.0" encoding="UTF-8"?>
			<!-- Video sequence -->
			<sequence frameCount="Fc" id="Cam">
				<!-- Frame -->
				<frame id="Fid" targetCount="N">
					<target id="Tid" action="S">
						<location>x1,y1,x2,y2,...,xM,yM</location>
					</target>
			...
				</frame>
			...
			</sequence>
			...
			
			<!-- Matching of target in the video sequence -->
			<matching>
				<match target1="Tid" target2="Tid">
			...
			</matching>

Definitions

Variable Type Definition
N integer Number of people detected in a particular frame
Fc integer Number of frames
targetCount integer Number of targets detected in the frame
Cam string Camera label
Fid string Frame identifier, i.e., the frame's file number
S string Target state identifier
Tid string Target identifier
xi,yi float Coordinates (x,y) of the of the polygonal vertex number i
  1. There are six states a target can be: Lining Up, Ordering, Paying, Waiting, Picking Up, Leaving.
  2. Target identifier is unique in all the sequence and it is created by the concatenation of: <frame's time stamp><camera identifier><target identifier>

Note about the states to be annotated

The states symbolized by variable S represents the following actions to be annotated by the user:

  1. Line Up: A client (target) enters into this state the first time appears in the cashier's camera; it leaves this state when the same target moves to the cashier's area to place an order.
  2. Order: A target enters into this state when it moves into the cashier's area to place an order; it leaves this state once it starts to pay.
  3. Pay: A target changes enters this state when start paying for its order; it leaves this state when its starts moving away from the cashier's area.
  4. Waiting: A target that moves away from the cashier area after paying enters into this state; it remains in this state despite in which camera appears until it starts picking up its order.
  5. Pick Up: This state marks when the target picks it order from the picking area; it ends when the target moves away from the picking area with its order.
  6. Leaving: This state marks the instant when the target moves away from the picking area with its order; it will remain in this state until it disappear from the view in all the cameras.

Status encoding

Status Code
Line Up lineUp
Order order
Pay pay
Waiting wait
Pick Up pickUp
Leave leave

Annotation states

Example:

			<?xml version="1.0" encoding="UTF-8"?>
			<!-- Video sequence -->
			<sequence frameCount="2" id="0">
				<!-- Frame -->
				<frame id="1448560259342000000" targetCount="2">
					<target id="1448560259342000000" action="wainting">
						<targetArea>205.0,164.0,345.0,164.0,345.0,350.0,205.0,350.0</targetArea>
					</target>
					<target id="1448560259342000001" action="ordered">
						<targetArea>42.0,191.0,166.0,191.0,166.0,323.0,42.0,323.0</targetArea>
					</target>
				</frame>
				<frame id="1448560259865000000" targetCount="1">
					<target id="1448560259865000000" action="waiting">
						<targetArea>215.0,154.0,325.0,164.0,325.0,350.0,225.0,345.0</targetArea>
					</target>
				</frame>
			</sequence>

			<sequence frameCount="2" id="1">
				<!-- Frame -->
				<frame id="1448560259342001000" targetCount="1">
					<target id="1448560259342001001" action="ordered">
						<targetArea>42.0,191.0,166.0,191.0,166.0,323.0,42.0,323.0</targetArea>
					</target>
				</frame>
				<frame id="1448560259865001000" targetCount="3">
					<target id="1448560259865001000" action="served">
						<targetArea>215.0,154.0,325.0,164.0,325.0,350.0,225.0,345.0</targetArea>
					</target>
					<target id="1448560259342001001" action="ordered">
						<targetArea>42.0,191.0,166.0,191.0,166.0,323.0,42.0,323.0</targetArea>
					</target>
					<target id="1448560259342001001" action="ordered">
						<targetArea>143.0,91.0,66.0,101.0,162.0,345.0,142.0,328.0</targetArea>
					</target>
				</frame>
			</sequence>
			
			<!-- Matching of target in the video sequence -->
			<matching>
				<match target1="1448560259342000000" target2="1448560259865000000">
				<match target1="1448560259342000000" target2="1448560259342000001">
				<match target1="1448560259865000000" target2="1448560259342001001">
			</matching>

Note:

The first entry of the XML block defined as matching illustrates the target’s matching in different frames of the same camera. In this example, the client with target id "1448560259342000000", which appears as the target "000" in the frame sequence with time stamp "1448560259342" from the camera "000", is matched with the client with target id "1448560259865000000", which is labeled "000" in the frame with time stamp "1448560259865" originated in the same camera.

Clone this wiki locally