Skip to content

LivepoolQ/Socialality

Repository files navigation

Socialality

This is the official code repo of our paper "Socialality Anchors: Towards Group-bounded Trajectory Prediction".

Socialality is an extended journal-version of our previous work GrouPConCeption.

Getting Started

You can clone this repository by the following command:

git clone https://github.com/LivepoolQ/Socialality.git

Then, run the following command to initialize all submodules:

git submodule update --init --recursive

Requirements

The code is developed with Python 3.10. Additional packages used are included in the requirements.txt file.

Warning

We recommend installing all required Python packages in a virtual environment (like the conda environment). Otherwise, there COULD be other problems due to the package version conflicts.

Run the following command to install the required packages in your Python environment:

pip install -r requirements.txt

Note

More details will be gradually uploaded soon. Thank you for your support to our work.

Contact us

Ziqian Zou (@LivepoolQ): ziqianzoulive@icloud.com

GrouPConCeption


Args Used

Please specify your customized args when training or testing your model in the following way:

python main.py --ARG_KEY1 ARG_VALUE2 --ARG_KEY2 ARG_VALUE2 -SHORT_ARG_KEY3 ARG_VALUE3 ...

where ARG_KEY is the name of args, and ARG_VALUE is the corresponding value. All args and their usages are listed below.

About the argtype:

  • Args with argtype=static can not be changed once after training. When testing the model, the program will not parse these args to overwrite the saved values.
  • Args with argtype=dynamic can be changed anytime. The program will try to first parse inputs from the terminal and then try to load from the saved JSON file.
  • Args with argtype=temporary will not be saved into JSON files. The program will parse these args from the terminal at each time.

Basic Args

--K

The number of multiple generations when testing. This arg only works for multiple-generation models.

  • Type=int, argtype=dynamic
  • The default value is 20.
--K_train

The number of multiple generations when training. This arg only works for multiple-generation models.

  • Type=int, argtype=static
  • The default value is 10.
--anntype

Model's predicted annotation type. Can be 'coordinate' or 'boundingbox'.

  • Type=str, argtype=static
  • The default value is coordinate.
--auto_clear

Controls whether to clear all other saved weights except for the best one. It performs similarly to running python scripts/clear.py --logs logs.

  • Type=int, argtype=temporary
  • The default value is 1.
--batch_size (short for -bs)

Batch size when implementation.

  • Type=int, argtype=dynamic
  • The default value is 5000.
--compute_loss

Controls whether to compute losses when testing.

  • Type=int, argtype=temporary
  • The default value is 0.
--compute_metrics_with_types

Controls whether to compute metrics separately on different kinds of agents.

  • Type=int, argtype=temporary
  • The default value is 0.
--compute_statistical_metrics

(bool) Choose whether to compute metrics (ADE/FDE) as mean $\pm$ std.

  • Type=int, argtype=temporary
  • The default value is 0.
--dataset

Name of the video dataset to train or evaluate. For example, 'ETH-UCY' or 'SDD'. NOTE: DO NOT set this argument manually.

  • Type=str, argtype=static
  • The default value is Unavailable.
--down_sampling_rate

Selects whether to down-sample from multiple-generated predicted trajectories. This arg only works for multiple-generative models.

  • Type=float, argtype=temporary
  • The default value is 1.0.
--draw_results (short for -dr)

Controls whether to draw visualized results on video frames. Accept the name of one video clip. The codes will first try to load the video file according to the path saved in the plist file (saved in dataset_configs folder), and if it loads successfully it will draw the results on that video, otherwise it will draw results on a blank canvas. Note that test_mode will be set to 'one' and force_split will be set to draw_results if draw_results != 'null'.

  • Type=str, argtype=temporary
  • The default value is null.
--draw_videos

Controls whether to draw visualized results on video frames and save them as images. Accept the name of one video clip. The codes will first try to load the video according to the path saved in the plist file, and if successful it will draw the visualization on the video, otherwise it will draw on a blank canvas. Note that test_mode will be set to 'one' and force_split will be set to draw_videos if draw_videos != 'null'.

  • Type=str, argtype=temporary
  • The default value is null.
--epochs

Maximum training epochs.

  • Type=int, argtype=static
  • The default value is 500.
--experimental

NOTE: It is only used for code tests.

  • Type=bool, argtype=temporary
  • The default value is False.
--feature_dim

Feature dimensions that are used in most layers.

  • Type=int, argtype=static
  • The default value is 128.
--force_anntype

Assign the prediction type. It is now only used for silverballers models that are trained with annotation type coordinate but to be tested on datasets with annotation type boundingbox.

  • Type=str, argtype=temporary
  • The default value is null.
--force_clip

Force test video clip (ignore the train/test split). It only works when test_mode has been set to one. .

  • Type=str, argtype=temporary
  • The default value is null.
--force_dataset

Force test dataset (ignore the train/test split). It only works when test_mode has been set to one.

  • Type=str, argtype=temporary
  • The default value is null.
--force_split

Force test dataset (ignore the train/test split). It only works when test_mode has been set to one.

  • Type=str, argtype=temporary
  • The default value is null.
--gpu

Speed up training or test if you have at least one NVidia GPU. If you have no GPUs or want to run the code on your CPU, please set it to -1. NOTE: It only supports training or testing on one GPU.

  • Type=str, argtype=temporary
  • The default value is 0.
--help (short for -h)

Print help information on the screen.

  • Type=str, argtype=temporary
  • The default value is null.
--input_pred_steps

Indices of future time steps that are used as extra model inputs. It accepts a string that contains several integer numbers separated with '_'. For example, '3_6_9'. It will take the corresponding ground truth points as the input when training the model, and take the first output of the former network as this input when testing the model. Set it to 'null' to disable these extra model inputs.

  • Type=str, argtype=static
  • The default value is null.
--interval

Time interval of each sampled trajectory point.

  • Type=float, argtype=static
  • The default value is 0.4.
--load (short for -l)

Folder to load model weights (to test). If it is set to null, the training manager will start training new models according to other reveived args. NOTE: Leave this arg to null when training new models.

  • Type=str, argtype=temporary
  • The default value is null.
--load_epoch

Load model weights that is saved after specific training epochs. It will try to load the weight file in the load dir whose name is end with _epoch${load_epoch}. This arg only works when the auto_clear arg is disabled (by passing --auto_clear 0 when training). Set it to -1 to disable this function.

  • Type=int, argtype=temporary
  • The default value is -1.
--load_part

Choose whether to load only a part of the model weights if the state_dict of the saved model and the model in the code do not match.

IMPORTANT NOTE: This arg is only used for some ablation experiments. It MAY lead to incorrect predictions or metrics.

  • Type=int, argtype=temporary
  • The default value is 0.
--log_dir

Folder to save training logs and model weights. Logs will save at ${save_base_dir}/${log_dir}. DO NOT change this arg manually. (You can still change the saving path by passing the save_base_dir arg.).

  • Type=str, argtype=static
  • The default value is Unavailable.
--loss_weights

Configure the agent-wise loss weights. It now only supports the dataset-clip-wise re-weight.

  • Type=str, argtype=dynamic
  • The default value is {}.
--lr (short for -lr)

Learning rate.

  • Type=float, argtype=static
  • The default value is 0.001.
--macos

(Experimental) Choose whether to enable the MPS (Metal Performance Shaders) on Apple platforms (instead of running on CPUs).

  • Type=int, argtype=temporary
  • The default value is 0.
--max_agents

Max number of agents to predict per frame. It only works when model_type == 'frame-based'.

  • Type=int, argtype=dynamic
  • The default value is 50.
--model

The model type used to train or test.

  • Type=str, argtype=static
  • The default value is none.
--model_name

Customized model name.

  • Type=str, argtype=static
  • The default value is model.
--model_type

Model type. It can be 'agent-based' or 'frame-based'.

  • Type=str, argtype=static
  • The default value is agent-based.
--noise_depth

Depth of the random noise vector.

  • Type=int, argtype=static; also: --depth
  • The default value is 16.
--obs_frames (short for -obs)

Observation frames for prediction.

  • Type=int, argtype=static
  • The default value is 8.
--output_pred_steps

Indices of future time steps to be predicted. It accepts a string that contains several integer numbers separated with '_'. For example, '3_6_9'. Set it to 'all' to predict points among all future steps.

  • Type=str, argtype=static; also: --key_points
  • The default value is all.
--pmove

(Pre/post-process Arg) Index of the reference point when moving trajectories.

  • Type=int, argtype=static
  • The default value is -1.
--pred_frames (short for -pred)

Prediction frames.

  • Type=int, argtype=static
  • The default value is 12.
--preprocess

Controls whether to run any pre-process before the model inference. It accepts a 3-bit-like string value (like '111'): - The first bit: MOVE trajectories to (0, 0); - The second bit: re-SCALE trajectories; - The third bit: ROTATE trajectories.

  • Type=str, argtype=static
  • The default value is 100.
--restore

Path to restore the pre-trained weights before training. It will not restore any weights if args.restore == 'null'.

  • Type=str, argtype=temporary
  • The default value is null.
--restore_args

Path to restore the reference args before training. It will not restore any args if args.restore_args == 'null'.

  • Type=str, argtype=temporary
  • The default value is null.
--save_base_dir

Base folder to save all running logs.

  • Type=str, argtype=static
  • The default value is ./logs.
--split (short for -s)

The dataset split that used to train and evaluate.

  • Type=str, argtype=static
  • The default value is zara1.
--start_test_percent

Set when (at which epoch) to start validation during training. The range of this arg should be 0 <= x <= 1. Validation may start at epoch args.epochs * args.start_test_percent.

  • Type=float, argtype=temporary
  • The default value is 0.0.
--step

Frame interval for sampling training data.

  • Type=float, argtype=dynamic
  • The default value is 1.0.
--test_mode

Test settings. It can be 'one', 'all', or 'mix'. When setting it to one, it will test the model on the args.force_split only; When setting it to all, it will test on each of the test datasets in args.split; When setting it to mix, it will test on all test datasets in args.split together.

  • Type=str, argtype=temporary
  • The default value is mix.
--test_step

Epoch interval to run validation during training.

  • Type=int, argtype=temporary
  • The default value is 1.
--update_saved_args

Choose whether to update (overwrite) the saved arg files or not.

  • Type=int, argtype=temporary
  • The default value is 0.
--verbose (short for -v)

Controls whether to print verbose logs and outputs to the terminal.

  • Type=int, argtype=temporary
  • The default value is 0.

Visualization Args

--distribution_steps

Controls which time step(s) should be considered when visualizing the distribution of forecasted trajectories. It accepts one or more integer numbers (started with 0) split by '_'. For example, '4_8_11'. Set it to 'all' to show the distribution of all predictions.

  • Type=str, argtype=temporary
  • The default value is all.
--draw_distribution (short for -dd)

Controls whether to draw distributions of predictions instead of points. If draw_distribution == 0, it will draw results as normal coordinates; If draw_distribution == 1, it will draw all results in the distribution way, and points from different time steps will be drawn with different colors.

  • Type=int, argtype=temporary
  • The default value is 0.
--draw_exclude_type

Draw visualized results of agents except for user-assigned types. If the assigned types are "Biker_Cart" and the draw_results or draw_videos is not "null", it will draw results of all types of agents except "Biker" and "Cart". It supports partial match, and it is case-sensitive.

  • Type=str, argtype=temporary
  • The default value is null.
--draw_extra_outputs

Choose whether to draw (put text) extra model outputs on the visualized images.

  • Type=int, argtype=temporary
  • The default value is 0.
--draw_full_neighbors

Choose whether to draw the full observed trajectories of all neighbor agents or only the last trajectory point at the current observation moment.

  • Type=int, argtype=temporary
  • The default value is 0.
--draw_index

Indexes of test agents to visualize. Numbers are split with _. For example, '123_456_789'.

  • Type=str, argtype=temporary
  • The default value is all.
--draw_lines

Choose whether to draw lines between each two 2D trajectory points.

  • Type=int, argtype=temporary
  • The default value is 0.
--draw_on_empty_canvas

Controls whether to draw visualized results on the empty canvas instead of the actual video.

  • Type=int, argtype=temporary
  • The default value is 0.
--draw_with_plt

(bool) Choose whether to use PLT as the preferred method for visualizing trajectories (on the empty canvas). It will try to visualize all points on the scene images if this arg is not enabled. .

  • Type=int, argtype=temporary
  • The default value is 0.

Socialality Args

--Kc
    .
  • Type=int, argtype=static
  • The default value is 20.
--ego_capacity

Number of neighbors to be carefully considered in ego predictor.

  • Type=int, argtype=dynamic
  • The default value is -1.
--ego_loss_ratio

Ratio of ego loss when computing sum of l2 loss and ego loss.

  • Type=float, argtype=static
  • The default value is 0.4.
--ego_predictor_type

Choose which kind of backbones ego predictor will use. - linear: - fc: - tran:

  • Type=str, argtype=dynamic
  • The default value is tran.
--ego_t_f

Prediction time steps calculated in ego predictor.

  • Type=int, argtype=static
  • The default value is -1.
--ego_t_h

Observation time steps calculated in ego predictor.

  • Type=int, argtype=static
  • The default value is -1.
--encode_agent_types

Choose whether to encode the type name of each agent. It is mainly used in multi-type-agent prediction scenes, providing a unique type-coding for each type of agents when encoding their trajectories.

  • Type=int, argtype=static
  • The default value is 0.
--fix_distance_anchor (short for -fix_dis)

Choose whether to fix distance anchor.

  • Type=int, argtype=static
  • The default value is 0.
--fix_speed_anchor (short for -fix_speed)

Choose whether to speed distance anchor.

  • Type=int, argtype=static
  • The default value is 0.
--generation_num

Number of multi-style generation.

  • Type=int, argtype=static
  • The default value is 20.
--group_distance

(Working in process)

  • Type=int, argtype=static
  • The default value is 6.
--group_type

Choose which group method to use, including [0, 1, 2]: - 0: Vanilla ; - 1: TODO Model; - 2: TODO.

  • Type=int, argtype=static
  • The default value is 1.
--insights_num

Number of insights.

  • Type=int, argtype=static
  • The default value is 3.
--l2_loss_ratio

Ratio of l2 loss when computing sum of l2 loss and ego loss.

  • Type=float, argtype=static
  • The default value is 1.0.
--output_units

Set number of the output units of trajectory encoding.

  • Type=int, argtype=static
  • The default value is 32.
--previews_only

Choose whether to only use previews when grouping. NOTE This args can only be used when --use_mixed_trajectory 1.

  • Type=int, argtype=static
  • The default value is 0.
--set_anchor_value

Choose whether to set anchor value globally.

  • Type=int, argtype=static
  • The default value is 0.
--set_distance_anchor (short for -set_dis)

Set distance anchor value globally.

  • Type=int, argtype=static
  • The default value is -1.
--set_speed_anchor (short for -set_speed)

Set distance anchor value globally.

  • Type=int, argtype=static
  • The default value is -1.
--use_group

Choose whether to use pedestrian groups when calculating SocialCircle.

  • Type=int, argtype=static
  • The default value is 0.
--use_mixed_trajectory (short for -mix)

Choose whether to use mixed time window trajectory.

  • Type=int, argtype=static
  • The default value is 1.
--view_angle

Value of conception view field.

  • Type=float, argtype=static
  • The default value is 3.141592653589793.
--vis_anchors

Choose whether to visualize anchors.

  • Type=int, argtype=temporary
  • The default value is 0.
--vis_ego_predictor

Choose whether to visualize trajectories forecasted by the ego predictior. It accepts three values:

  • 0: Do nothing; - 1: Visualize ego predictor's all predictions; - 2: Visualize ego predictor's mean predicton for each neighbor.

NOTE that this arg only works in the Playground mode, or the program will be killed immediately.

  • Type=int, argtype=temporary
  • The default value is 0.
--vis_group_members

Choose whether to visualize group members.

NOTE that this arg only works in the Playground mode, or the program will be killed immediately.

  • Type=int, argtype=temporary
  • The default value is 0.

Playground Args

--clip

The video clip to run this playground.

  • Type=str, argtype=temporary
  • The default value is zara1.
--compute_social_mod

(bool) Choose whether to enable the computing of social modifications.

  • Type=int, argtype=temporary
  • The default value is 0.
--default_agent

Set the default index of agent to be predicted.

  • Type=int, argtype=temporary
  • The default value is 0.
--do_not_draw_neighbors

(bool) Choose whether to draw neighboring-agents' trajectories.

  • Type=int, argtype=temporary
  • The default value is 0.
--draw_seg_map

(bool) Choose whether to draw segmentation maps on the canvas.

  • Type=int, argtype=temporary
  • The default value is 1.
--lite

(bool) Choose whether to show the lite-version's visualization window.

  • Type=int, argtype=temporary
  • The default value is 0.
--physical_manual_neighbor_mode

Mode for the manual neighbor on segmentation maps. - Mode 1: Add obstacles to the given position; - Mode 0: Set areas to be walkable.

  • Type=float, argtype=temporary
  • The default value is 1.0.
--points

The number of points to simulate the trajectory of manual neighbor. It only accepts 2 or 3.

  • Type=int, argtype=temporary
  • The default value is 2.
--save_full_outputs

(bool) Choose whether to save all outputs as images.

  • Type=int, argtype=temporary
  • The default value is 0.
--show_manual_neighbor_boxes

(Working in process)

  • Type=int, argtype=temporary
  • The default value is 0.

About

Official code of "Socialality Anchors: Towards Group-bounded Trajectory Prediction".

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages