UniRank _{v0.1.0, work in progress}

A Ranking Model Benchmark for Unified Sequential Modeling and Feature Interaction

UniRank is an open PyTorch benchmark for large-scale recommendation ranking models. It focuses on a practical setting that is increasingly common in industrial recommender systems: ranking models must jointly learn from heterogeneous non-sequential features, target item features, and long user behavior sequences under multi-feedback objectives such as click, follow, like, share, comment, long-view, and conversion.

The project is built to make modern unified ranking architectures easier to compare, reproduce, and extend. It provides standardized dataset configurations, model implementations, distributed training utilities, mixed precision support, blocked data loading for large datasets, and sparse attention acceleration for long-sequence models.

Why UniRank?

Modern ranking research is moving from isolated feature interaction or sequence pooling modules toward unified architectures that model feature fields and user behavior tokens together. However, many strong ranking models are released from industrial systems where data, implementations, and infrastructure are not fully available. This makes it difficult to answer basic research questions:

Which architecture works best under the same data split, sequence length, and metric protocol?
How should feature interaction and sequential modeling be combined?
How do models behave across different feedback tasks rather than only CTR?
What engineering support is needed to train ranking models on industrial-scale data?

UniRank addresses these gaps by collecting representative ranking models, unified data processing logic, and reproducible experiment settings in one benchmark.

Architecture Design

UniRank follows a unified ranking pipeline. Raw user, item, context, and action features are embedded, converted into model-specific tokens, passed through feature interaction or sequence interaction layers, and finally predicted by task-specific towers.

Figure 1. Traditional New Impression Only Paradigm. Most conventional ranking systems train on the latest impressed target item only. Historical positive feedback is used as auxiliary behavior context, usually through target attention, pooling, or aggregation, before being combined with the target item, user profile, and context features in a feature interaction layer. This paradigm is efficient, but it treats each target impression as an independent sample and does not fully exploit the step-by-step evolution of user behavior.

Figure 2. UniRank Auto-Regressive Paradigm. UniRank reorganizes user histories as sequential training samples. Each behavior step can be represented with action-aware sequential tokens, target item, and non-sequential feature tokens. Instead of only predicting the latest impression, the model learns from the chronological behavior sequence and supports multi-task prediction at different positions. This design better matches long user histories and enables unified sequence-feature interaction.

Following the paper, UniRank organizes representative unified ranking models into two architectural paradigms:

Paradigm	Description	Representative Models
Unified Interaction after Sequence Pooling and Non-sequence Tokenization	Behavior sequences are first pooled or aggregated into compact sequential representations. These representations are then tokenized together with non-sequential features into a unified token space for subsequent interaction modeling.	HiFormer, RankMixer, Zenith, UniMixer, HeMix
Layer-wise Unified Interaction	Keep sequence tokens and non-sequence tokens inside the interaction layers, allowing behavior tokens, field tokens, and target tokens to exchange information throughout the unified interaction network.	OneTrans, HyFormer, MixFormer, INFNet, EST, SORT, TokenFormer, LONGER, UltraHSTU

Design choices in this repository are intentionally practical:

Multi-feedback ranking: each dataset can define multiple binary feedback tasks and evaluate AUC/gAUC per task.
Auto-regressive / user-centric training support: long behavior histories can be represented as structured action sequences rather than only a latest-impression sample.
Distributed training: torchrun + DDP are supported through run_expid.py.
Large data loading: blocked parquet loading is supported for large datasets such as TencentGR-10M.
Mixed precision and operator acceleration: bf16 training and sparse/flex attention paths are available for compatible models.

Repository Structure

UniRank/
+-- config/
|   +-- dataset_config.yaml      # Dataset paths, feature schemas, labels, and blocked-loading options
|   +-- model_config.yaml        # Experiment ids and hyperparameters
+-- data/
|   +-- QK_Video_Action/
|   +-- KuaiRand_Video_Action/
|   +-- TencentGR_10M_Action_Blocked/
+-- fuxictr/                     # Training, feature, metric, and layer utilities based on FuxiCTR
+-- model_zoo/                   # Ranking model implementations
+-- checkpoints/                 # Saved models and experiment logs
+-- test/                        # Metric and utility tests
+-- UniRank_Dataloader.py        # UniRank-specific sequence/action dataloader
+-- run_expid.py                 # Run one experiment
+-- run_all.sh                   # Run a list of experiments
+-- run_param_tuner.py           # Hyperparameter tuning entry
+-- autotuner.py                 # Tuning utilities
+-- requirements.txt
+-- README.md

Datasets

Raw Datasets

Preprocessed Datasets

Place the downloaded preprocessed datasets under ./data/ using the same directory names as the dataset ids in config/dataset_config.yaml.

Models

No.	Model	Publication
1	DIN	Deep Interest Evolution Network for Click-Through Rate Prediction
2	HiFormer	Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems
3	RankMixer	RankMixer: Scaling Up Ranking Models in Industrial Recommenders
4	Zenith	Zenith: Scaling up Ranking Models for Billion-scale Livestreaming Recommendation
5	UniMixer	UniMixer: A Unified Architecture for Scaling Laws in Recommendation Systems
6	HeMix	Query-Mixed Interest Extraction and Heterogeneous Interaction: A Scalable CTR Model for Industrial Recommender Systems
7	LONGER	LONGER: Scaling Up Long Sequence Modeling in Industrial Recommenders
8	OneTrans	OneTrans: Unified Feature Interaction and Sequence Modeling with One Transformer in Industrial Recommender
9	HyFormer	HyFormer: Revisiting the Roles of Sequence Modeling and Feature Interaction in CTR Prediction
10	MixFormer	MixFormer: Co-Scaling Up Dense and Sequence in Industrial Recommenders
11	INFNet	INFNet: A Task-aware Information Flow Network for Large-Scale Recommendation Systems
12	EST	EST: Towards Efficient Scaling Laws in Click-Through Rate Prediction via Unified Modeling
13	SORT	SORT: A Systematically Optimized Ranking Transformer for Industrial-scale Recommenders
14	TokenFormer	TokenFormer: Unify the Multi-Field and Sequential Recommendation Worlds
15	UltraHSTU	Bending the Scaling Law Curve in Large-Scale Recommendation Systems

Additional experimental or auxiliary implementations may also appear in model_zoo/.

Benchmark

The table below reports the preliminary benchmarking results under a fixed sequence length of 100. For a fair comparison, all models are configured with three layers. The token dimension is set to 128 for QK-Video and 256 for KuaiRand and TAAC-25.

Figure 3. Preliminary Benchmark Results. The benchmark evaluates 15 ranking models on QK-Video, KuaiRand, and TAAC-25 under AUC and gAUC. Results are reported for multiple feedback tasks, including click, follow, like, share, comment, long view, and conversion. Bold values indicate top-performing results for each task-metric pair.

Installation

conda create -n UniRank python=3.9
conda activate UniRank

pip install torch==2.7.0 torchvision==0.22.0 torchaudio==2.7.0 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt

How to Use

1. Download datasets

Download the preprocessed datasets from Hugging Face and place them under ./data/:

data/
+-- QK_Video_Action/
+-- KuaiRand_Video_Action/
+-- TencentGR_10M_Action_Blocked/

Check config/dataset_config.yaml if you want to change paths, feature schemas, labels, or blocked-loading settings.

2. Run one experiment

Single GPU:

python run_expid.py --config ./config --expid DIN_KuaiRand_Video_Action --gpu 0

Multi-GPU DDP:

torchrun --standalone --nproc_per_node=2 run_expid.py \
  --config ./config \
  --expid DIN_KuaiRand_Video_Action \
  --gpu 0,1

Experiment ids are defined in config/model_config.yaml and usually follow:

<Model>_<Dataset>

Examples:

UltraHSTU_QK_Video_Action
TokenFormer_KuaiRand_Video_Action
LONGER_TencentGR_10M_Action

3. Run a batch of experiments

Edit run_all.sh to uncomment the experiments you want, then run:

chmod +x run_all.sh
./run_all.sh

Logs and checkpoints are written to ./checkpoints/ and ./logs/ when enabled by the running script/configuration.

4. Add a new model

Add the model implementation to model_zoo/YourModel.py.
Export it in model_zoo/__init__.py.
Add an experiment block to config/model_config.yaml.
Reuse UniRank_Dataloader.py unless the model needs a custom input format.
Run python run_expid.py --config ./config --expid YourModel_Dataset --gpu 0.

Configuration Notes

dataset_config.yaml defines feature columns, label columns, parquet paths, sequence length metadata, and blocked data loading.
model_config.yaml defines model hyperparameters, batch size, optimizer, task list, metrics, monitor rule, and sequence length.
run_expid.py initializes feature encoders, builds dataloaders, sets up DDP, constructs the model from model_zoo, trains, validates, and optionally evaluates on the test split.
UniRank_Dataloader.py handles action-aware sequence construction and large blocked parquet loading.

Acknowledgement

UniRank is built on top of, and deeply inspired by, the excellent FuxiCTR project. We sincerely thank the FuxiCTR authors and contributors for their open-source work on reproducible CTR and ranking model research.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

UniRank _{v0.1.0, work in progress}

Why UniRank?

Architecture Design

Repository Structure

Datasets

Raw Datasets

Preprocessed Datasets

Models

Benchmark

Installation

How to Use

1. Download datasets

2. Run one experiment

3. Run a batch of experiments

4. Add a new model

Configuration Notes

Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
assets/figures		assets/figures
checkpoints		checkpoints
config		config
data		data
fuxictr		fuxictr
model_zoo		model_zoo
LICENSE		LICENSE
README.md		README.md
UniRank_Dataloader.py		UniRank_Dataloader.py
autotuner.py		autotuner.py
run_all.sh		run_all.sh
run_expid.py		run_expid.py
run_param_tuner.py		run_param_tuner.py

Folders and files

Latest commit

History

Repository files navigation

UniRank v0.1.0, work in progress

Why UniRank?

Architecture Design

Repository Structure

Datasets

Raw Datasets

Preprocessed Datasets

Models

Benchmark

Installation

How to Use

1. Download datasets

2. Run one experiment

3. Run a batch of experiments

4. Add a new model

Configuration Notes

Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

UniRank _{v0.1.0, work in progress}

Packages