Each record in the dataset is described by the fields listed below. Currently, there are two datasets, one for each language: Python and Java.
src
: The raw source code of the program.
complexity
: The time complexity of the program. It is one of the following seven classes: 'constant', 'logn', 'linear', 'nlogn', 'quadratic', 'cubic', 'exponential'.
problem
: The problem number of the program. It combines the contest round number and the problem ID, in the format "round number"_"problem ID".
from
: The origin of the program. In this dataset, all source code comes from CODEFORCES.
tags
: The tags for the specific problem. A problem can have multiple tags (e.g. dp, trees, implementation).
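
The field descriptions above can be turned into a small sanity check. The following is a minimal sketch, not part of the dataset tooling: the helper names `split_problem` and `validate_record`, the example record, and the assumption that each record is a plain `dict` are all hypothetical. How `tags` is stored (list or string) is not stated above, so the sketch does not check its type.

```python
# Hypothetical helpers for checking one record against the field descriptions.
# Requires Python 3.9+ for the built-in generic type hints.

# The seven complexity classes listed in the field description.
COMPLEXITY_CLASSES = (
    "constant", "logn", "linear", "nlogn", "quadratic", "cubic", "exponential",
)

REQUIRED_FIELDS = ("src", "complexity", "problem", "from", "tags")


def split_problem(problem: str) -> tuple[str, str]:
    """Split a "round number"_"problem ID" string at the first underscore."""
    round_number, sep, problem_id = problem.partition("_")
    if not sep or not round_number or not problem_id:
        raise ValueError(f"unexpected problem format: {problem!r}")
    return round_number, problem_id


def validate_record(record: dict) -> list[str]:
    """Return a list of human-readable problems found in the record."""
    errors = [f"missing field: {name}" for name in REQUIRED_FIELDS if name not in record]
    if "complexity" in record and record["complexity"] not in COMPLEXITY_CLASSES:
        errors.append(f"unknown complexity class: {record['complexity']!r}")
    if "problem" in record:
        try:
            split_problem(record["problem"])
        except ValueError as exc:
            errors.append(str(exc))
    return errors


# Made-up record for illustration only; it is not taken from the dataset.
example = {
    "src": "print(input())",
    "complexity": "constant",
    "problem": "1000_A",
    "from": "CODEFORCES",
    "tags": ["implementation"],  # assumption: tags stored as a list of strings
}

print(validate_record(example))
print(split_problem(example["problem"]))
```

Returning a list of messages rather than raising on the first problem makes it easier to report every issue in a record at once.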
Migrated as part of CodeComplex.