Full support of SERIAL data type #1628

ray6080 · 2023-06-05T02:32:55Z

For now, SERIAL is added as an experimental feature.
Users can define the primary key property of node tables as SERIAL to avoid the creation of HashIndex in node tables, thus can speed up data ingestion a lot (no index construction and lookups).

Internally, SERIAL is interpreted as INT64, and its values are auto-incremented as new nodes are appended.
Essentially, SERIAL is an alias of a SEQUENCE with data type as int64, start as 0, increment as 1.
For node tuples, we assume their SERIAL values are aligned with our internal node offsets.

Given a node table person with the schema (ID SERIAL, age INT64, name STRING, PRIMARY KEY(ID)).
To completely support SERIAL, we should consider following cases:

Support copy to specified table properties only. For example, users can specify COPY person(name) FROM 'person_names.csv' to copy from a csv file with only one column ("name") into age and name properties. This will let ID and age be their default values, ID will be set as SERIAL auto incremented starting from 0 (suppose the table is empty), and age will all be set to 0.
Copy to SERIAL property from csv files. We should also allow users to copy from csv files into a SERIAL property. This is to handle the case where a table only contains SERIAL properties. For example, CREATE NODE TABLE n(ID SERIAL, PRIMARY KEY(ID)). In this case, we have the constraint that values in the csv file must match the auto increment pattern.
We need to handle creations of node tuples following deletions, where we reuse deleted node offsets, thus we reuse deleted serial values too.
Consider add the support of SEQUENCE if necessary in the future.

This is still debatable. Let me know if there are any ideas around this.

The text was updated successfully, but these errors were encountered:

andyfengHKU · 2024-04-12T17:09:16Z

Close due to duplication with #1496

sapalli2989 mentioned this issue Oct 29, 2023

INT64 <-> SERIAL conversion fails in Python API #2297

Closed

andyfengHKU closed this as completed Apr 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Full support of SERIAL data type #1628

Full support of SERIAL data type #1628

ray6080 commented Jun 5, 2023

andyfengHKU commented Apr 12, 2024

Full support of SERIAL data type #1628

Full support of SERIAL data type #1628

Comments

ray6080 commented Jun 5, 2023

andyfengHKU commented Apr 12, 2024