Allow creating fragments on a non-existent dataset. #825
Conversation
    """
    ds = self._ds.create_version_from_fragments(new_schema, fragments)
    return LanceDataset(self.uri)
    if isinstance(base_uri, Path):
I forgot: do we need to do any relative-to-absolute conversion or $HOME expansion here, or is that all done at the Rust level?
I am not sure which side does the normalization now.
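If the Python side ends up owning it, the normalization might look like the sketch below. This is a hypothetical helper, not lance's actual code, and the Rust layer may already handle all of this:

```python
import os
from pathlib import Path
from typing import Union

def normalize_uri(base_uri: Union[str, Path]) -> str:
    """Expand ~ and make local paths absolute; leave remote URIs untouched."""
    uri = str(base_uri)
    # URIs with a scheme (s3://, gs://, ...) are passed through unchanged.
    if "://" in uri:
        return uri
    # Local filesystem path: expand $HOME shorthand, then resolve relative parts.
    return os.path.abspath(os.path.expanduser(uri))
```

Doing this once at the Python boundary would keep the Rust side free of platform-specific path handling, but either side works as long as exactly one of them does it.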
    def _create_version_from_fragments(
        self,
    @staticmethod
    def _commit(
OK, so basically the intended usage here is:
- each executor node creates a new fragment
- each fragment gets written to the GS bucket under the lance directory's /data subdir
- calling this _commit (a) creates a new manifest file and (b) updates the _latest.manifest file?
And these fragments can either be created from scratch or by appending a column to an existing fragment?
Yes, that's right. The logic here separates the fragment-preparation step from the final commit step (which makes the new version of the dataset visible).
This control flow can later be reused for appending new data (fragments), deleting fragments, and garbage collection as well.
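The prepare/commit split described above can be illustrated with a toy, filesystem-only sketch. This is not lance's implementation: the `_latest.manifest` and `data/` names are borrowed from the discussion, everything else (JSON fragments, the helper names) is invented purely for illustration:

```python
import json
import os
import tempfile

def write_fragment(root: str, fragment_id: int, rows: list) -> str:
    """Prepare phase: each worker writes its own fragment under data/ independently."""
    os.makedirs(os.path.join(root, "data"), exist_ok=True)
    path = os.path.join(root, "data", f"fragment-{fragment_id}.json")
    with open(path, "w") as f:
        json.dump(rows, f)
    return path

def commit(root: str, fragment_paths: list) -> None:
    """Commit phase: one manifest write makes all prepared fragments visible at once."""
    manifest = {"fragments": fragment_paths}
    with open(os.path.join(root, "_latest.manifest"), "w") as f:
        json.dump(manifest, f)

root = tempfile.mkdtemp()
# Workers (here a loop) prepare fragments in parallel, then a single commit follows.
paths = [write_fragment(root, i, [i, i + 1]) for i in range(3)]
commit(root, paths)
with open(os.path.join(root, "_latest.manifest")) as f:
    visible = json.load(f)["fragments"]
```

The key property is that readers only ever see the manifest, so half-written fragments stay invisible until the single commit step lands.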
    elif isinstance(data, pa.Table):
        reader = data.to_reader()
    elif isinstance(data, pa.dataset.Dataset):
        reader = pa.dataset.Scanner.from_dataset(data).to_reader()
If the user passes in a LanceDataset, it will fall into this case and then fail.
Instead of using the static method, use the Dataset.scanner() (or maybe to_scanner()) API (see the other lance methods for this).
The reason is that pa.dataset.Scanner.from_dataset(...) ends up referring to Dataset private internals specific to the C++ pyarrow implementation (e.g., CDataset or something).
Oh, I was re-using the same code as write_dataset. I can make it a to_scanner(), I guess.
    def test_create_from_fragments(tmp_path: Path):
        table = pa.Table.from_pydict({"a": range(100), "b": range(100)})
        base_dir = tmp_path / "test"
        fragment = lance.fragment.LanceFragment.create(base_dir, 1, table)
Would the input data to create ever be a LanceDataset? I'm wondering if this has the same tokio runtime issue that the read/write APIs have. If so, you may need to convert the input LanceDataset into a pyarrow Table first until we figure out how to deal with that.
At the fragment level? It seems not. Consider this the first step of creating a fragment: it usually just writes in-memory data to disk.
    .ds
    .create_version_from_fragments(&new_schema_with_id, &fragment_metadata)
    .await
    LanceDataset::commit(dataset_uri, &schema, &fragment_metadata).await
Is commit the right terminology here?
So in DB terms, writing all the fragments is the prepare phase of a transaction, and this last step "commits" the change to the dataset (making the fragments visible)?
    let rt = tokio::runtime::Runtime::new()?;
    let metadata = rt.block_on(async {
    let mut batches: Box<dyn RecordBatchReader> = if reader.is_instance_of::<Scanner>()? {
    let scanner: Scanner = reader.extract()?;
Have you tested this case in the if/else here?
    let indices = self.load_indices().await?;
    write_manifest_file(&self.object_store, &mut manifest, Some(indices)).await?;
    let base = self.object_store.base_path().clone();
    write_manifest_file(&object_store, &mut manifest, Some(indices)).await?;
Preserving the indices only makes sense if we're appending rows; here there's no guarantee as to what the input fragments actually represent?
That's fair. Preserving indices is an available option for adding columns as well, though.
Add a check for the same WriteMode here as well.
Addressed comments.
    @staticmethod
    def create(
        dataset_uri: Union[str, Path],
        fragment_id: int,
Does the fragment_id here need to increase sequentially from 1?
It allows users to distributedly create Fragments first, and then commit a Dataset later.