feat: Deprecate `datasets` module, rename to `inferences` #2785

anticorrelator · 2024-04-05T18:37:00Z

Resolves #2732

Deprecates the phoenix.datasets module; its interfaces are now available under phoenix.inferences
px.Dataset class renamed to px.Inference
px.ExampleDatasets class renamed to px.ExampleInferences
Dataset.from_open_inference deprecated and removed from Inference
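
A minimal sketch of the kind of backward-compatible alias this description implies, assuming the old name is kept as a thin subclass that warns on use. The classes below are stand-ins written for illustration, not Phoenix's actual implementation, and the warning text is hypothetical.

```python
import warnings


class Inference:
    """Stand-in for the renamed class (illustrative only, not Phoenix's code)."""

    def __init__(self, name: str) -> None:
        self.name = name


class Dataset(Inference):
    """Deprecated alias: warns on construction, then behaves like Inference."""

    def __init__(self, name: str) -> None:
        # Hypothetical message; stacklevel=2 points the warning at the caller.
        warnings.warn(
            "Dataset is deprecated, use Inference instead",
            DeprecationWarning,
            stacklevel=2,
        )
        super().__init__(name)


if __name__ == "__main__":
    # Record warnings so the deprecation can be inspected instead of printed.
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        legacy = Dataset("primary")
    print(isinstance(legacy, Inference), caught[0].category.__name__)
```

With this shape, existing code that constructs the old class keeps working while callers are nudged toward the new name.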

review-notebook-app · 2024-04-05T18:37:05Z

Check out this pull request on ReviewNB.

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

axiomofjoy · 2024-04-05T19:59:27Z

tests/server/api/types/test_dataset.py

@@ -2,8 +2,8 @@

 import pytest
 from pandas import DataFrame, Timestamp
-from phoenix.datasets.dataset import Dataset as InternalDataset
-from phoenix.datasets.dataset import Schema
+from phoenix.inferences.inference import Inference as InternalDataset


Rename to InternalInference.

axiomofjoy · 2024-04-05T20:02:51Z

src/phoenix/core/model.py

-from phoenix.datasets.dataset import Dataset
-from phoenix.datasets.schema import EmbeddingColumnNames, EmbeddingFeatures
+from phoenix.inferences.inference import Inference
+from phoenix.inferences.schema import EmbeddingColumnNames, EmbeddingFeatures

 from .embedding_dimension import EmbeddingDimension


 def _get_embedding_dimensions(


I think we can rename these arguments for clarity.

anticorrelator · 2024-04-05

I think I'm going to leave the renames alone. This is a big refactor and I'm already having a tough time keeping it all straight.

axiomofjoy · 2024-04-05T20:03:26Z

src/phoenix/core/model_schema_adapter.py

-def _is_dataset(obj: Optional[Dataset]) -> TypeGuard[Dataset]:
-    return type(obj) is Dataset
+def _is_dataset(obj: Optional[Inference]) -> TypeGuard[Inference]:
+    return type(obj) is Inference


These probably deserve to be renamed.

axiomofjoy · 2024-04-05T20:05:00Z

src/phoenix/server/main.py

@@ -114,9 +114,9 @@ def _load_items(
    trace_dataset_name: Optional[str] = None
    simulate_streaming: Optional[bool] = None

-    primary_dataset: Dataset = EMPTY_DATASET


Probably can be renamed.

axiomofjoy · 2024-04-05T20:05:50Z

tests/datasets/test_inference.py

    _normalize_timestamps,
    _parse_dataframe_and_schema,
 )
-from phoenix.datasets.errors import DatasetError
-from phoenix.datasets.schema import (
+from phoenix.inferences.schema import (


Looks like the tests in this file are still named after datasets.

axiomofjoy · 2024-04-05T20:07:01Z

tests/server/api/types/test_inference.py

-        primary_dataset: InternalDataset,
-        reference_dataset: InternalDataset,
+        primary_dataset: InternalInference,
+        reference_dataset: InternalInference,


Looks like this file still has a lot of dataset names.

axiomofjoy · 2024-04-05T20:08:07Z

src/phoenix/inferences/fixtures.py

+NAME_TO_FIXTURE = {fixture.name: fixture for fixture in FIXTURES}
+
+
+def get_datasets(


This can be renamed.

axiomofjoy · 2024-04-05T20:08:16Z

src/phoenix/inferences/fixtures.py

+    no_internet: bool = False,
+) -> Tuple[Inference, Optional[Inference], Optional[Inference]]:
+    """
+    Downloads primary and reference datasets for a fixture if they are not found


axiomofjoy · 2024-04-05T20:09:51Z

src/phoenix/inferences/inference.py

+SchemaLike: TypeAlias = Any
+
+
+class Inference:


Should this be plural because it represents multiple inferences?

axiomofjoy

Inference vs. Inferences?

Just checking the diff, I'm still seeing a large number of old dataset names.

mikeldking

I think it should be plural. Also, can we cascade the notebook changes? We need to wait until the release. You will also have to pin the notebooks' lower bound.

mikeldking

Approving to unblock


anticorrelator added 2 commits April 5, 2024 03:12

Initial refactor of Datasets -> Inferences

c445ffc

Add deprecation warnings to "datasets" public interface

9c29507

dosubot bot added the size:XXL label (this PR changes 1000+ lines, ignoring generated files) Apr 5, 2024

RogerHYang approved these changes Apr 5, 2024


anticorrelator added 2 commits April 5, 2024 15:50

Fix type signature

2196e30

Ensure ExampleDatasets still exists

cca7e34

anticorrelator changed the title ~~chore: Deprecate datasets module, rename to inferences~~ feat: Deprecate datasets module, rename to inferences Apr 5, 2024

Use multiline string

4f8ca27

axiomofjoy reviewed Apr 5, 2024


Rename test files

2ab294b

axiomofjoy reviewed Apr 5, 2024


axiomofjoy approved these changes Apr 5, 2024


anticorrelator and others added 2 commits April 5, 2024 16:24

Remove test for Inference.from_open_inference

07f39db

exclude manual instrumentation example from type checks

69433b1

mikeldking requested changes Apr 5, 2024


mikeldking approved these changes Apr 5, 2024


anticorrelator added 5 commits April 5, 2024 18:54

Revert changes to notebooks

5698b32

Merge branch 'dustin/deprecate-datasets' of github.com:Arize-ai/phoenix into dustin/deprecate-datasets

6ea4102

Rename Inference -> Inferences

1446a3f

Fix line length

7f91187

Merge remote-tracking branch 'origin' into dustin/deprecate-datasets

2612b08

anticorrelator merged commit 4987ea3 into main Apr 8, 2024
11 checks passed

anticorrelator deleted the dustin/deprecate-datasets branch April 8, 2024 15:54

github-actions bot mentioned this pull request Apr 8, 2024

chore(main): release arize-phoenix 3.20.0 #2803

Merged

github-actions bot mentioned this pull request May 9, 2024

chore(main): release arize-phoenix 5.0.0 #3134

Closed

mikeldking mentioned this pull request May 9, 2024

chore(main): release arize-phoenix 4.0.0 #3143

Merged
