### Quotes Dataset
- **Description:** Collection of 1,638 famous quotes.
- **Data Source:** [micheleriva_1638_quotes_planar_embeddings.parquet](https://www.dropbox.com/scl/fi/hgqxoi9edehwq4d17k3q7/micheleriva_1638_quotes_planar_embeddings.parquet?rlkey=wey433rcicsxkhghhlpwbskwu&dl=1)
  - **Potential columns for visualization:**
    - **X & Y Coordinates:** `x`, `y`
    - **Label:** `quote`

## Get data

### Data parameters

In [1]:
ext = '.parquet'
src = 'https://www.dropbox.com/scl/fi/hgqxoi9edehwq4d17k3q7/micheleriva_1638_quotes_planar_embeddings.parquet?rlkey=wey433rcicsxkhghhlpwbskwu&dl=1'
target_filename = 'micheleriva_1638_quotes_planar_embeddings.parquet'

### Install and import

In [2]:
import os
if not os.getenv('IN_COSMO_DEV_ENV'):
    %pip install -q cosmograph tabled cosmodata

import tabled
import cosmodata

from functools import partial
from cosmograph import cosmo

### Load data

In [3]:
if ext:
    getter = partial(tabled.get_table, ext=ext)
else:
    getter = tabled.get_table
# acquire_data takes care of caching locally too, so next time access will be faster
# (If you want a fresh copy, you can delete the local cache file manually.)
data = cosmodata.acquire_data(src, target_filename, getter=getter)

Fetching data from https://www.dropbox.com/scl/fi/hgqxoi9edehwq4d17k3q7/micheleriva_1638_quotes_planar_embeddings.parquet?rlkey=wey433rcicsxkhghhlpwbskwu&dl=1...
Data cached at: /Users/thorwhalen/.local/share/cosmodata/datasets/micheleriva_1638_quotes_planar_embeddings.parquet.pkl
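
The download-once, reuse-afterwards behaviour described in the comments above can be illustrated with a minimal standard-library sketch. This is not how `cosmodata.acquire_data` is implemented; `fetch_bytes`, `fetch_cached`, and the `cache_dir` argument are hypothetical names introduced only for illustration.

```python
from pathlib import Path
from typing import Callable
from urllib.request import urlopen


def fetch_bytes(url: str) -> bytes:
    """Download the raw bytes at `url` (requires network access)."""
    with urlopen(url) as response:
        return response.read()


def fetch_cached(
    url: str,
    filename: str,
    cache_dir: Path,
    fetch: Callable[[str], bytes] = fetch_bytes,  # hypothetical hook, swappable for testing
) -> Path:
    """Return a local path for `url`, downloading only if no cached copy exists."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    path = cache_dir / filename
    if not path.exists():
        # First access: fetch the data and store it locally.
        path.write_bytes(fetch(url))
    # Later accesses skip the download and reuse the stored file.
    return path
```

With this pattern, deleting the cached file is enough to force a fresh download on the next call, which matches the manual cache reset suggested above.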


## Peep at the data

In [4]:
mode = 'short'  # one of 'short', 'sample', 'stats' (default: 'short')
exclude_cols = []
cosmodata.print_dataframe_info(data, exclude_cols, mode=mode)

DataFrame shape: (1638, 3)
First row
------------------------------------------------------------
quote    Genius is one percent inspiration and ninety-n...
x                                                11.794733
y                                                   5.6216


## Visualize data

### Quotes Scatter Plot

This visualization displays the quotes as a 2D scatter plot, with point positions taken from the numerical columns `x` and `y`. All points have the same size and share a single color (`point_color`), which can be changed for clarity or to categorize the points visually. The quotes are used as point labels (`point_label_by="quote"`), providing context to the visualization.

In [11]:
cosmo(
    data,
    point_x_by="x",
    point_y_by="y",
    point_label_by="quote",
    point_color="#b3b3b3",
    point_id_by="quote",
    show_labels=True,
    # show_hovered_point_label=True,
    # show_dynamic_labels=False,
    fit_view_on_init=True,
)

Cosmograph(background_color=None, components_display_state_mode=None, fit_view_on_init=True, focused_point_rin…