[Data] TypeError when writing data #40480

Closed
bveeramani opened this issue Oct 19, 2023 · 0 comments · Fixed by #40481
Labels
bug: Something that is supposed to be working; but isn't
data: Ray Data-related issues
P1: Issue that should be fixed within a few weeks

Comments

@bveeramani
Member

What happened + What you expected to happen

❯ python repro.py 
2023-10-18 20:27:48,612 INFO worker.py:1668 -- Started a local Ray instance.
2023-10-18 20:27:49,136 INFO streaming_executor.py:104 -- Executing DAG InputDataBuffer[Input] -> TaskPoolMapOperator[ReadRange->Write]
2023-10-18 20:27:49,136 INFO streaming_executor.py:105 -- Execution config: ExecutionOptions(resource_limits=ExecutionResources(cpu=None, gpu=None, object_store_memory=None), locality_with_output=False, preserve_order=False, actor_locality_enabled=True, verbose_progress=False)
2023-10-18 20:27:49,136 INFO streaming_executor.py:107 -- Tip: For detailed progress reporting, run `ray.data.DataContext.get_current().execution_options.verbose_progress = True`
2023-10-18 20:27:49,523 ERROR worker.py:403 -- Unhandled error (suppress with 'RAY_IGNORE_UNHANDLED_ERRORS=1'): ray::_StatsActor.update_metrics() (pid=58597, ip=127.0.0.1, actor_id=0c85fc131d5d81c0a6fe236801000000, repr=<ray.data._internal.stats._StatsActor object at 0x104a6f290>)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/balaji/Documents/GitHub/ray/python/ray/data/_internal/stats.py", line 225, in update_metrics
    self.bytes_allocated.set(stats["obj_store_mem_alloc"], tags)
  File "/Users/balaji/Documents/GitHub/ray/python/ray/util/metrics.py", line 332, in set
    raise TypeError(f"value must be int or float, got {type(value)}.")
TypeError: value must be int or float, got <class 'numpy.int64'>.
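
For context on the error message: `numpy.int64` is not a subclass of Python's built-in `int`, so a type check against `(int, float)` rejects it even though the value is a perfectly good integer. The sketch below illustrates that behavior and one possible way to normalize the value before handing it to a metric. It assumes the check in `ray/util/metrics.py` is an `isinstance`-style test (only the `raise` line is visible in the traceback), and `to_python_number` is a hypothetical helper name, not part of Ray and not necessarily how #40481 resolves this.

```python
import numpy as np


def to_python_number(value):
    """Hypothetical helper: convert NumPy scalars to built-in Python numbers.

    Values that are not NumPy scalars are returned unchanged.
    """
    if isinstance(value, np.generic):
        # .item() returns the closest built-in Python scalar (int, float, ...).
        return value.item()
    return value


# A stand-in for stats["obj_store_mem_alloc"]; the real value comes from Ray.
bytes_allocated = np.int64(1024)

# np.int64 does not inherit from int, so an (int, float) check fails on it.
print(isinstance(bytes_allocated, (int, float)))

# After conversion the value is a plain Python int and passes the same check.
converted = to_python_number(bytes_allocated)
print(type(converted), isinstance(converted, (int, float)))
```

Calling `int(value)` at the call site would work just as well for this one metric; the helper form is only useful if several stats fields may carry NumPy scalars.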

Versions / Dependencies

779c08a

Reproduction script

import ray

ray.data.range(1, parallelism=1).write_parquet("/tmp")

Issue Severity

High: It blocks me from completing my task.

@bveeramani added the bug, P1, and data labels on Oct 19, 2023