read data from hdfs #1
Distributing data to the cluster is not supported in PaddlePaddle yet. You can read data directly from an HDFS file path via PyDataProvider2. PaddlePaddle does not handle fetching the data file remotely; it simply passes the file path string into a Python function. It is the user's job to OPEN that file (or SQL connection string, or HDFS path) and read each record. Contributions of a script that distributes data to the cluster are welcome, or we may add this ourselves if the feature turns out to be widely needed.
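A minimal sketch of the pattern described above: the framework passes the file path string through unchanged, and the user-supplied provider function is responsible for opening it. The function name `process` and the tab-separated record format are illustrative assumptions, and a plain local file stands in for an HDFS path; in real use you would open an `hdfs://` URI inside the function with an HDFS client library of your choice.

```python
import os
import tempfile

def process(settings, file_path):
    """User-defined provider: open the path yourself and yield one record per line.

    In a cluster setup, replace the plain open() with an HDFS client call;
    PaddlePaddle only hands you the path string, it does not fetch the file.
    """
    with open(file_path) as f:  # swap in an HDFS client read here
        for line in f:
            label, text = line.rstrip("\n").split("\t", 1)
            yield int(label), text

# Demo with a stand-in local file.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as tmp:
    tmp.write("0\tnegative example\n1\tpositive example\n")
    path = tmp.name

records = list(process(settings=None, file_path=path))
os.unlink(path)
print(records)
```

The key design point is that the provider is just a generator over records, so any source that can be opened from Python (local disk, HDFS, a database cursor) plugs in without framework changes.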
Since you haven't replied for more than a year, we have closed this issue/PR.
"Different node should owns different parts of all Train data. This simple script did not do this job, so you should prepare it at last. " I saw this in cluster training wiki. So, could paddle read data from hdfs and distribute data to each node automatically?