Where can I get files named 'train_1000' and 'test_1000'？ #5

yeahQing · 2021-07-26T11:36:54Z

I don`t understand why I should use split().

for dataset_root in config['train_dataset'].split(',')

yeahQing · 2021-07-26T13:30:43Z

How can I create a lmdb dataset for Chinese character?

JingyeChen · 2021-07-27T01:50:52Z

The link of the dataset is shown in http://www.nlpr.ia.ac.cn/databases/handwriting/Home.html

yeahQing · 2021-07-27T04:45:58Z

The link of the dataset is shown in http://www.nlpr.ia.ac.cn/databases/handwriting/Home.html

Hi, Chen, thanks for your reply. I have downloaded the dataset, but I don’t understand why a string is looped here. The code is:

def get_data_package():
    train_dataset = []
    # 'train_dataset': './data/mydata/train_1000' why loop this path?
    for dataset_root in config['train_dataset'].split(','):
        _, dataset = get_dataloader(dataset_root, shuffle=True)
        train_dataset.append(dataset)

yeahQing · 2021-07-27T04:50:07Z

What type of data set should I replace the path './data/mydata/train_1000'?

JingyeChen · 2021-07-27T05:16:55Z

The link of the dataset is shown in http://www.nlpr.ia.ac.cn/databases/handwriting/Home.html

Hi, Chen, thanks for your reply. I have downloaded the dataset, but I don’t understand why a string is looped here. The code is:
def get_data_package():
    train_dataset = []
    # 'train_dataset': './data/mydata/train_1000' why loop this path?
    for dataset_root in config['train_dataset'].split(','):
        _, dataset = get_dataloader(dataset_root, shuffle=True)
        train_dataset.append(dataset)

A loop is used to concatenate multiple datasets. For example, the dataset can be formulated in this way:

'train_dataset': './data/mydata/train_1000,./data/mydata/train_1500,./data/mydata/train_2000'

JingyeChen · 2021-07-27T05:17:11Z

What type of data set should I replace the path './data/mydata/train_1000'?

The format should be lmdb

yeahQing · 2021-07-28T01:50:26Z

Thank you very much, it has helped me a lot!

cptbtptp125 · 2022-07-03T14:52:51Z

Hello, have you successfully converted LMDB format? I want to know how to convert, I have tried many methods without success

yeahQing · 2022-09-08T15:42:13Z

Hello, have you successfully converted LMDB format? I want to know how to convert, I have tried many methods without success

Hi, you can see in #57.

cptbtptp125 · 2022-11-28T02:38:18Z

Hello, I am a little confused about the loop connection of multiple data sets, may I ask why this operation is carried out, and what is the difference between it and the direct single training? Thank you very much for your reply. I would appreciate it if you could help me.
'train_dataset': './data/mydata/train_1000,./data/mydata/train_1500,./data/mydata/train_2000'

yeahQing closed this as completed Jul 26, 2021

yeahQing reopened this Jul 26, 2021

yeahQing closed this as completed Jul 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Where can I get files named 'train_1000' and 'test_1000'？ #5

Where can I get files named 'train_1000' and 'test_1000'？ #5

yeahQing commented Jul 26, 2021

yeahQing commented Jul 26, 2021

JingyeChen commented Jul 27, 2021

yeahQing commented Jul 27, 2021

yeahQing commented Jul 27, 2021

JingyeChen commented Jul 27, 2021

JingyeChen commented Jul 27, 2021

yeahQing commented Jul 28, 2021

cptbtptp125 commented Jul 3, 2022

yeahQing commented Sep 8, 2022

cptbtptp125 commented Nov 28, 2022

Where can I get files named 'train_1000' and 'test_1000'？ #5

Where can I get files named 'train_1000' and 'test_1000'？ #5

Comments

yeahQing commented Jul 26, 2021

yeahQing commented Jul 26, 2021

JingyeChen commented Jul 27, 2021

yeahQing commented Jul 27, 2021

yeahQing commented Jul 27, 2021

JingyeChen commented Jul 27, 2021

JingyeChen commented Jul 27, 2021

yeahQing commented Jul 28, 2021

cptbtptp125 commented Jul 3, 2022

yeahQing commented Sep 8, 2022

cptbtptp125 commented Nov 28, 2022