The current implementation of reading rows from `RowContainer` is not fast enough. The following function is used to dump a chunk:

```go
// GetRowAndAppendToChunk gets a Row from the ListInDisk by RowPtr.
// Return the Row and the Ref Chunk.
func (l *ListInDisk) GetRowAndAppendToChunk(ptr RowPtr, chk *Chunk) (row Row, _ *Chunk, err error) {
	off, err := l.getOffset(ptr.ChkIdx, ptr.RowIdx)
	if err != nil {
		return
	}
	r := l.dataFile.getSectionReader(off)
	format := rowInDisk{numCol: len(l.fieldTypes)}
	_, err = format.ReadFrom(r)
	if err != nil {
		return row, nil, err
	}
	row, chk = format.toRow(l.fieldTypes, chk)
	return row, chk, err
}
```
It would be better to pipeline `format.ReadFrom` (which is blocked by disk I/O) and `format.toRow` (which is blocked by CPU).
Also, assuming a whole chunk is stored together contiguously on disk, we can fetch the offset of the first row once and reuse it for the entire chunk, instead of looking up an offset per row.
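The proposed pipelining can be sketched with a goroutine and a buffered channel: one stage streams serialized rows off disk while another decodes them concurrently. This is a minimal illustration, not TiDB's actual implementation; `rawRow`, `decodedRow`, `readRows`, and `decodeRows` are hypothetical stand-ins for the `rowInDisk.ReadFrom` / `format.toRow` stages.

```go
package main

import "fmt"

// rawRow stands in for the serialized bytes that the disk-bound
// ReadFrom stage produces (hypothetical, simplified).
type rawRow []byte

// decodedRow stands in for the result of the CPU-bound toRow stage.
type decodedRow string

// readRows simulates the disk-bound stage: it streams raw rows into a
// channel so that decoding can proceed while the next read is in flight.
func readRows(data []rawRow) <-chan rawRow {
	out := make(chan rawRow, 4) // small buffer lets the reader run ahead
	go func() {
		defer close(out)
		for _, r := range data {
			out <- r
		}
	}()
	return out
}

// decodeRows simulates the CPU-bound stage, consuming rows as they arrive.
func decodeRows(in <-chan rawRow) []decodedRow {
	var rows []decodedRow
	for r := range in {
		rows = append(rows, decodedRow(r))
	}
	return rows
}

func main() {
	data := []rawRow{rawRow("a"), rawRow("b"), rawRow("c")}
	fmt.Println(decodeRows(readRows(data)))
}
```

The buffered channel decouples the two stages, so disk waits and decoding overlap instead of alternating as they do in the sequential `GetRowAndAppendToChunk` above.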
Related commits:

- ab4c06a — util/chunk: optimize (*ListInDisk).GetChunk and add a fast row container reader (#45130), close #45125
- aa4084e — util/chunk: optimize (*ListInDisk).GetChunk and add a fast row container reader (#45130) (#45203), close #45125
- a431496 — util/chunk: optimize (*ListInDisk).GetChunk and add a fast row container reader (#45130) (#45204), close #45125

Label: Enhancement