Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Blob I/O #1083

Closed
wants to merge 22 commits into from
Closed

Add Blob I/O #1083

wants to merge 22 commits into from

Conversation

joriszwart
Copy link

Incremental Blob I/O

This PR adds support for Incremental Blob I/O.

Notes

  • Modeled after backup using a driver connection.
  • The interfaces io.Reader and io.Closer have been implemented.
  • Related to issue Blob I/O #239

If this PR makes sense, I'll enhance it with additional io interfaces.

@joriszwart joriszwart marked this pull request as ready for review September 1, 2022 14:39
@mattn
Copy link
Owner

mattn commented Sep 1, 2022

interesting.

@joriszwart joriszwart marked this pull request as draft September 1, 2022 15:10
@joriszwart
Copy link
Author

joriszwart commented Sep 1, 2022

@joriszwart joriszwart marked this pull request as ready for review September 1, 2022 15:32
blob_io.go Outdated Show resolved Hide resolved
blob_io_test.go Outdated Show resolved Hide resolved
blob_io.go Show resolved Hide resolved
blob_io.go Outdated Show resolved Hide resolved
blob_io.go Outdated Show resolved Hide resolved
@joriszwart
Copy link
Author

I have implemented write and seek support. As far as I'm concerned, this pull request is ready for further review. Thanks so far.

@joriszwart
Copy link
Author

joriszwart commented Sep 3, 2022

I was tempted to implement the io.ReaderAt and io.WriterAt interfaces as well, but clients can do that themselves by combining io.Reader (or io.Writer) and io.Seeker:

type Foo struct {
	io.ReadSeeker
}

func (f *Foo) ReadAt(p []byte, off int64) (int, error) {
	_, err := f.Seek(off, io.SeekStart)
	if err != nil {
		return 0, err
	}

	n, err := f.Read(p)
	return n, err
}

blob_io.go Show resolved Hide resolved
blob_io.go Show resolved Hide resolved
blob_io.go Outdated Show resolved Hide resolved
blob_io.go Outdated Show resolved Hide resolved
blob_io_test.go Outdated Show resolved Hide resolved
blob_io.go Outdated
return 0, errors.New("sqlite.SQLiteBlob.Seek: negative position")
}

if abs > math.MaxInt32 {
Copy link
Collaborator

@rittneje rittneje Oct 18, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we instead check if abs > math.MaxInt32 || abs > int64(s.size)?

blob_io.go Outdated Show resolved Hide resolved
joriszwart and others added 2 commits October 19, 2022 09:01
Co-authored-by: rittneje <rittneje@gmail.com>
@joriszwart
Copy link
Author

The Linux builds with libsqlite3 fail with:

=== RUN   TestBlobRead
    blob_io_test.go:46: no such vfs: memdb
--- FAIL: TestBlobRead (0.00s)

I don't know how to fix this.

@graf0
Copy link

graf0 commented Nov 28, 2022

Hello!

The Linux builds with libsqlite3 fail with:

=== RUN   TestBlobRead
    blob_io_test.go:46: no such vfs: memdb
--- FAIL: TestBlobRead (0.00s)

I don't know how to fix this.

Problem is like this:

  • github actions uses ubuntu-latest, which is focal release, version 20.04 (stable release) at time of writing this answer
  • default version of sqlite3 in this release of ubuntu is 3.31.0
  • it's compiled without SQLITE_ENABLE_DESERIALIZE which is required to enable memdb vfs in sqlite verison < 3.36.0

So - vfs=memdb will not work in ubuntu focal. You would need to recompile libsqlite3 deb package and add SQLITE_ENABLE_DESERIALIZE option to make. Without it memdb is just not there. And package libsqlite3-dev do not consist c code of sqlite3 - it's .a and .so files, already precompiled.

This behaviour was changed in sqlite 3.36.0 - SQLITE_ENABLE_DESERIALIZE is defined by default, you need to explicity disable it by defining SQLITE_OMIT_DESERIALIZE option during sqlite compilation.

So, there are imho there are two solution of this problem:

  • use connection string: "file:foobar?mode=memory&cache=shared" - maybe only if version of sqlite3 is < 3.36.0
  • or use connection string ":memory:" and set db.SetMaxOpenConns(1) (mayb followed by db.Ping() to create conn?) to limit all access to the same memory database on all versions of sqlite, including sqlite3

This problem will disappear after ubuntu will release next LTS version, and github action will change ubuntu-latest to this LTS version - there should be at least sqlite3 3.37.0 there.

@graf0
Copy link

graf0 commented Nov 28, 2022

sugested fix (will work with any version of sqlite):

diff --git a/blob_io_test.go b/blob_io_test.go
index 3e6fb91..b98df8d 100644
--- a/blob_io_test.go
+++ b/blob_io_test.go
@@ -25,12 +25,14 @@ var _ io.Closer = &SQLiteBlob{}
 type driverConnCallback func(*testing.T, *SQLiteConn)
 
 func blobTestData(t *testing.T, dbname string, rowid int64, blob []byte, c driverConnCallback) {
-       db, err := sql.Open("sqlite3", "file:/"+dbname+"?vfs=memdb")
+       db, err := sql.Open("sqlite3", ":memory:")
        if err != nil {
                t.Fatal(err)
        }
        defer db.Close()
 
+       db.SetMaxOpenConns(1)
+
        // Test data
        query := `
                CREATE TABLE data (

@joriszwart
Copy link
Author

@rittneje do you agree with the proposed fix?

@rittneje
Copy link
Collaborator

@joriszwart In this particular case :memory: should work. However, the need to throttle the connection pool to one can cause some issues in general. If you make that change, you should also add comments explaining it is specifically to support older SQLite versions and developers should always use file:/<name>?vfs=memdb instead whenever possible. (Developers may reference these test cases to see how to use the blob i/o feature.)

Really I think we need a better approach for dealing with testing libsqlite3 in general. But that is an issue beyond the scope of your changes.

@jlelse
Copy link

jlelse commented Feb 3, 2023

Any updates on this? It looks very interesting and enables "streaming" to and from the database! 👍

@joriszwart
Copy link
Author

@rittneje Is there anything I can do to get this approved?

blob_io.go Show resolved Hide resolved
blob_io.go Show resolved Hide resolved
n = len(b)
}

if n != len(b) {
Copy link
Collaborator

@rittneje rittneje Mar 27, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Trying to remember - is there a reason not to do this check after the call to sqlite3_blob_write and only write what we can instead of nothing? I guess the current implementation is consistent with what sqlite3_blob_write internally does.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@joriszwart following up on this

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't know what to do. Sorry.

@rittneje
Copy link
Collaborator

@joriszwart I am away from my dev machine or I'd just add those two len checks myself. They should be pretty simple to add.

@joriszwart
Copy link
Author

@joriszwart I am away from my dev machine or I'd just add those two len checks myself. They should be pretty simple to add.

Can you add them?

@joriszwart
Copy link
Author

joriszwart commented Aug 21, 2023

Anyone else? @graf0 @pokstad @jlelse @lezhnev74 @mitar?

blob_io_test.go Outdated

func blobTestData(t *testing.T, dbname string, rowid int64, blob []byte, c driverConnCallback) {

// TODO use :memory: for compatibility with SQLite versions < 3.37.0.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@joriszwart please resolve this TODO

if err != nil {
t.Fatal(err)
}
defer driverConn.Close()
Copy link
Collaborator

@rittneje rittneje Oct 22, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove. This is superfluous with the call to conn.Close() above. (And also you aren't supposed to do anything with the conn outside the Raw callback.)

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove those 4 lines? Or only the last?

@joriszwart
Copy link
Author

@rittneje can I leave the suggested changes to you? That would make this process more efficient.

jrossi added a commit to jrossi/go-sqlite3 that referenced this pull request Mar 27, 2024
Pulling in this RP for Blob access
@joriszwart joriszwart closed this May 24, 2024
@gabriel-samfira
Copy link

A shame this was abandoned. Thanks for trying @joriszwart !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants