Fix Load progress percentage sometimes exceeding 100% #8

Grandro · 2023-10-22T12:27:27Z

Fixes #6

I couldn't reproduce the issue of the loading progress percentage exceeding 100% after the double change, but I added assert statements to catch if it still happens somehow.
This holds true for the backend (and thus also for the frontend)

I have also declared _curRow to be std::atomic as suggested, in case a GeomCache is used by several threads.

Every thread calls petrimaps::Server::handle individually but I realized one single /loadstatus request is handled by every thread (I only sent out one /loadstatus request, placed a breakpoint where the request is handled in the server and saw every thread pause on that line) and would believe that this is not intended behaviour. In HttpServer::handle getReq(connection) returns the "same" request. What does connection refer to here?

…ntend to backend in a fixed time interval. Edited CMakeLists.txt to only capitalize the first character of CMAKE_BUILD_TYPE. Ensured semicolon consistency in script.js.

into one total progress, displays current stage to the user.

Declared _curRow as std::atomic in GeomCache.h

patrickbr · 2023-10-24T10:56:00Z

Nice, thank you!

Every thread calls petrimaps::Server::handle individually but I realized one single /loadstatus request is handled by every thread (I only sent out one /loadstatus request, placed a breakpoint where the request is handled in the server and saw every thread pause on that line) and would believe that this is not intended behaviour. In HttpServer::handle getReq(connection) returns the "same" request. What does connection refer to here?

Hm, I wasn't able to reproduce this. A single /loadstatus request was only handled by a single thread. The handle()-Method in HttpServer is basically an endless loop and is started by each thread. Each thread then waits in line https://github.com/ad-freiburg/util/blob/af1ced1c82539675b8c9d49c19136224e4af07a9/http/Server.cpp#L109 until a Job is available. Jobs are added in the endless loop here: https://github.com/ad-freiburg/util/blob/af1ced1c82539675b8c9d49c19136224e4af07a9/http/Server.cpp#L342 As soon as a job is available, one of the threads grabs it and handles it. The other threads continue waiting.

The general process is:

Threads are added which all use the handle() method to wait for new jobs on the _jobs queue.
An endless loop calls _jobs.add(socket.wait()). Every time a connection is available, socket.wait() returns a socket file descriptor (this is the connection variable you asked for above), and this socket file descriptor is added to _jobs and exactly one thread waiting _jobs.get() is notified that a new job is available.
For this thread,_jobs.get() now returns the file descriptor. For all other threads, _jobs.get() continues blocking.
The job is immediately removed from the queue, so the same two threads cannot get the same socket file descriptor for handling a connection.
The thread which received the file descriptor handles the connection.

hannahbast · 2024-01-12T20:27:02Z

@Grandro Thanks for working on this. While you are at it, it would also be good to clarify in the progress bar which step does what. In particular, it should be clarified that the filling of the geomcache is a one-time-thing and does not have to be done for every query. More concretely, the first step could have a message as follows (center-aligned, it doesn't matter that the message is a bit longer, the subtitle can be in a smaller font though):

Filling the geometry cache
This needs to be done only once for each new version of the dataset
and does not have to be repeated for subsequent queries

Similary, when the server has been restarted and the cached geometries are read from disk, there is currently nothing happening with the progress bar at all. There should be together with a message like this:

Reading cached geometries from disk
This needs to be done only once after the server has been started
and does not have to be repeated for subsequent queries

Concerning the 2/2 phase I have two comments:

I don't think that "Parsing geometry IDs" is a good title because the typical user cannot understand what it means. How about "Fetching [number] geometries", where [number] is the number of geometries with thousand separators? Beware to use the singular in case [number] is 1 (even though the message will only be very briefly visible then, but still).
When the result is large, there is still a significant wait after that where nothing happens with the progress bar. I would propose having a third progress bar with an appropriate title (so three phases overall: 1/3 filling the cache or reading it from disk, 2/3 fetching the geometries, 3/3 rendering result or whatever would be a good description for that last phase).

steps that sometimes take time, but where currently no progress bar is shown on the screen. The fist

Grandro · 2024-02-01T22:14:37Z

@hannahbast Thanks for your concerns and detailed propositions. I was (and still am) busy with the GeoJSON support I wanted to implement, but I will get back to this as soon as that is done.
I have seen you edited your message but I believe it is incomplete: can I just ignore the last line?

added richer descriptions for the loading stages

patrickbr · 2024-06-19T11:13:47Z

This is now merged in master, I did it manually including some refactorings I have done. Thanks!

Grandro and others added 11 commits August 1, 2023 12:16

Added loading bar visually, properly centered message, minor renames

9e21173

Implemented a loading bar by fetching the GeomCache progress from fro…

3da56e6

…ntend to backend in a fixed time interval. Edited CMakeLists.txt to only capitalize the first character of CMAKE_BUILD_TYPE. Ensured semicolon consistency in script.js.

Merge branch 'ad-freiburg:master' into master

bca4bd1

Merge branch 'ad-freiburg:master' into master

afe4ec5

Improved loading bar visuals, merged progress of all loading stages

bc320e5

into one total progress, displays current stage to the user.

Merge branch 'master' of https://github.com/Grandro/qlever-petrimaps

5ab45c8

Merge branch 'ad-freiburg:master' into master

2a3b485

Merge branch 'ad-freiburg:master' into master

9db802f

Merge branch 'ad-freiburg:master' into master

9e5eca4

Added assert statements to validate progress stays <= 100%

56c4d64

Declared _curRow as std::atomic in GeomCache.h

Merge branch 'ad-freiburg:master' into fix-load-percentage

9637bcc

Grandro added 5 commits April 17, 2024 09:19

Handled loading progress for reading cache from disk,

521748e

added richer descriptions for the loading stages

Included number of geometries in loading message

85a2f78

Added loading progress in Server for image generation

17a75b3

Improved reading from disk progress by reading line by line

835df9d

Reverted some debugging stuff.

d3b069d

patrickbr closed this Jun 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Load progress percentage sometimes exceeding 100% #8

Fix Load progress percentage sometimes exceeding 100% #8

Grandro commented Oct 22, 2023

patrickbr commented Oct 24, 2023 •

edited

Loading

hannahbast commented Jan 12, 2024 •

edited

Loading

Grandro commented Feb 1, 2024

patrickbr commented Jun 19, 2024

Fix Load progress percentage sometimes exceeding 100% #8

Fix Load progress percentage sometimes exceeding 100% #8

Conversation

Grandro commented Oct 22, 2023

patrickbr commented Oct 24, 2023 • edited Loading

hannahbast commented Jan 12, 2024 • edited Loading

Grandro commented Feb 1, 2024

patrickbr commented Jun 19, 2024

patrickbr commented Oct 24, 2023 •

edited

Loading

hannahbast commented Jan 12, 2024 •

edited

Loading