Add a BufferedReader and allow BufferedWriter to handle partial writes and errors after some data was written. #13390

klondi · 2024-01-09T07:56:16Z

The commits should already describe much of what is going on but in summary.

Fix a missing include in py/mpconfig.h
Update BufferedWriter to handle partial writes and errors after some data is written.
Add a BufferedReader class handling also partial reads and errors after some data is read.

This is my first time sending code to micropython so I hope I did not make any mistakes.

(Also in case it is necessary, yes the code is mine, it is owned by me and not by any of my employers and I agree to release it under the MIT License).

Signed-off-by: Francisco Blas (klondike) Izquierdo Riera <klondike@klondike.es>

To simplify the logic create bufwriter_do_write. This function keeps track of the data left on self->len and, if a partial write happens, this function uses memmove to move the remaining data back to the beginning of the buffer. This allows simplifying bufwriter_write and bufwriter_flush significantly. bufwriter_flush now only needs to call this function if the buffer has some data stored and check for any error codes. Additionally, if the buffer is only partially flushed we notify the user too (so that they know there might be data left). bufwriter_write now just needs to call this function whenever the buffer gets full and copy the input into the buffer. It will return when either no data is written at all or when all of the input is consumed. In the case of a partial write it returns exactly the amount of data which was written. Additionally allow caching of errors to better handle partial writes. Until now if an error occurred during the write, the error would be raised and the caller had no way to know if any data was written at all (for example in prior calls if more than one block of data was passed as input). Now when we have written out some data and an error happens, we reset the buffer to the state it would have if it did not contain the data that was not written (and which was not buffered previously), and then, we return the data that was written (if any) or raise an error if no data from the input was written. This allows the programmer better control of writes. In particular, the programmer will know exactly how much of its last input data was written, consequently allowing it to handle whatever data left to be written in a better way. Signed-off-by: Francisco Blas (klondike) Izquierdo Riera <klondike@klondike.es>

Some times there is a need for a BufferedReader in a way similar to how BufferedWritter works. A clear example is when using an underlying device requiring aligned reads, but a less clear example is when using deflate.DeflateIO which will do only 1-byte reads and can become crippling quickly when the underlying object is a python implemented stream instead of a native one. The BufferedReader will only attempt to do full-buffer reads and ensures word-alignment in a way similar to how the writer does. Similarly, it will also hide any errors when partial reads happen to ensure that any data copied so far can be returned first. Signed-off-by: Francisco Blas (klondike) Izquierdo Riera <klondike@klondike.es>

github-actions · 2024-01-09T08:03:38Z

Code size report:

   bare-arm:    +0 +0.000% 
minimal x86:    +0 +0.000% 
   unix x64:    +0 +0.000% standard
      stm32:    +0 +0.000% PYBV10
     mimxrt:    +0 +0.000% TEENSY40
        rp2:    +0 +0.000% RPI_PICO
       samd:    +0 +0.000% ADAFRUIT_ITSYBITSY_M4_EXPRESS

klondi · 2024-01-09T08:05:42Z

I have fixed the spelling error, I am unsure what should I do with the Signed-Of-By headers.

codecov · 2024-01-09T08:30:11Z

Codecov Report

Attention: 32 lines in your changes are missing coverage. Please review.

Comparison is base (2ed976f) 98.36% compared to head (3ce4d08) 98.22%.
Report is 8 commits behind head on master.

Files	Patch %	Lines
py/modio.c	43.85%	32 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #13390      +/-   ##
==========================================
- Coverage   98.36%   98.22%   -0.15%     
==========================================
  Files         159      159              
  Lines       21088    21128      +40     
==========================================
+ Hits        20743    20752       +9     
- Misses        345      376      +31

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

stinos · 2024-01-09T09:01:30Z

py/misc.h

@@ -33,6 +33,8 @@
 #include <stdbool.h>
 #include <stdint.h>
 #include <stddef.h>
+// Needed for NORETURN
+#include "py/mpconfig.h"


Don't think this is explicitly documented, but see the rest of the code (grep for all occurrences of misch.h): the idea is that any file needing to include py/misc.h first includes py/mpconfig.h, so this change should not be made.

stinos · 2024-01-09T09:03:39Z

py/modio.c

+// Writes out the data stored in the buffer so far
+STATIC int bufwriter_do_write(mp_obj_bufwriter_t *self) {
+    int rv = 0;
+    // This cannot return 0 without an error


Can you elaborate on what you mean with this? Like: why would it be an issue for this bit of code if it were 0?

stinos · 2024-01-09T09:05:26Z

py/modio.c

 STATIC mp_uint_t bufwriter_write(mp_obj_t self_in, const void *buf, mp_uint_t size, int *errcode) {
    mp_obj_bufwriter_t *self = MP_OBJ_TO_PTR(self_in);

    mp_uint_t org_size = size;
+    // Alloc should always remain the same so cache it.


Minor, but here the sentence ends with a dot (imo best), but not the other comments. Also the comment below is after its statement instead of on the line above it.

stinos · 2024-01-09T09:07:17Z

Would it be possible to add tests? The code is complex enogh to warrant testing all possible edge cases imo.

projectgus · 2024-03-07T23:48:31Z

This is an automated heads-up that we've just merged a Pull Request
that removes the STATIC macro from MicroPython's C API.

See #13763

A search suggests this PR might apply the STATIC macro to some C code. If it
does, then next time you rebase the PR (or merge from master) then you should
please replace all the STATIC keywords with static.

Although this is an automated message, feel free to @-reply to me directly if
you have any questions about this.

klondi added 3 commits January 9, 2024 08:50

py/misc: Include py/mpconfig.h which defines NORETURN.

2d1e8f7

Signed-off-by: Francisco Blas (klondike) Izquierdo Riera <klondike@klondike.es>

klondi force-pushed the bufferedwriter branch from a98fb6a to 3ce4d08 Compare January 9, 2024 08:04

stinos reviewed Jan 9, 2024

View reviewed changes

dpgeorge added the py-core label Jan 16, 2024

Gadgetoid mentioned this pull request Feb 29, 2024

global: Remove the STATIC macro. #13763

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a BufferedReader and allow BufferedWriter to handle partial writes and errors after some data was written. #13390

Add a BufferedReader and allow BufferedWriter to handle partial writes and errors after some data was written. #13390

klondi commented Jan 9, 2024

github-actions bot commented Jan 9, 2024

klondi commented Jan 9, 2024

codecov bot commented Jan 9, 2024 •

edited

Loading

stinos Jan 9, 2024

stinos Jan 9, 2024

stinos Jan 9, 2024

stinos commented Jan 9, 2024

projectgus commented Mar 7, 2024

Add a BufferedReader and allow BufferedWriter to handle partial writes and errors after some data was written. #13390

Are you sure you want to change the base?

Add a BufferedReader and allow BufferedWriter to handle partial writes and errors after some data was written. #13390

Conversation

klondi commented Jan 9, 2024

github-actions bot commented Jan 9, 2024

klondi commented Jan 9, 2024

codecov bot commented Jan 9, 2024 • edited Loading

Codecov Report

stinos Jan 9, 2024

Choose a reason for hiding this comment

stinos Jan 9, 2024

Choose a reason for hiding this comment

stinos Jan 9, 2024

Choose a reason for hiding this comment

stinos commented Jan 9, 2024

projectgus commented Mar 7, 2024

codecov bot commented Jan 9, 2024 •

edited

Loading