[stdlib] Implementation of rmtree #2439

artemiogr97 · 2024-04-28T18:17:23Z

#2430 must be merged first

abduld

Pretty cool. Please make sure about the Int types though

stdlib/src/os/os.mojo

Signed-off-by: Artemio Garza Reyna <artemiogr97@gmail.com>

laszlokindrat

Thanks for the patch! This could be very nice to have, but there are a bunch of library design questions, as well as implementation details that we should iron out. My main question at this point: do we actually want this right now, given that Python interop already allows one to do this?

While we work on these, I have a couple of suggestions that could help moving this forward: 1) we could use an os.makedirs with an exist_ok argument; 2) we could explore (maybe in a design doc) if we could start a tempfile module to build some testing utilities. @artemiogr97 Would you be interested in working on either of these?

CC: @ConnorGray @rparolin @JoeLoser

laszlokindrat · 2024-05-15T01:19:19Z

stdlib/src/shutil/shutil.mojo

+from os.path import islink, isdir, isfile
+
+
+fn rmtree(path: String, ignore_errors: Bool = False) raises:


@JoeLoser @ConnorGray Do you have any opinions if we should make ignore_errors keyword-only? Python doesn't do this, but it's not clear if that's just a result of backward compatibility. Otherwise, I always feel a bit strange about flags that can be specified positionally.

laszlokindrat · 2024-05-15T01:23:55Z

stdlib/src/shutil/shutil.mojo

+
+    Args:
+      path: The path to the directory.
+      ignore_errors: Whether to ignore errors.


Idea: Once we have parametric raising, we could introduce overloads that have this as a parameter. In the meanwhile, would it be beneficial to have a separate function that ignores errors and never raises?

I think that could make sense for now

laszlokindrat · 2024-05-15T01:29:44Z

stdlib/test/shutil/test_shutil.mojo

+    var cwd_path = Path()
+    var my_dir_path = cwd_path / "my_dir"


I'm not so sure about this. It would be nice if we had some tempfile-like utilities to create these ephemeral paths. At the same time, tempfile should probably depend on shutil, not the other way around.
Idea 1: Maybe we could bring up some basic tempfile utilities so that we can at least have a platform-independent temporary path.
Idea 2: we could just use python interop in these tests while we get there.

I guess python interop is the best option for now, one idea to prevent testing shutil with a potential tempfile module (which logically would depend on shutil) would be to use something like subprocess.run and call something like rm -rf my_dir, but it does not seem like a short-term solution

laszlokindrat · 2024-05-15T01:33:32Z

stdlib/test/shutil/test_shutil.mojo

+    mkdir(my_dir_path)
+    _create_some_files_and_dirs(my_dir_path)
+
+    rmtree(my_dir_path)


Testing these kinds of functions can actually be tricky: what if the test fails for some reason and the directory isn't deleted? The next time the test is run, mkdir can fail. One way to fix this is by doing this in a context manager, like tempfile.TemporaryDirectory. In lieu of that, some manual cleanup logic could also work.

yes, it's tricky, but doing some manual cleanup is not that simple in this case, what if what is failing is rmdir/unlink ? obviously rmtree is relying on those functions so what else could be done?

we have the same "issue" in test_remove and test_mkdir_and_rmdir

artemiogr97 · 2024-05-15T21:36:58Z

My main question at this point: do we actually want this right now, given that Python interop already allows one to do this?

I agree that maybe it is a bit soon to start implementing a new module such as shutil (or an equivalent).
At the beginning I wanted to implement #2018 but then I soon realized that a lot of the basic functionalities such as os.remove/os.unlink, os.mkdir, os.rmdir where missing so I started implementing them to finish #2352, and still there is a lot of stuff missing os.makedirs, os.join, os.abspath, ...

While we work on these, I have a couple of suggestions that could help moving this forward: 1) we could use an os.makedirs with an exist_ok argument; 2) we could explore (maybe in a design doc) if we could start a tempfile module to build some testing utilities. @artemiogr97 Would you be interested in working on either of these?

for your point 1, yes, I would like to work on that, everything that could be done to "complete" the os module makes a lot of sense, but I have some doubts:

should I care about windows compatibility at some point?
should we start creating the equivalents of posix/nt modules to start handling os specific stuff?
if windows is to be taken into account it is going to be hard to make sure that everything is ok since mojo is not available in windows yet

for 2 I'm not sure what there is to explore/discuss about a tempfile module, the equivalent python module just contains a couple of functions/classes but nothing really out of the ordinary, most of the functionality is already in the PR that I mentioned above but it depends on rmtree for now, so if needed I could modify it to remove rmtree and use it as a testing module for now

laszlokindrat · 2024-05-16T02:01:30Z

While we work on these, I have a couple of suggestions that could help moving this forward: 1) we could use an os.makedirs with an exist_ok argument; 2) we could explore (maybe in a design doc) if we could start a tempfile module to build some testing utilities. @artemiogr97 Would you be interested in working on either of these?

for your point 1, yes, I would like to work on that, everything that could be done to "complete" the os module makes a lot of sense, ...

That's awesome, thank you! Please ping me if you have PRs for these, I'm happy to review. I'm assuming these os utilities shouldn't depend on shutil, right (i.e. the dependency is the other way around)?

... but I have some doubts:

should I care about windows compatibility at some point?

For now, don't worry about Windows. The existing code in os would already break on native Windows. But during the implementation, feel free to leave TODOs in places where you think some special handling for Windows might be needed.

should we start creating the equivalents of posix/nt modules to start handling os specific stuff?

I would be lazy about this stuff. If the complexity of handling multiple platforms within os/sys (as well as user code) would grow to the point where it makes sense, we can introduce these, but I don't think we're there yet.

if windows is to be taken into account it is going to be hard to make sure that everything is ok since mojo is not available in windows yet

Yep, don't worry about this for now.

for 2 I'm not sure what there is to explore/discuss about a tempfile module, the equivalent python module just contains a couple of functions/classes but nothing really out of the ordinary, most of the functionality is already in the PR that I mentioned above but it depends on rmtree for now, so if needed I could modify it to remove rmtree and use it as a testing module for now

I see now that you already did quite a bit of work on tempfile, which is awesome. Let's discuss that on the other PR.

artemiogr97 requested a review from a team as a code owner April 28, 2024 18:17

artemiogr97 changed the title ~~Implementation of rmtree~~ [stdlib] Implementation of rmtree Apr 28, 2024

abduld approved these changes Apr 30, 2024

View reviewed changes

stdlib/src/os/os.mojo Outdated Show resolved Hide resolved

stdlib/src/os/os.mojo Outdated Show resolved Hide resolved

artemiogr97 mentioned this pull request May 4, 2024

[stdlib] tempfile and tempdir #2352

Closed

ematejska added the mojo-repo Tag all issues with this label label May 6, 2024

artemiogr97 force-pushed the rmtree branch 2 times, most recently from d2ba422 to d8debbf Compare May 11, 2024 16:47

[stdlib] implement rmtree

d8debbf

Signed-off-by: Artemio Garza Reyna <artemiogr97@gmail.com>

JoeLoser added the imported-internally Signals that a given pull request has been imported internally. label May 14, 2024

laszlokindrat reviewed May 15, 2024

View reviewed changes

laszlokindrat self-assigned this May 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[stdlib] Implementation of rmtree #2439

[stdlib] Implementation of rmtree #2439

artemiogr97 commented Apr 28, 2024

abduld left a comment

laszlokindrat left a comment

laszlokindrat May 15, 2024

laszlokindrat May 15, 2024

artemiogr97 May 15, 2024 •

edited

laszlokindrat May 15, 2024

artemiogr97 May 15, 2024

laszlokindrat May 15, 2024

artemiogr97 May 15, 2024

artemiogr97 commented May 15, 2024

laszlokindrat commented May 16, 2024

		from os.path import islink, isdir, isfile


		fn rmtree(path: String, ignore_errors: Bool = False) raises:

[stdlib] Implementation of rmtree #2439

Are you sure you want to change the base?

[stdlib] Implementation of rmtree #2439

Conversation

artemiogr97 commented Apr 28, 2024

abduld left a comment

Choose a reason for hiding this comment

laszlokindrat left a comment

Choose a reason for hiding this comment

laszlokindrat May 15, 2024

Choose a reason for hiding this comment

laszlokindrat May 15, 2024

Choose a reason for hiding this comment

artemiogr97 May 15, 2024 • edited

Choose a reason for hiding this comment

laszlokindrat May 15, 2024

Choose a reason for hiding this comment

artemiogr97 May 15, 2024

Choose a reason for hiding this comment

laszlokindrat May 15, 2024

Choose a reason for hiding this comment

artemiogr97 May 15, 2024

Choose a reason for hiding this comment

artemiogr97 commented May 15, 2024

laszlokindrat commented May 16, 2024

artemiogr97 May 15, 2024 •

edited