MPD and DATA extension support #27

ScanMountGoat · 2023-03-04T22:34:26Z

This PR adds support for documents with multiple files (MPD) as well as adding the parsing for embedded data files. This adds what I would consider the "bare minimum" functionality for working with LDraw files in an application like Blender. The main functionality missing is the BFC extension and color handling.

weldr lib

Calling parse returns the main file for MPD files or the entire root file otherwise. This avoids loading some submodel geometry multiple times for MPD files.
When loading a SourceFile in the parse function, the file is assumed to be an MPD file and split into multiple files. If the split file list is empty, the entire file is used to still handle dat and ldr files normally.
The parse function returns a SourceFile instead of a reference. This simplifies the implementation but uses an additional clone. The old implementation avoided lifetime issues but didn't guarantee that the reference was valid. The return type can be changed later as long as it indicates the "main model" of the file.

weldr bin

The hierarchy of submodels is preserved when converting to gltf. This matches the behavior of other import/export tools when working with MPD files.
Meshes for parts are cached and instanced, resulting in lower memory usage and faster export times. This assumes that all .dat files are parts for now. This can be replaced by checking the type like "part" or "subpart" in the future.
Forward and backward slashes are tried using path-slash to fix some resolve errors in Linux and MacOS.
Prevents some validation warnings from the gltf Validator and VSCode extension. These errors mostly impact MPD files. I've noticed that some implementations of gltf like Blender will fail with strange errors if the file has validation warnings.

ScanMountGoat · 2023-03-04T22:40:15Z

The UCS Falcon loaded into Blender from gltf after adding some unofficial part paths to the resolver.

djeedai

I quite like the direction this is taking. Things like the removal of SubFileRef and using direct String references looks much simpler. There's a few things however I'm not convinced about, see individual comments for details. The blockers are probably few though, like the minimum Rust version bump. Thanks!

lib/Cargo.toml

lib/src/parse.rs

bin/weldr/src/weldr.rs

bin/weldr/src/convert.rs

djeedai · 2023-03-06T18:35:25Z

bin/weldr/src/convert.rs

+        };
+        gltf.nodes.push(node);
+
+        // TODO: Check the part type rather than the extension.


What amount of work is it to fix that now? Extension-based guesses is a bit ugly.

There's a special comment command for determining if the file is an official or unofficial "part". This should also be updated to check if each file contains geometry commands since apparently some files embed geometry inline for things like hoses and tubes. I'm not sure how trivial this would be to implement. Calling root_file.iter(...) can create enough geometry to crash many applications, so at least some level of instancing is necessary for medium to large models.

Ok I think I didn't understand then what is happening here. Why do we need to check for the official-ness of the part, and how is that related with the file extension?

There's more than one type that determines if it is a part. Ideally, we could just create a mesh for each ldraw file. In practice, this creates an unmanageable number of objects in the scene when importing. Stopping the recursion at the "part" level uses more memory but makes the file easier to work with. Whether the overhead matters depends on the application.

If I understand correctly you're saying that:

Parsing and building a mesh for each single file in a document produces too many objects for an application to handle.

Building a single "merged" mesh for the entire document conversely creates a mesh too big to handle.

The middle ground is to create a smaller quantity of large-ish meshes, and for that we use a heuristic here by stopping the recursion at the part level, by guessing what is a reusable part (instancing).

Did I get the reasoning correctly?

lib/src/lib.rs

lib/tests/parse.rs

djeedai

Thanks for the changes. There's a couple more things I'm not sure I understand (see comments) but that's otherwise roughly in a mergeable state. Thanks!

djeedai · 2023-03-07T20:52:22Z

bin/weldr/src/convert.rs

+        };
+        gltf.nodes.push(node);
+
+        // TODO: Check the part type rather than the extension.


Ok I think I didn't understand then what is happening here. Why do we need to check for the official-ness of the part, and how is that related with the file extension?

bin/weldr/src/convert.rs

lib/src/lib.rs

ScanMountGoat · 2023-03-07T21:37:49Z

The filename_char code has been completely removed since it's not actually used anywhere. The spec encourages not using special characters in filenames but doesn't require it. Applying validation beyond just checking for utf8 won't work with MPD files since the files don't need to be files on disk. These will be strings entered by the user in applications like Studio, LeoCAD, etc.

already covered by self.cmds.iter()

ScanMountGoat · 2023-03-18T16:26:12Z

I've adjusted the path normalization to only use the OS specific separator. This also seems to be the approach used by LDView. This also uses glam to simplify some of the matrix math. Let me know if there are any issues blocking this PR.

djeedai

I'd just like to understand what looks like a heuristic to balance mesh size vs. object count (?), and after that it's good to go!

djeedai · 2023-03-22T20:58:22Z

bin/weldr/src/convert.rs

+        };
+        gltf.nodes.push(node);
+
+        // TODO: Check the part type rather than the extension.


If I understand correctly you're saying that:

Parsing and building a mesh for each single file in a document produces too many objects for an application to handle.

Building a single "merged" mesh for the entire document conversely creates a mesh too big to handle.

The middle ground is to create a smaller quantity of large-ish meshes, and for that we use a heuristic here by stopping the recursion at the part level, by guessing what is a reusable part (instancing).

Did I get the reasoning correctly?

bin/weldr/src/weldr.rs

djeedai · 2023-03-23T21:04:05Z

Thanks a lot for that fantastic contribution and pushing through all the comments @ScanMountGoat, much appreciated!

ScanMountGoat added 2 commits March 4, 2023 16:03

simplify subfile iteration

d281f07

try forward and backward slashes for paths

f6e9506

djeedai requested changes Mar 6, 2023

View reviewed changes

djeedai added the enhancement New feature or request label Mar 6, 2023

mpd and data support

f09cef5

ScanMountGoat force-pushed the main branch from 38b9071 to f09cef5 Compare March 6, 2023 21:49

ScanMountGoat added 5 commits March 6, 2023 15:57

use option for gltf matrix

a18ce1d

return filename instead of file in parse

2ef71d9

remove unused trace

57a7aa8

derive eq and copy for Color

a60fc50

split_files -> split_mpd_file

c1c8685

djeedai reviewed Mar 7, 2023

View reviewed changes

remove unused function

6cd6f4d

ScanMountGoat added 11 commits March 9, 2023 08:56

use glam instead of cgmath

35eb522

separate module for parsing

6da4726

adjust path normalization

d62b267

remove redundant hi-res primitives

6887176

allow paths for parse function

12468fc

remove filename from SourceFile

704917b

remove redundant local_iter

4512ae6

already covered by self.cmds.iter()

don't skip unrecognized meta cmds in iter

290b539

fix material parsing and improve tests

bcdc592

resolve paths not in the ldraw folder

33338e4

test for infinite recursion in parse

c9ae933

djeedai reviewed Mar 22, 2023

View reviewed changes

djeedai approved these changes Mar 23, 2023

View reviewed changes

djeedai merged commit 01c1359 into djeedai:main Mar 23, 2023

djeedai mentioned this pull request May 9, 2024

multi-part document (MPD) support #26

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MPD and DATA extension support #27

MPD and DATA extension support #27

ScanMountGoat commented Mar 4, 2023

ScanMountGoat commented Mar 4, 2023

djeedai left a comment

djeedai Mar 6, 2023

ScanMountGoat Mar 6, 2023 •

edited

djeedai Mar 7, 2023

ScanMountGoat Mar 7, 2023

djeedai Mar 22, 2023

djeedai left a comment

djeedai Mar 7, 2023

ScanMountGoat commented Mar 7, 2023

ScanMountGoat commented Mar 18, 2023

djeedai left a comment

djeedai Mar 22, 2023

djeedai commented Mar 23, 2023

MPD and DATA extension support #27

MPD and DATA extension support #27

Conversation

ScanMountGoat commented Mar 4, 2023

ScanMountGoat commented Mar 4, 2023

djeedai left a comment

Choose a reason for hiding this comment

djeedai Mar 6, 2023

Choose a reason for hiding this comment

ScanMountGoat Mar 6, 2023 • edited

Choose a reason for hiding this comment

djeedai Mar 7, 2023

Choose a reason for hiding this comment

ScanMountGoat Mar 7, 2023

Choose a reason for hiding this comment

djeedai Mar 22, 2023

Choose a reason for hiding this comment

djeedai left a comment

Choose a reason for hiding this comment

djeedai Mar 7, 2023

Choose a reason for hiding this comment

ScanMountGoat commented Mar 7, 2023

ScanMountGoat commented Mar 18, 2023

djeedai left a comment

Choose a reason for hiding this comment

djeedai Mar 22, 2023

Choose a reason for hiding this comment

djeedai commented Mar 23, 2023

ScanMountGoat Mar 6, 2023 •

edited