
Any documentation on the depth frame format? #7

Closed
andybak opened this issue Dec 1, 2020 · 12 comments

Comments

@andybak

andybak commented Dec 1, 2020

I've exported a recording in the native r3d format and I'm attempting to read the depth data

>>> pth = 'winhome/Documents/3D Scans/2020-10-28--15-01-03/rgbd/1.depth'
>>> fh = open(pth, "rb")
>>> compressed = fh.read()
>>> decompressed = liblzfse.decompress(compressed)

But then I'm not sure what to do with the decompressed data. Is it just a case of reading every 4 bytes and unpacking them as a single-precision float? The jpgs are 192×256, and doing the maths on that seems to add up: 192 × 256 × 4 = 196608, and len(decompressed) gives me 196608.

So this looks right:

>>> f = [struct.unpack('f', decompressed[x:x+4])[0] for x in range(0, len(decompressed), 4)]

Then I guess I can just write f into any image format that supports floating point (.hdr or .exr maybe)

Am I on the right lines? Are the values linear distances from the camera?

If so - it would be nice to add this to the docs.
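(If EXR/HDR tooling isn't handy, one alternative is to quantize the meter values to 16-bit millimeters, which any 16-bit PNG writer can store losslessly up to about 65 m. A minimal sketch, not part of Record3D itself, with made-up example values:)

```python
import numpy as np

# Hypothetical example data: depth values in meters (float32),
# as unpacked from a decompressed .depth buffer.
depth_m = np.array([0.5, 1.25, 3.0], dtype=np.float32)

# Quantize to uint16 millimeters (max representable depth ~65.5 m);
# any 16-bit grayscale PNG writer can then save the array as-is.
depth_mm = np.round(depth_m * 1000.0).astype(np.uint16)

print(depth_mm)  # [ 500 1250 3000]
```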

@andybak
Author

andybak commented Dec 1, 2020

Follow up question - what are the .conf files for? Are there some docs on this I've overlooked?

@marek-simonik
Owner

Hello Andy,
yes, you are right. See this simple example of how to load a .depth file:

import numpy as np
import cv2
import liblzfse  # https://pypi.org/project/pyliblzfse/


def load_depth(filepath):
    with open(filepath, 'rb') as depth_fh:
        raw_bytes = depth_fh.read()
        decompressed_bytes = liblzfse.decompress(raw_bytes)
        depth_img = np.frombuffer(decompressed_bytes, dtype=np.float32)

    depth_img = depth_img.reshape((640, 480))  # For a FaceID camera 3D Video
    # depth_img = depth_img.reshape((256, 192))  # For a LiDAR 3D Video

    return depth_img


if __name__ == '__main__':
    depth_filepath = '/tmp/depth_0.lzfse'
    depth_img = load_depth(depth_filepath)

    cv2.imshow('Depth', depth_img)
    cv2.waitKey(0)

As you wrote, the decompressed .depth file is just a buffer of raw float32 depth values (each float32 value is a depth in meters). There are 49,152 (i.e. 192×256) values for a LiDAR frame and 307,200 (i.e. 480×640) values for a FaceID frame.
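(Since the two frame sizes differ, the frame type could also be inferred from the decompressed byte count instead of hard-coding the reshape. A small sketch; the shape table is my own, derived from the sizes above:)

```python
import numpy as np

# Map decompressed byte counts to frame shapes (rows, cols):
# 49,152 float32 values -> LiDAR (256x192); 307,200 -> FaceID (640x480).
SHAPES = {49_152 * 4: (256, 192), 307_200 * 4: (640, 480)}

def reshape_depth(decompressed_bytes: bytes) -> np.ndarray:
    shape = SHAPES[len(decompressed_bytes)]
    return np.frombuffer(decompressed_bytes, dtype=np.float32).reshape(shape)

# Fabricated LiDAR-sized buffer just to exercise the function:
buf = np.zeros(49_152, dtype=np.float32).tobytes()
print(reshape_depth(buf).shape)  # (256, 192)
```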

The .conf files contain a confidence map for each frame. It has the same size as the depth map, and for each pixel of the depth map it contains a uint8 value in the range 0 to 2, which indicates how confident the framework is that the sensed LiDAR depth is "correct". In other words, it is a measure of depth data quality.
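(For illustration, a decompressed .conf buffer could be parsed much like the .depth buffer, just with dtype uint8. This assumes the .conf files are LZFSE-compressed like the .depth files, so you would run them through liblzfse.decompress first; the parsing step itself is sketched below on a fabricated buffer:)

```python
import numpy as np

# Sketch: parse an already-decompressed .conf buffer.
# Each pixel is one uint8 confidence value in {0, 1, 2},
# same resolution as the corresponding depth map.
def parse_conf(decompressed_bytes: bytes, shape=(256, 192)) -> np.ndarray:
    return np.frombuffer(decompressed_bytes, dtype=np.uint8).reshape(shape)

# Fabricated all-high-confidence buffer just to exercise the parser:
conf = parse_conf(bytes([2] * (256 * 192)))
print(conf.shape, conf.max())  # (256, 192) 2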

I think this answers your question, so I am closing this issue, but feel free to ask follow-up questions.

@andybak
Author

andybak commented Dec 18, 2020

(Thanks! The above was really helpful for me. However, will anyone else find it easily, given that it is in a closed GitHub issue? Part of my reason for opening this was to suggest that something like the above would be a great addition to the docs.)

@marek-simonik
Owner

You are right, thanks for reminding me of that. I added a link to this issue to the Wiki.

@wolterlw

wolterlw commented Mar 12, 2021

Had the same confusion and found this issue before going to the Wiki.
It would be very helpful to add some mention of it to the main Readme.

Also on a related note - is it possible to get distance in meters from an exported RGBD video?

@marek-simonik
Owner

OK, I will mention the Wiki in the Readme the next time I push an update.

As for getting the distance in meters from exported RGBD videos: yes, it is possible. I described how to do it in the Readme of this demo.

@wolterlw

Got it, thank you for the great app and library!
Please do add landscape mode for the iPad someday, though.

@marek-simonik
Owner

Thank you for the suggestion, noted! I will include landscape mode in a future update.

@zehuiz2

zehuiz2 commented Sep 1, 2021

Two follow-up questions:

  1. In 'How to use?', you wrote 'JSON config file (containing the intrinsic matrix, FPS, and width/height of the RGBD frames)'. Where could I find this info?
  2. Does 2 mean high confidence, or does 0?

@marek-simonik
Owner

To answer your questions:

  1. After you unzip an exported .r3d file, you will see a metadata file. This is the JSON config file.
  2. In my understanding, 2 is high confidence, 1 is "lower" confidence, and 0 is the lowest confidence.
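(Given that interpretation, a common use of the confidence map is to mask out low-confidence depth pixels. A small sketch with hypothetical same-shaped arrays, keeping only pixels marked 2:)

```python
import numpy as np

# Hypothetical depth (meters) and confidence arrays of identical shape,
# as would be loaded from matching .depth / .conf files.
depth = np.array([[1.0, 2.0], [3.0, 4.0]], dtype=np.float32)
conf = np.array([[2, 1], [0, 2]], dtype=np.uint8)

# Keep only the highest-confidence pixels; blank the rest with NaN.
filtered = np.where(conf == 2, depth, np.nan)
print(filtered)
```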

@zehuiz2

zehuiz2 commented Jan 28, 2022

Hi,
I wonder if you've updated both the LiDAR & FaceID depth resolutions?

depth_img = depth_img.reshape((1280, 960)) # For a FaceID camera 3D Video
depth_img = depth_img.reshape((512, 384)) # For a LiDAR 3D Video

Is the above correct?
Another three questions:

  1. Is it possible you could update the FaceID RGB resolution? It is still 640×480.
  2. It seems the LiDAR confidence resolution has not been updated?
  3. What are the units of the depth measurements? I assume it is mm?

Thank you very much!

@knsjoon

knsjoon commented May 25, 2022

An addition to this issue regarding the Apple ARKit depth confidence map:

From https://developer.apple.com/documentation/arkit/arconfidencelevel, there are only three confidence levels:

  • case low
    Depth-value accuracy in which the framework is less confident.

  • case medium
    Depth-value accuracy in which the framework is moderately confident.

  • case high
    Depth-value accuracy in which the framework is fairly confident.

Hope this helps future users understand why the confidence map only consists of 0, 1, and 2!
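(A tiny sketch of that mapping, assuming the numeric order in the exported .conf files matches ARKit's ARConfidenceLevel cases, low to high:)

```python
# Human-readable labels for the uint8 confidence values,
# following the ARConfidenceLevel ordering (low=0, medium=1, high=2).
ARKIT_CONFIDENCE = {0: "low", 1: "medium", 2: "high"}

print(ARKIT_CONFIDENCE[2])  # high
```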
