Research Project for image to image translation in RGB

Riotpiaole/rgbd

RGBD Research Project

Overview

  • Building RGB additive color with supervised machine learning.

Multi-View 3D visualization

  • original input

  • Original input images from three points of view (POV), which are masked using depth data and then filtered with OpenCV.

  • Original author: PoKong Lai

  • Visualizing the 3D output to verify that the camera poses were predicted correctly.

    • 3d of img_liang
    • 3d of po
  • This visualizes the environment with the three relative camera locations.

  • The major challenge is how to synchronize these cameras, so we use SLAM and VR point-cloud scattering to predict the poses.
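
To illustrate how three relative camera locations place the views in one shared environment, here is a minimal sketch. It assumes each predicted pose is available as a 4x4 camera-to-world matrix; the helper names `make_pose` and `to_world` and the example poses are hypothetical and not taken from the repository. The SLAM-based pose prediction itself is not shown.

```python
import numpy as np

def make_pose(yaw_deg, translation):
    """Build a 4x4 camera-to-world matrix from a yaw angle (rotation
    about the Y axis, in degrees) and a 3D translation."""
    t = np.radians(yaw_deg)
    pose = np.eye(4)
    pose[:3, :3] = [[np.cos(t), 0.0, np.sin(t)],
                    [0.0, 1.0, 0.0],
                    [-np.sin(t), 0.0, np.cos(t)]]
    pose[:3, 3] = translation
    return pose

def to_world(points_cam, pose):
    """Map an (N, 3) array of camera-frame points into the world frame."""
    ones = np.ones((len(points_cam), 1))
    homogeneous = np.hstack([points_cam, ones])  # (N, 4)
    return (pose @ homogeneous.T).T[:, :3]

# Hypothetical poses for the three views (assumed values, for illustration only).
poses = {
    "front": make_pose(0.0, [0.0, 0.0, 0.0]),
    "left": make_pose(120.0, [1.0, 0.0, 1.0]),
    "right": make_pose(-120.0, [-1.0, 0.0, 1.0]),
}

# The camera center of each view is the camera-frame origin mapped to world space.
centers = {name: to_world(np.zeros((1, 3)), pose)[0] for name, pose in poses.items()}
```

Plotting `centers` together with the transformed point clouds is one way to check visually whether the predicted poses are consistent with each other.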

Preprocessing

  • Record a person from three points of view, then perform loop closure to predict the camera poses of these POVs.

    • filtered image
      • This is the initial mask that is selected. For each image, we compute a similarity score and pick the most similar components as the filter.
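
The masking and selection steps could look roughly like the sketch below. It assumes the depth data is a per-pixel array aligned with the RGB image, and it uses intersection over union (IoU) as the similarity measure. IoU, the depth range, and all function names here are assumptions for illustration; the repository may use a different measure and OpenCV-based filtering.

```python
import numpy as np

def depth_mask(depth, near, far):
    """Boolean mask of pixels whose depth lies within [near, far]."""
    return (depth >= near) & (depth <= far)

def apply_mask(rgb, mask):
    """Zero out the RGB pixels that fall outside the mask."""
    return rgb * mask[..., None].astype(rgb.dtype)

def iou(a, b):
    """Intersection over union of two boolean masks (assumed similarity measure)."""
    union = np.logical_or(a, b).sum()
    return np.logical_and(a, b).sum() / union if union else 0.0

def pick_best_mask(candidates, reference):
    """Return the index of the candidate mask most similar to the reference."""
    scores = [iou(c, reference) for c in candidates]
    return int(np.argmax(scores))

# Toy example: a 2x3 depth map and an all-white RGB image.
depth = np.array([[0.5, 1.0, 3.0],
                  [1.5, 2.0, 4.0]])
rgb = np.full((2, 3, 3), 255, dtype=np.uint8)

mask = depth_mask(depth, near=1.0, far=2.0)
masked_rgb = apply_mask(rgb, mask)

candidates = [
    np.array([[True, True, False], [False, False, False]]),
    np.array([[False, True, False], [True, False, False]]),
]
best = pick_best_mask(candidates, reference=mask)
```

In this toy case the second candidate overlaps the depth mask more closely, so it is the one selected as the filter.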

Result Video

  • network architecture

    • encoder
    • decoder
  • Link to the results of predicting the back-view image from the front-view image.

    • The leftmost column is the input, the middle column is the ground truth, and the last column is the prediction.
    • The front image (the leftmost column) is fed to the autoencoder to generate the back-view image.
  • grid-output
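
The three-column layout described above (input, ground truth, prediction) can be assembled with a few lines of NumPy. This is a minimal sketch, assuming each batch is an array of shape (N, H, W, C); the function name `make_grid` is hypothetical and not taken from the repository.

```python
import numpy as np

def make_grid(inputs, targets, predictions):
    """Arrange N samples as rows of [input | ground truth | prediction].

    Each argument has shape (N, H, W, C); the result has shape
    (N * H, 3 * W, C).
    """
    rows = [
        np.concatenate([inp, tgt, pred], axis=1)  # side by side along width
        for inp, tgt, pred in zip(inputs, targets, predictions)
    ]
    return np.concatenate(rows, axis=0)  # stack rows along height

# Toy batches with constant values so the columns are easy to tell apart.
inputs = np.zeros((2, 4, 5, 3), dtype=np.uint8)
targets = np.ones((2, 4, 5, 3), dtype=np.uint8)
predictions = np.full((2, 4, 5, 3), 2, dtype=np.uint8)

grid = make_grid(inputs, targets, predictions)
```

The resulting array can be saved or displayed with any image library to compare predictions against the ground truth at a glance.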
