Skip to content
This repository has been archived by the owner on Aug 28, 2021. It is now read-only.


Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?


Failed to load latest commit information.
Latest commit message
Commit time
December 12, 2018 13:59
July 23, 2019 10:29
July 23, 2019 10:29
December 6, 2017 11:26
December 6, 2017 11:26
October 14, 2018 22:29
December 20, 2018 13:49
July 22, 2019 12:13
July 22, 2019 12:13
December 6, 2017 11:26
July 5, 2020 21:41
December 8, 2017 12:13

House3D: A Rich and Realistic 3D Environment

Yi Wu, Yuxin Wu, Georgia Gkioxari and Yuandong Tian

House3D is a virtual 3D environment which consists of thousands of indoor scenes equipped with a diverse set of scene types, layouts and objects sourced from the SUNCG dataset. It consists of over 45k indoor 3D scenes, ranging from studios to two-storied houses with swimming pools and fitness rooms. All 3D objects are fully annotated with category labels. Agents in the environment have access to observations of multiple modalities, including RGB images, depth, segmentation masks and top-down 2D map views. The renderer runs at thousands frames per second, making it suitable for large-scale RL training.

Usage instructions can be found at

Existing Research Projects with House3D

A. RoomNav (paper)

Yi Wu, Yuxin Wu, Georgia Gkioxari, Yuandong Tian

In this work we introduce a concept learning task, RoomNav, where an agent is asked to navigate to a destination specified by a high-level concept, e.g. dining room. We demonstrated two neural models: a gated-CNN and a gated-LSTM, which effectively improve the agent's sensitivity to different concepts. For evaluation, we emphasize on generalization ability and show that our agent can generalize across environments due to the diverse and large-scale dataset.

B. Embodied QA (project page | EQA paper | NMC paper)

Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra

Embodied Question Answering is a new AI task where an agent is spawned at a random location in a 3D environment and asked a natural language question ("What color is the car?"). In order to answer, the agent must first intelligently navigate to explore the environment, gather information through first-person (egocentric) vision, and then answer the question ("orange").


If you use our platform in your research, you can cite us with:

  title={Building generalizable agents with a realistic and rich 3D environment},
  author={Wu, Yi and Wu, Yuxin and Gkioxari, Georgia and Tian, Yuandong},
  journal={arXiv preprint arXiv:1801.02209},


House3D is released under the Apache 2.0 license.


a Realistic and Rich 3D Environment



Code of conduct

Security policy





No packages published