Publications

2024

Consistent Multimodal Generation via A Unified GAN Framework
Zhen Zhu, Yijun Li, Weijie Lyu, Krishna Kumar Singh, Zhixin Shu, Sören Pirk, Derek Hoiem
WACV, 2024. [paper]

2023

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu, Christopher Clark, Sangho Lee, Zichen Zhang, Savya Khosla, Ryan Marten, Derek Hoiem, Aniruddha Kembhavi
arXiv:2312.17172, 2023. [paper]

ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation
Yangyi Chen, Xingyao Wang, Manling Li, Derek Hoiem, Heng Ji
EMNLP, 2023. [paper]

WebWISE: Web Interface Control and Sequential Exploration with Large Language Models
Heyi Tao, Sethuraman T V, Michal Shlapentokh-Rothman, Derek Hoiem
arXiv:2310.16042, 2023. [paper]

Comparing Human Object Learning with Deep Neural Networks
Yinuo Peng, Zhen Zhu, Derek Hoiem, Ranxiao Frances Wang
Journal of Vision 23 (9), 2023. [paper]

Continual Learning in Open-vocabulary Classification with Complementary Memory Systems
Zhen Zhu, Weijie Lyu, Yao Xiao, Derek Hoiem
arXiv:2307.01430, 2023. [paper]

StyleGAN knows Normal, Depth, Albedo, and More
Anand Bhattad, Daniel McKee, Derek Hoiem, DA Forsyth
NeurIPS, 2023. [paper]

Make It So: Steering StyleGAN for Any Image Inversion and Editing
Anand Bhattad, Viraj Shah, Derek Hoiem, DA Forsyth
arxiv:2304.14403, 2023. [paper]

2022

Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Zhenhailong Wang, Manling Li, Ruochen Xu, Luowei Zhou, Jie Lei, Xudong Lin, Shuohang Wang, Ziyi Yang, Chenguang Zhu, Derek Hoiem, Shih-Fu Chang, Mohit Bansal, Heng Ji
NeurIPS, 2022. [paper]

GRIT: General Robust Image Task Benchmark
Tanmay Gupta, Ryan Marten, Aniruddha Kembhavi, Derek Hoiem
arXiv:2204.13653, 2022. [paper]

Webly Supervised Concept Expansion for General Purpose Vision Models
Amita Kamath, Christopher Clark, Tanmay Gupta, Eric Kolve, Derek Hoiem, Aniruddha Kembhavi
ECCV, 2022. [project] [demo] [github] [paper]

Towards General Purpose Vision Systems: An End-to-End Task-Agnostic Vision-Language Architecture
Tanmay Gupta, Amita Kamath, Aniruddha Kembhavi, Derek Hoiem
CVPR, 2022. [project] [demo] [github] [paper]

2021

PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility
Jae Yong Lee, Joseph DeGol, Chuhang Zou, Derek Hoiem
ICCV, 2021. [code] [paper]

Learning Curves for Analysis of Deep Networks
Derek Hoiem, Tanmay Gupta, Zhizhong Li, Michal Shlapentokh-Rothman
ICML, 2021. [project] [paper]

Manhattan Room Layout Reconstruction from a Single 360 Image: A Comparative Study of State-of-the-Art Methods
Chuhang Zou, Jheng-Wei Su, Chi-Han Peng, Alex Colburn, Qi Shan, Peter Wonka, Hung-Kuo Chu, Derek Hoiem
IJCV Vol 129 (5), p. 1410-1431, 2021. [project] [paper]

Task-assisted domain adaptation with anchor tasks
Zhizhong Li, Linjie Luo, Sergey Tulyakov, Qieyun Dai, Derek Hoiem
WACV, 2021. [paper]

2020

Improving Structure from Motion with Reliable Resectioning
Rajbir Kataria, Joseph DeGol, Derek Hoiem
3DV, 2020. [project] [paper]

Contrastive Learning for Weakly Supervised Phrase Grounding
Tanmay Gupta, Arash Vahdat, Gal Chechik, Xiaodong Yang, Jan Kautz, Derek Hoiem
ECCV, 2020. [project] [paper] [supp]

Improving Confidence Estimates for Unfamiliar Examples
Zhizhong Li and Derek Hoiem
CVPR, 2020. [project] [pdf] [supp] [github]

Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion
Hongxu Yin, Pavlo Molchanov, Jose M. Alvarez, Zhizhong Li, Arun Mallya, Derek Hoiem, Niraj K. Jha, Jan Kautz
CVPR, 2020. [pdf] [supp] [arXiv]

Silhouette Guided Point Cloud Reconstruction beyond Occlusion
Chuhang Zou and Derek Hoiem
WACV, 2020. [pdf]

2019

No-Frills Human-Object Interaction Detection: Factorization, Appearance and Layout Encodings, and Training Techniques
Tanmay Gupta, Alexander Schwing, Derek Hoiem
ICCV, 2019. [arXiv] [project] [github]

ViCo: Word Embeddings from Visual Co-occurrences
Tanmay Gupta, Alexander Schwing, Derek Hoiem
ICCV, 2019. [arXiv] [project] [github]

Complete 3D Scene Parsing from Single RGBD Image
Chuhang Zou, Ruiqi Guo, Zhizhong Li, Derek Hoiem
International Journa of Computer Vision (IJCV), Vol. 127, No 2, Feb 2019. [arxiv]

An automated tool for measuring human limb bones using 2D images
Amanda B Lee, Peng Li, Derek Hoiem
American Journal of Physical Anthropology, Vol 168, Mar 2019.

2018

Improved structure from motion using fiducial marker matching
Joseph DeGol, Timothy Bretl, Derek Hoiem
ECCV, 2018. [pdf] [project]

Imagine This! Scripts to Compositions to Videos
Tanmay Gupta, Dustin Schwenk, Ali Farhadi, Derek Hoiem, Aniruddha Kembhavi
ECCV, 2018. [arXiv] [project]

FEATS: Synthetic Feature Tracks for Structure from Motion Evaluation
Joseph DeGol, Jae Yong Lee, Rajbir Kataria, Daniel Yuan, Timothy Bretl, Derek Hoiem
3DV, 2018. [pdf] [project]

LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image
Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem
CVPR, 2018. [pdf] [arXiv] [github]

Pixels, voxels, and views: A study of shape representations for single view 3D object shape prediction
Daeyun Shin, Charless C. Fowlkes, Derek Hoiem
CVPR, 2018. [pdf]

2017

3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks
Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, Derek Hoiem
ICCV, 2017. [pdf] [arXiv] [github] [data]

Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks
Tanmay Gupta, Kevin Shih, Saurabh Singh, Derek Hoiem
ICCV, 2017. [pdf] [slides]

ChromaTag: A Colored Marker and Fast Detection Algorithm
Joseph DeGol, Timothy Bretl, Derek Hoiem
ICCV, 2017. [pdf] [project]

Learning without Forgetting
Zhizhong Li, Derek Hoiem
IEEE PAMI, 99, Nov 2017. [arXiv]

2016

Swapout: Learning an ensemble of deep architectures
Saurabh Singh, Derek Hoiem, David Forsyth
NIPS, 2016. [pdf]

Learning without Forgetting
Zhizhong Li and Derek Hoiem
ECCV, 2016. [pdf] [github]

Learning to Localize Little Landmarks
Saurabh Singh, Derek Hoiem, David Forsyth
CVPR, 2016. [pdf] [project]

Geometry-Informed Material Recognition
Joseph DeGol, Mani Golparvar-Fard, Derek Hoiem
CVPR, 2016. [pdf] [project]

Where to Look: Focus Regions for Visual Question Answering
Kevin J. Shih, Saurabh Singh, Derek Hoiem
CVPR, 2016. [pdf]

2015

Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization
Kevin J Shih, Arun Mallya, Saurabh Singh, Derek Hoiem
BMVC, 2015. [pdf] [project]

Predicting Complete 3D Models of Indoor Scenes
Ruiqi Guo, Chuhang Zou, Derek Hoiem
arXiv:1504.02437, 2015. [pdf] [video] [supplemental]

Completing 3D Object Shape from One Depth Image
Jason Rock, Tanmay Gupta, Justin Thorsen, JunYoung Gwak, Daeyun Shin, Derek Hoiem
CVPR, 2015. [pdf] [supplemental]

Learning a Sequential Search for Landmarks
Saurabh Singh, Derek Hoiem, and David Forsyth
CVPR, 2015. [pdf] [project]

Family Member Identification from Photo Collections
Qieyun Dai, Peter Carr, Leonid Sigal, and Derek Hoiem
IEEE Winter Conference on Applications of Computer Vision (WACV), 2015. [pdf]

Labeling Complete Surfaces in Scene Understanding
Ruiqi Guo and Derek Hoiem
IJCV, Vol. 112 (2), April 2015. [pdf]

2014

Category-Independent Object Proposals with Diverse Ranking
Ian Endres and Derek Hoiem
PAMI, Vol. 36 (2), Feb 2014. [pdf] [project]

2013

Support Surface Prediction in Indoor Scenes
Ruiqi Guo and Derek Hoiem
ICCV, 2013. [pdf] [project] [dataset]

Boundary Cues for 3D Object Shape Recovery
Kevin Karsch, Zicheng Liao, Jason Rock, Jonathan T. Barron, and Derek Hoiem
CVPR, 2013. [pdf] [supp] [data]

Learning Collections of Part Models for Object Recognition
Ian Endres, Kevin Shih, Johnston Jiaa, and Derek Hoiem
CVPR, 2013. [pdf] [project]

Improved Object Categorization and Detection Using Comparative Object Similarity
Gang Wang, David Forsyth, and Derek Hoiem
PAMI Vol. 35 (10), Oct 2013. [pdf]

2012

Diagnosing Error in Object Detectors
Derek Hoiem, Yodsawalai Chodpathumwan, and Qieyun Dai
ECCV, 2012. [pdf] [project] [slides]

Beyond the line of sight: labeling the underlying surfaces
Ruiqi Guo and Derek Hoiem
ECCV, 2012. [pdf]

Indoor Segmentation and Support Inference from RGBD Images
Nathan Silberman, Derek Hoiem, Pushmeet Kohli, and Rob Fergus
ECCV, 2012. [pdf] [project] [slides]

Learning Shared Body Plans
Ian Endres, Vivek Srikumar, Ming-wei Chang, and Derek Hoiem
CVPR, 2012. [pdf]

Learning to Localize Detected Objects
Qieyun Dai and Derek Hoiem
CVPR, 2012. [pdf]

Recovering Free Space of Indoor Scenes from a Single Image
Varsha Hedau and Derek Hoiem and David Forsyth
CVPR, 2012. [pdf]

A Data-driven Method for Feature Transformation
Mert Dikmen and Derek Hoiem and Thomas S. Huang
CVPR, 2012. [pdf]

Learning Image Similarity from Flickr Groups Using Fast Kernel Machines
Gang Wang, Derek Hoiem, and David Forsyth
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 16 Jan 2012. [pdf]

Paired Regions for Shadow Detection and Removal
Ruiqi Guo, Qieyun Dai, and Derek Hoiem
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 02 Oct 2012. [pdf] [data]

2011

Representations and Techniques for 3D Object Recognition and Scene Interpretation
Synthesis Lecture on Artificial Intelligence and Machine Learning
D. Hoiem and S. Savarese
Morgan & Claypool Publishers, Aug 2011. ISBN: 1608457281 [amazon] [M&P] [draft pdf]

Rendering Synthetic Objects into Legacy Photographs
K. Karsch and V. Hedau and D. Forsyth and D. Hoiem
ACM SIGGRAPH Asia 2011. [pdf (3mb)] [pdf (42mb)] [video] [github]

Learning Random Fields using Graph Cuts
M. Szummer and P. Kohli and D. Hoiem
Chapter in Markov Random Fields for Vision and Image Processing
Edited by A. Blake, P. Kohli, and C. Rother, MIT Press, September 2011. [link]

Single-Image Shadow Detection and Removal using Paired Regions
R. Guo and Q. Dai and D. Hoiem
CVPR, 2011. [pdf] [data] [journal]

Recovering Occlusion Boundaries from an Image
D. Hoiem, A.A. Efros, and M. Hebert
IJCV (91), No. 3, 2011. [pdf]
The final publication is available at www.springerlink.com.

2010

Category Independent Object Proposals
I. Endres and D. Hoiem
ECCV 2010. [pdf] [project] [journal]

Thinking Inside the Box: Using Appearance Models and Context Based on Room Geometry
V. Hedau, D. Hoiem, and D.A. Forsyth
ECCV 2010. [pdf]

Attribute-Centric Recognition for Cross-Category Generalization
A. Farhadi, I. Endres, and D. Hoiem
CVPR 2010. [pdf]

Comparative object similarity for improved recognition with few or no examples
G. Wang, D.A. Forsyth, D. Hoiem
CVPR 2010. [pdf] [journal]

The Benefits and Challenges of Collecting Richer Object Annotations
Ian Endres, Ali Farhadi, Derek Hoiem, and David Forsyth
ACVHL 2010 (in conjunction with CVPR). [pdf]

It's All About the Data
T.L. Berg, A. Sorokin, G. Wang, D.A. Forsyth, D. Hoiem, A. Farhadi, and I. Endres
Proc. IEEE , Special Issue on Internet Vision, August 2010, 98 (8), 1434-1453.

2009

Recovering the Spatial Layout of Cluttered Rooms
V. Hedau, D. Hoiem, and D.A. Forsyth
ICCV 2009. [pdf] [project]

Learning Image Similarity from Flickr Groups Using Stochastic Intersection Kernel Machines
G. Wang, D. Hoiem, and D.A. Forsyth
ICCV 2009. [pdf] [project] [journal]

Describing Objects by their Attributes
A. Farhadi, I. Endres, D. Hoiem, and D.A. Forsyth
CVPR 2009. [pdf] [project]

Building Text Features for Object Image Classification
G. Wang, D. Hoiem, and D.A. Forsyth
CVPR 2009. [pdf]

An Empirical Study of Context in Object Detection
S.K. Divvala, D. Hoiem, J.H. Hays, A.A. Efros, and M. Hebert
CVPR 2009. [pdf]

2008

Learning CRFs using Graph Cuts
M. Szummer, P. Kohli, and D. Hoiem
ECCV 2008. [pdf]

Closing the Loop on Scene Interpretation
D. Hoiem, A.A. Efros, and M. Hebert
CVPR 2008. [pdf]

Putting Objects in Perspective
D. Hoiem, A.A. Efros, and M. Hebert
IJCV (80), No. 1, October 2008. [pdf]
The final publication is available at www.springerlink.com.

2007

Seeing the World Behind the Image: Spatial Layout for 3D Scene Understanding
D. Hoiem
Doctoral Dissertation, CMU-RI-TR-07-28, Robotics Institute, Carnegie Mellon University, August 2007. [pdf]
CMU School of Computer Science Distinguished Dissertation Award
ACM Doctoral Dissertation Honorable Mention

Learning to Extract Object Boundaries using Motion Cues
A.N. Stein, D. Hoiem, and M. Hebert
ICCV 2007. [pdf]

Recovering Occlusion Boundaries from a Single Image
D. Hoiem, A.N. Stein, A.A. Efros, and M. Hebert
ICCV 2007. [pdf] [journal]

Photo Clip Art
J-F. Lalonde, D. Hoiem, A.A. Efros, J. Winn, C. Rother and A. Criminisi
ACM SIGGRAPH 2007. [pdf] [project ]

3D LayoutCRF for Multi-View Object Class Recognition and Segmentation
D. Hoiem, C. Rother, and J. Winn
CVPR 2007. [pdf]

Recovering Surface Layout from an Image
D. Hoiem, A.A. Efros, and M. Hebert
IJCV, Vol. 75, No. 1, October 2007. [pdf]
The final publication is available at www.springerlink.com.

2006

Putting Objects in Perspective
D. Hoiem, A.A. Efros, and M. Hebert
CVPR 2006. Best Paper Award [pdf] [project] [journal]

Opportunistic use of vision to push back the path-planning horizon
B. Nabbe, D. Hoiem, A.A. Efros, and M. Hebert
IROS 2006. [pdf]

2005

Geometric Context from a Single Image
D. Hoiem, A.A. Efros, and M. Hebert
ICCV 2005. [pdf] [project] [journal]

Automatic Photo Pop-up
D. Hoiem, A.A. Efros, and M. Hebert
ACM SIGGRAPH 2005. [pdf] [project]

Vision for Music Identification
Y. Ke, D. Hoiem, and R. Sukthankar
ICCV 2005. [pdf] [project]

SOLAR: Sound Object Localization and Retrieval in Complex Audio Environments
D. Hoiem, Y. Ke, and R. Sukthankar
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005. [pdf] [project]

2004

Object-Based Image Retrieval Using the Statistics of Images
D. Hoiem, R. Sukthankar, H. Schneiderman, and L. Huston
CVPR 2004. [pdf] [project]

SnapFind: Brute Force Interactive Image Retrieval
L. Huston, R. Sukthankar, D. Hoiem, and J. Zhang
IEEE International Conference on Image Processing and Graphics, 2004. [pdf]

Home