Yi Zhang
I obtained my Ph.D. at Johns Hopkins University, advised by Bloomberg
Distinguished Professor Alan
Yuille, where I worked on computer vision and machine learning.
I received my B.E. in EE from Tsinghua University.
Email  / 
CV  / 
Google
Scholar  / 
Github
|
|
Research
My current research focuses on: 1) human analysis 2) generative models for (3D)
computer vision, and 3) robustness and generalization.
|
|
Learning Direct Text-to-3D Generation on Massive Noisy 3D Data
Qihao Liu,
Yi Zhang,
Song Bai,
Adam Kortylewski,
Alan Yuille
CVPR, 2024
paper /
arXiv /
Project /
code
Enabling training 3D diffusion model on large-scale noisy 3D data.
|
|
Generating Images with 3D Annotations using Diffusion Models
Wufei Ma,
Qihao Liu,
Jiahao Wang,
Xiaoding Yuan,
Angtian Wang,
Yi Zhang,
Zihao
Xiao,
Guofeng Zhang,
Beijia Lu,
Ruxiao
Duan,
Yongrui Qi,
Adam Kortylewski,
Yaoyao Liu,
Alan Yuille
ICLR, 2024   (Spotlight)
paper /
arXiv /
Project /
code
Generating images with 3D Annotations using Diffusion Models.
|
|
3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose
Estimation
Yi Zhang*,
Pengliang Ji*,
Angtian Wang,
Jieru Mei,
Adam Kortylewski,
Alan Yuille
ICCV, 2023
paper /
arXiv /
Project /
code
An analysis-by-synthesis approach for 3D human pose estimation that is highly
robust to occlusions.
|
|
Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape
Jiacong Xu,
Yi Zhang, ...,
Alan Yuille,
Adam Kortylewski
ICCV, 2023
paper /
arXiv /
Project /
code
Animal3D consists of 3379 images collected from 40 mammal species, high-quality
annotations of 26 keypoints, and importantly the pose and shape
parameters of the SMAL model. We demonstrate that synthetic pre-training is a
viable strategy to boost the model performance.
|
|
Explicit Occlusion Reasoning for Multi-person 3D Human Pose
Estimation
Qihao Liu,
Yi Zhang,
Song Bai,
Alan Yuille
ECCV, 2022
paper
/
arXiv /
code
We explicitly reason about occlusion in multi-person 3D human pose estimation
that can generalize better than using pose priors/constraints, data
augmentation, or implicit reasoning.
|
|
DASZL: Dynamic Action Signatures for Zero-shot Learning
Tae Soo Kim*,
Jonathan Jones*,
Michael Peven*,
Zihao Xiao,
Jin Bai,
Yi Zhang,
Weichao Qiu,
Alan Yuille,
Gregory D. Hager
AAAI, 2021
arXiv
This compositional approach allows us to reframe fine-grained recognition as
zero-shot activity recognition, where a detector is composed “on the fly” from
simple first-principles state machines supported by deep-learned components.
|
|
Nuisance-Label Supervision: Robustness Improvement by Free Labels
Xinyue Wei,
Weichao Qiu,
Yi Zhang,
Zihao Xiao,
Alan Yuille
ICCVW, 2021
arXiv /
supp
/
poster
Improving model robustness to nuisance factors using "free" nuidance labels and
adversarial debiasing.
|
|
Synthesize then Compare: Detecting Failures and Anomalies for
Semantic Segmentation
Yingda Xia*,
Yi Zhang*,
Fengze Liu,
Wei Shen,
Alan Yuille
ECCV, 2020   (Oral Presentation)
arXiv /
code /
video
A simple unified framework for failure and anomaly detection for segmantic
segmentation based on a generative model and a comparison module.
|
|
RSA: Randomized Simulation as Augmentation for Robust Human Action
Recognition
Yi Zhang,
Xinyue Wei,
Weichao Qiu,
Zihao Xiao,
Gregory D. Hager,
Alan Yuille
arXiv, 2019
arXiv
We use simulated videos to augment real training data to improve model robustness
to nuisance factors, e.g. novel viewpoint, change
of background and human appearance.
|
|
UnrealStereo: Controlling Hazardous Factors to Analyze Stereo Vision
Yi Zhang,
Weichao Qiu,
Qi Chen,
Xiaolin Hu,
Alan Yuille
3DV, 2018   (Oral Presentation)
paper /
project page /
code
We control hazardous factors, e.g. specularity, texturelessness and transparency,
to analyze robustness of binocular stereo algorithms. Findings in virtual world
is verified in real world.
|
|
SampleAhead: Online Classifier-Sampler Communication for Learning
from Synthesized Data
Qi Chen,
Weichao Qiu,
Yi Zhang,
Lingxi Xie,
Alan Yuille
BMVC, 2018   (Oral Presentation)
arXiv
Efficient online sampling algorithm for active learning from the combinatorially
large synthetic data space.
|
|
UnrealCV: Virtual Worlds for Computer Vision
Weichao Qiu,
Fangwei Zhong,
Yi Zhang,
Zihao Xiao,
Siyuan Qiao,
Tae Soo Kim,
Yizhou Wang,
Alan Yuille
ACM Multimedia Open Source Software Competition, 2017
paper /
project page
An open-source tool for interacting with and obtain data from virtual worlds in
Unreal Engine.
|
|