Metadrive: Composing diverse driving scenarios for generalizable reinforcement learning Q Li, Z Peng, L Feng, Q Zhang, Z Xue, B Zhou IEEE transactions on pattern analysis and machine intelligence (TPAMI), 2022 | 208 | 2022 |
Learn to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining Q Zhang, Z Peng, B Zhou European Conference on Computer Vision (ECCV), 2022 | 35 | 2022 |
F³A-GAN: Facial Flow for Face Animation With Generative Adversarial Networks X Wu, Q Zhang, Y Wu, H Wang, S Li, L Sun, X Li IEEE Transactions on Image Processing (TIP), 2021 | 34 | 2021 |
Scenewiz3d: Towards text-guided 3d scene composition Q Zhang, C Wang, A Siarohin, P Zhuang, Y Xu, C Yang, D Lin, B Zhou, ... IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 24 | 2023 |
Improving the generalization of end-to-end driving through procedural generation Q Li, Z Peng, Q Zhang, C Liu, B Zhou arXiv preprint arXiv:2012.13681, 2020 | 22 | 2020 |
Generative category-level shape and pose estimation with semantic primitives G Li, Y Li, Z Ye, Q Zhang, T Kong, Z Cui, G Zhang Conference on Robot Learning (CORL), 2023 | 20 | 2023 |
Geomim: Towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding J Liu, T Wang, B Liu, Q Zhang, Y Liu, H Li International Conference on Computer Vision (ICCV), 2023 | 13* | 2023 |
Towards smooth video composition Q Zhang, C Yang, Y Shen, Y Xu, B Zhou International Conference on Learning Representation (ICLR), 2022 | 13 | 2022 |
Dart: Denoising autoregressive transformer for scalable text-to-image generation J Gu, Y Wang, Y Zhang, Q Zhang, D Zhang, N Jaitly, J Susskind, S Zhai arXiv preprint arXiv:2410.08159, 2024 | 5 | 2024 |
Berfscene: Bev-conditioned equivariant radiance fields for infinite 3d scene generation Q Zhang, Y Xu, Y Shen, B Dai, B Zhou, C Yang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 4 | 2023 |
Urban Scene Diffusion through Semantic Occupancy Map J Zhang, Q Zhang, L Zhang, RR Kompella, G Liu, B Zhou arXiv preprint arXiv:2403.11697, 2024 | 3 | 2024 |
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting Q Zhang, Y Xu, C Wang, HY Lee, G Wetzstein, B Zhou, C Yang arXiv preprint arXiv:2405.18424, 2024 | 1 | 2024 |
Learning Modulated Transformation in GANs C Yang, Q Zhang, Y Xu, J Zhu, Y Shen, B Dai Advances in Neural Information Processing Systems (NeurIPS), 2024 | 1 | 2024 |
Pgdrive: Procedural generation of driving environments for generalization Q Li, Z Peng, Q Zhang, C Liu, B Zhou | 1 | |
World-consistent Video Diffusion with Explicit 3D Modeling Q Zhang, S Zhai, MA Bautista, K Miao, A Toshev, J Susskind, J Gu arXiv preprint arXiv:2412.01821, 2024 | | 2024 |
Towards Text-guided 3D Scene Composition Supplementary Material Q Zhang, C Wang, A Siarohin, P Zhuang, Y Xu, C Yang, D Lin, B Zhou, ... | | |