Publications

(2024). RegionGrasp: A Novel Task for Contact Region Controllable Hand Grasp Generation. European Conference on Computer Vision (ECCV) Workshops.

PDF

(2024). GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction. European Conference on Computer Vision (ECCV).

PDF Project

(2024). RACon: Retrieval-Augmented Simulated Character Locomotion Control. IEEE International Conference on Multimedia & Expo (ICME), Oral.

(2024). Generative Human Motion Stylization in Latent Space. International Conference on Learning Representations (ICLR).

(2023). DVSOD: RGB-D Video Salient Object Detection. Neural Information Processing Systems (NeurIPS).

Dataset Project

(2023). Segment Anything Is Not Always Perfect: An Investigation of SAM on Different Real-world Applications. Workshop on Vision-based InduStrial InspectiON at CVPR 2023 (Best Paper), 1-11.

PDF

(2023). BigNeuron: A resource to benchmark and predict best-performing algorithms for automated reconstruction of neuronal morphology. Nature Methods, 1-12.

PDF DOI

(2023). Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR).

PDF

(2023). Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet. IEEE Transactions on Circuits and Systems for Video Technology(TCSVT).

PDF

(2023). Delving into Calibrated Depth for Accurate RGB-D Salient Object Detection. Interactional Journal of Computer Vision (IJCV).

(2022). Resource-Efficient Medical Image Analysis. MICCAI Workshop on Resource-Efficient Medical Image Analysis(REMIA).

PDF DOI

(2022). Object Wake-up: 3D Object Rigging from a Single Image.. European Conference on Computer Vision (ECCV).

PDF Project

(2022). Human Pose and Shape Estimation from Single Polarization Images. IEEE Transactions on Multimedia (TMM).

PDF

(2022). Promoting Saliency From Depth: Deep Unsupervised RGB-D Saliency Detection. International Conference on Learning Representations(ICLR).

PDF

(2022). DMRA: Depth-induced Multi-scale Recurrent Attention Network for RGB-D Saliency Detection. IEEE Transactions on Image Processing(TIP).

PDF

(2022). Action2video: Generating Videos of Human 3D Actions. International Journal of Computer Vision (IJCV).

PDF

(2022). Investigating Pose Representations and Motion Contexts Modeling for 3D Motion Prediction. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF Project

(2021). Joint Semantic Mining for Weakly Supervised RGB-D Salient Object Detection. International Conference on Neural Information Processing Systems (NeurIPS).

PDF Project

(2021). Joint Visual and Audio Learning for Video Highlight Detection. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).

PDF

(2021). EventHPE: Event-based 3D Human Pose and Shape Estimation. Proceedings of the IEEE/CVF International Conference on Computer Vision(ICCV).

PDF Code

(2021). Dual Learning Music Composition and Dance Choreography. Proceedings of the 29th ACM International Conference on Multimedia.

PDF

(2021). Learning Calibrated Medical Image Segmentation via Multi-rater Agreement Modeling. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR).

PDF

(2021). Calibrated RGB-D Salient Object Detection. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR).

(2020). Stabilizing Training of Generative Adversarial Nets via Langevin Stein Variational Gradient Descent. IEEE Transactions on Neural Networks and Learning Systems (TNNLS).

PDF DOI

(2020). Action2Motion: Conditioned Generation of 3D Human Motions. Proceedings of the 28th ACM International Conference on Multimedia (ACM).

PDF Code Dataset Project Video

(2020). 3D Human Shape Reconstruction from a Polarization Image. Proceedings of the European Conference on Computer Vision(ECCV).

PDF Dataset Project Video

(2020). FALCONS: FAst Learner-grader for CONtorted poses in Sports. International Workshop on Computer Vision in Sports (CVsports) at CVPR 2020.

PDF Dataset Slides

(2020). SparseFusion: Dynamic Human Avatar Modeling from Sparse RGBD Images. IEEE Transactions on Multimedia(TMM).

PDF Video DOI

(2020). Least Squares Approximation via Sparse Subsampled Randomized Hadamard Transform. IEEE Trans Big Data.

(2020). Improving retinal vessel segmentation with joint local loss by matting. Pattern Recognition.

(2020). IDRiD: Diabetic Retinopathy - Segmentation and Grading Challenge. Medical Image Analysis (MIA).

(2020). Fully automated leg movement tracking in freely moving insects using Feature Learning Leg Segmentation and Tracking (FLLIT). Journal of Visualized Experiments (JoVE).

(2019). Towards Natural and Accurate Future Motion Prediction of Humans and Animals. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR).

PDF Code Project Video

(2019). Multivariate Regression with Gross Errors on Manifold-valued Data. IEEE Trans. Pattern Analysis and Machine Intelligence (TPAMI).

(2019). Fully automated leg tracking of Drosophila neurodegeneration models reveals distinct conserved movement signatures. PLoS Computational Biology.

(2018). Transduction on Directed Graphs via Absorbing Random Walks. IEEE Trans. Pattern Analysis and Machine Intelligence (TPAMI).

(2018). Too Far to See? Not Really! Pedestrian Detection with Scale-aware Localization Policy. IEEE Trans. Image Processing (TIP).

(2018). Synthesizing Retinal and Neuronal Images with Generative Adversarial Nets. Medical Image Analysis (MedIA).

(2018). Supervised Segmentation of Un-annotated Retinal Fundus Images by Synthesis. IEEE Trans. Medical Imaging (TMI).

(2018). Multi-modal Multi-task Learning for Automatic Dietary Assessment. National Conference on Artificial Intelligence (AAAI).

(2017). Segment 2D and 3D Filaments by Learning Structured and Contextual Features. IEEE Trans. Medical Imaging (TMI).

(2017). Quantitative localization of a Golgi protein by imaging its fluorescence center of mass. Journal of Visualized Experiments (JoVE).

(2017). Quantitative 3D analysis of complex single border cell behaviors in coordinated collective cell migration. Nature Communications.

(2017). Pose Estimation from Line Correspondences: A Complete Analysis and A Series of Solutions. IEEE Trans. Pattern Analysis and Machine Intelligence (TPAMI).

(2017). Multiview and Multimodal Pervasive Indoor Localization. ACM Multimedia.

(2017). Lie-X: Depth Image Based Articulated Object Pose Estimation, Tracking, and Action Recognition on Lie Groups. International Journal of Computer Vision (IJCV).

(2017). Hand Action Detection from Ego-centric Depth Sequences. Pattern Recognition.

(2017). Fusion of Magnetic and Vision sensors for indoor localization: Infrastructure-free and More Effective. IEEE Trans. Multimedia (TMM).

(2016). Recognizing Complex Activities by a Probabilistic Interval-based Model. National Conference on Artificial Intelligence (AAAI).

(2016). NeuronCyto II: An Automatic and Quantitative Solution for Crossover Neural Cells in High Throughput Screening. Cytometry Part A.

(2016). Incremental Regularized Least Squares for Dimensionality Reduction of Large-Scale Data. SIAM Journal on Scientific Computing (SISC).

(2016). Estimate Hand Poses Efficiently from Single Depth Images. International Journal of Computer Vision (IJCV).

(2016). Action Recognition in Still Images with Minimum Annotation Efforts. IEEE Trans. Image Processing (TIP).

(2016). A novel imaging method for quantitative Golgi localization reveals differential intra-Golgi trafficking of secretory cargos. Molecular Biology of the Cell.

(2016). A Graph-theoretical Approach for Tracing Filamentary Structures in Neuronal and Retinal Images. IEEE Trans. Medical Imaging (TMI).

(2015). Robust Multivariate Regression with Grossly Corrupted Observations and Its Application to Personality Prediction. Journal of Machine Learning Research (Workshop & Conf. Proceedings)/ACML.

(2015). Learning to Boost Filamentary Structure Segmentation. International Conference on Computer Vision (ICCV).

(2015). Integrated Foreground Segmentation and Boundary Matting for Live Videos. IEEE Trans. Image Processing (TIP).

(2015). GHand: A GPU algorithm for realtime hand pose estimation using depth camera. Eurographics.

(2015). Automated Image Based Prominent Nucleoli Detection. Journal of Pathology Informatics.

(2015). An Efficient Self-Tuning Multiclass Classification Approach. LNCS.

(2014). Tracing retinal vessel trees by transductive inference. BMC Bioinformatics.

(2014). Tracing Retinal Blood Vessels by Matrix-Forest Theorem of Directed Graphs. Medical Image Computing and Computer Assisted Intervention (MICCAI).

(2014). Semi-supervised Domain Adaptation on Manifolds. IEEE Transactions on Neural Networks and Learning Systems (TNNLS).

(2014). Recognizing Flu-like Symptoms from Videos. BMC Bioinformatics.

(2014). Myopia in Asian Subjects with Primary Angle Closure: Implications for Glaucoma Trends in East Asia. Ophthalmology.

(2014). A retinal vessel boundary tracking method based on Bayesian theory and multi-scale line detection. Computerized Medical Imaging and Graphics.

(2014). A Random-Forest Random Field Approach for Cellular Image Segmentation. International Symposium on Biomedical Imaging (ISBI).

(2013). Subgrouping of Primary Angle-Closure Suspects Based on Anterior Segment Optical Coherence Tomography Parameters. Ophthalmology.

(2013). Riemannian Similarity Learning. International Conference on Machine Learning (ICML).

(2013). Finding Distinctive Shape Features for Hematoma Classification in Brain CT Images. International Conference on Tools with Artificial Intelligence (ICTAI).

(2013). Exploiting Syntactic, Semantic, and Lexical Regularities in Language Modeling via Directed Markov Random Fields. Computational Intelligence.

(2013). Efficient Hand Pose Estimation from a Single Depth Image. International Conference on Computer Vision (ICCV).

(2013). Editorial of the Special issue: Machine Learning in Motion Analysis: New Advances. Image and Vision Cmputing (IVC).

(2013). Automated Tracing of Retinal Blood Vessels Using Graphical Models. Scandinavian Conference on Image Analysis.

(2013). Anterior segment optical coherence tomography parameters in subtypes of primary angle closure. Investigative Ophthalmology & Visual Science.

(2012). Structured learning of local features for human action classification and localization. Image and Vision Computing (IVC).

(2012). Integrating Local Action Elements for Action Analysis. Computer Vision and Image Understanding (CVIU).

(2012). A Bag-of-Words Model for Cellular Image Segmentation. Advances in Bio-Imaging: From Physics to Signal Understanding Issues.

(2011). Real-time Discriminative Background Subtraction. IEEE Trans. Image Processing (TIP).

(2011). Incorporating Estimated Motion in Real-time Background Subtraction. IEEE International Conference on Image Processing (ICIP).

(2011). Foreground Segmentation of Live Videos using Locally Competing 1SVMs. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

(2011). Elastic Sequence Correlation for Human Action Analysis. IEEE Trans. Image Processing (TIP).

(2011). Discriminative Human Action Segmentation and Recognition using SMMs. International Journal of Computer Vision (IJCV).

(2011). Discriminative Cellular Segmentation for Microscopic Images. Medical Image Computing and Computer Assisted Intervention (MICCAI).

(2010). Implicit Motion-Shape Model: A generic approach for action matching. International Conference on Image Processing.

(2010). Human Action Recognition from Boosted Pose Estimation. International Conference on Digital Image Computing: Techniques and Applications (DICTA).

(2010). Human Action Recognition and Localization in Video using Structured Learning of Local Space-Time Features. IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

(2010). Efficient Learning to Label Images. International Conference on Pattern Recognition.

(2009). Learning Graph Matching. IEEE Trans. Pattern Analysis and Machine Intelligence (TPAMI).

(2009). User-Driven Lossy Compression for Images and Video. Image and Vision Computing New Zealand (IVCNZ).

(2009). Spatial-Temporal Modeling of Interactive Image Interpretation. Spatial Vision.

(2009). Realtime Background Subtraction from Dynamic Scenes. International Conference on Computer Vision (ICCV).

(2009). Machine Learning for Human Motion Analysis: Theory and Practice. IGI Global.

(2009). Learning-based multiview video coding. Proc. of the 27th Picture Coding Symposium (PCS).

(2009). Inference of the Structural Credit Risk Model using MLE. IEEE Symposium on Computational Intelligence for Financial Engineering.

(2009). Human Body Articulation for Action Recognition in Video Sequences. IEEE Advanced Video and Signal Based Surveillance (AVSS).

(2008). Prediction and Change Detection In Sequential Data for Interactive Applications. National Conference on Artificial Intelligence (AAAI).

(2008). Discriminative Human Action Segmentation and Recognition using. Semi-Markov Model. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

(2008). Consistent image analogies using semi-supervised learning. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

(2007). Stochastic Analysis of Lexical and Semantic for Enhanced Structural Language Model. International Colloquium on Grammatical Inference.

(2007). Online Learning with Novelty Detection in Human-guided Road Tracking. IEEE Transactions on Geoscience and Remote Sensing.

(2007). Learning to compress images and videos. Proceedings of the 24th international conference on Machine learning (ICML).

(2007). Learning Graph Matching. IEEE International Conference on Computer Vision (ICCV).

(2007). Influence of Human Inputs on Semi-automatic Image Interpretation. International Conference on Human-Computer Interaction (HCII).

(2007). Bayesian stereo matching. Computer Vision and Image Understanding (CVIU).

(2006). Component Optimization for Image Understanding: a Bayesian Approach. IEEE Trans. Pattern Analysis and Machine Intelligence (TPAMI).

(2006). An Online Discriminative Approach to Background Subtraction. IEEE international conference on advanced video and signal based surveillance.

(2006). A Novel Learning Approach for Semi-automatic Road Tracking. International Workshop on Pattern Recognition in Remote Sensing (PRRS).

(2005). Variational Bayesian image modelling. Proceedings of the 22nd international conference on Machine learning (ICML).

(2005). Exploiting syntactic, semantic and lexical regularities in language modeling via directed Markov random fields. Proceedings of the 22nd international conference on Machine learning (ICML).

(2005). Bayesian Image Understanding: From Images to Virtual Forests. International Journal of Robotics and Automation.

(2004). Forestry Scene Geometry Estimation via Statistical Learning. CVPR Workshop on Learning in Computer Vision and Pattern Recognition (LCVPR).

(2004). Bayesian Stereo Matching. CVPR Workshop on Generative Model Based Vision (GMBV).

(2003). Unsupervised Image Segmentation: A Bayesian Approach. International Conference on Vision Interface (VI).

(2003). Doubly-MRF Stereo Matching. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

(2003). A Bayesian Approach to Image Understanding: From Images to Virtual Forests. International Conference on Vision Interface (VI).

(2002). A Trainable Hierarchical Hidden Markov Tree Model for color image Annotation. International conference on Pattern Recognition (ICPR).