Difficulty Controlled Diffusion Model for Synthesizing Effective Training Data
The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization
Makeup Prior Models for 3D Facial Makeup Estimation and Applications
SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration
Towards Diverse and Consistent Typography Generation
Complementary-Contradictory Feature Regularization against Multimodal Overfitting
Multimodal color recommendation in vector graphic documents
Dissecting multimodal learning via regularized masking of multimodal features
グラフィックデザインの教師ありレイヤー分解
iMixer: invertible, implicit, and iterative MLP-Mixer from modern Hopfield networks
Color Recommendation for Vector Graphic Documents based on Multi-Palette Representation
Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization
AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval
An Intelligent Color Recommendation Tool for Landing Page Design
Optimal Correction Cost for Object Detection Evaluation
Does robustness on ImageNet transfer to downstream tasks?
Video Summarization Overview
Constrained Graphic Layout Generation via Latent Optimization
CanvasVAE: Learning to Generate Vector Graphic Documents
Uncovering Hidden Challenges in Query-Based Video Moment Retrieval
BERT representations for Video Question Answering