Computer Vision

2025.11.17

Difficulty Controlled Diffusion Model for Synthesizing Effective Training Data

2024.9.27

The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization

2024.3.7

Makeup Prior Models for 3D Facial Makeup Estimation and Applications

2024.3.7

SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration

2024.1.17

Towards Diverse and Consistent Typography Generation

2024.1.16

Complementary-Contradictory Feature Regularization against Multimodal Overfitting

2023.9.27

Multimodal color recommendation in vector graphic documents

2023.7.31

Dissecting multimodal learning via regularized masking of multimodal features

2023.7.24

Supervised Layer Decomposition of Graphic Designs

2023.5.18

iMixer: invertible, implicit, and iterative MLP-Mixer from modern Hopfield networks

2023.2.1

Color Recommendation for Vector Graphic Documents based on Multi-Palette Representation

2023.2.1

Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization

2022.9.14

AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval

2022.2.2

An Intelligent Color Recommendation Tool for Landing Page Design

2022.2.1

Optimal Correction Cost for Object Detection Evaluation

2022.2.1

Does robustness on ImageNet transfer to downstream tasks?

2022.2.1

Video Summarization Overview

2021.8.24

Constrained Graphic Layout Generation via Latent Optimization

2021.8.24

CanvasVAE: Learning to Generate Vector Graphic Documents

2020.9.15

Uncovering Hidden Challenges in Query-Based Video Moment Retrieval

2020.5.12

BERT representations for Video Question Answering