Learning Transferable Features with Deep Adaptation Networks

Abstract

Recent studies reveal that a deep neural network can learn transferable features which generalize well to novel tasks for domain adaptation. However, as deep features eventually transition from general to specific along the network, feature transferability drops significantly in higher layers as domain discrepancy increases. Hence, it is important to formally reduce the dataset bias and enhance the transferability in task-specific layers. In this paper, we propose a new Deep Adaptation Network (DAN) architecture, which generalizes deep convolutional neural networks to the domain adaptation scenario. In DAN, hidden representations of all task-specific layers are embedded in a reproducing kernel Hilbert space where the mean embeddings of different domain distributions can be explicitly matched. The domain discrepancy is further reduced using an optimal multi-kernel selection method for mean embedding matching. DAN can learn transferable features with statistical guarantees, and can scale linearly through an unbiased estimate of the kernel embedding. Extensive empirical evidence shows that the proposed architecture yields state-of-the-art image classification error rates on standard domain adaptation benchmarks.
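To make the mean embedding matching described in the abstract concrete, here is a minimal sketch of a linear-time, unbiased estimate of the squared maximum mean discrepancy (MMD) between source and target features under a mixture of Gaussian kernels. It uses only the Python standard library. The function names, the bandwidths, and the equal kernel weights are illustrative assumptions. The abstract mentions an optimal multi-kernel selection method, which is not reproduced here, and the paired-sample estimator shown is one standard linear-time form, assumed rather than taken from this page.

```python
import math
import random


def gaussian_kernel(x, y, bandwidth):
    """Gaussian (RBF) kernel exp(-||x - y||^2 / (2 * bandwidth^2))."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def multi_kernel(x, y, bandwidths, weights):
    """Convex combination of Gaussian kernels (weights assumed to sum to 1)."""
    return sum(w * gaussian_kernel(x, y, b) for w, b in zip(weights, bandwidths))


def linear_time_mmd2(source, target, bandwidths, weights):
    """Linear-time unbiased estimate of squared MMD between two samples.

    Consecutive pairs of source and target points form one term each,
    so the cost grows linearly with the number of samples.
    """
    n = min(len(source), len(target))
    n -= n % 2  # use an even number of points from each domain
    if n < 2:
        raise ValueError("need at least two samples from each domain")

    def k(x, y):
        return multi_kernel(x, y, bandwidths, weights)

    total = 0.0
    for i in range(0, n, 2):
        xs1, xs2 = source[i], source[i + 1]
        xt1, xt2 = target[i], target[i + 1]
        # within-domain similarity minus cross-domain similarity
        total += k(xs1, xs2) + k(xt1, xt2) - k(xs1, xt2) - k(xs2, xt1)
    return total / (n // 2)


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical 2-D "features": target is a shifted copy of the source distribution.
    src = [[rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)] for _ in range(200)]
    tgt = [[rng.gauss(1.5, 1.0), rng.gauss(1.5, 1.0)] for _ in range(200)]
    bandwidths = [0.5, 1.0, 2.0]      # assumed values
    weights = [1.0 / 3.0] * 3         # assumed equal weights
    print("MMD^2 estimate:", linear_time_mmd2(src, tgt, bandwidths, weights))
```

In a network such as the one the abstract describes, a term of this kind would be computed on the hidden representations of each task-specific layer and added to the classification loss. How the terms are weighted against the loss is not stated on this page.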

Keywords

Computer science, Artificial intelligence, Domain adaptation, Convolutional neural network, Kernel (algebra), Embedding, Pattern recognition (psychology), Deep learning, Domain (mathematical analysis), Task (project management), Machine learning, Artificial neural network, Reproducing kernel Hilbert space, Feature (linguistics), Theoretical computer science, Mathematics, Hilbert space

Related Publications

Deep Convolutional Transfer Learning Network: A New Method for Intelligent Fault Diagnosis of Machines With Unlabeled Data

Liang Guo, Yaguo Lei, Saibo Xing +2 more

The success of intelligent fault diagnosis of machines relies on the following two conditions: 1) labeled data with fault information are available; and 2) the training and test...

2018 IEEE Transactions on Industrial Elect... 1138 citations

Learning Deep Architectures for AI

Yoshua Bengio

Can machine learning deliver AI? Theoretical results, inspiration from the brain and cognition, as well as machine learning experiments suggest that in order to learn the kind o...

2009 now publishers, Inc. eBooks 5043 citations

Going deeper with convolutions

Christian Szegedy, Wei Liu, Yangqing Jia +6 more

We propose a deep convolutional neural network architecture codenamed Inception that achieves the new state of the art for classification and detection in the ImageNet Large-Sca...

2015 45596 citations

Deep Domain Confusion: Maximizing for Domain Invariance

Eric Tzeng, Judy Hoffman, Ning Zhang +2 more

Recent reports suggest that a generic supervised deep CNN model trained on a large-scale dataset reduces, but does not remove, dataset bias on a standard benchmark. Fine-tuning ...

2014 arXiv (Cornell University) 2347 citations

Hybrid Task Cascade for Instance Segmentation

Kai Chen, Wanli Ouyang, Chen Change Loy +9 more

Cascade is a classic yet powerful architecture that has boosted performance on various tasks. However, how to introduce cascade to instance segmentation remains an open question...

2019 1411 citations

Publication Info

Year: 2015
Type: preprint
Citations: 1222
Access: Closed

Citation Metrics

1222 (OpenAlex)

Cite This

APA Style

Mingsheng Long, Yue Cao, Jianmin Wang, et al. (2015). Learning Transferable Features with Deep Adaptation Networks. arXiv (Cornell University).