Abstract

Recent studies reveal that a deep neural network can learn transferable features which generalize well to novel tasks for domain adaptation. However, as deep features eventually transition from general to specific along the network, feature transferability drops significantly in higher layers as domain discrepancy increases. Hence, it is important to formally reduce the dataset bias and enhance the transferability in task-specific layers. In this paper, we propose a new Deep Adaptation Network (DAN) architecture, which generalizes deep convolutional neural networks to the domain adaptation scenario. In DAN, hidden representations of all task-specific layers are embedded in a reproducing kernel Hilbert space where the mean embeddings of different domain distributions can be explicitly matched. The domain discrepancy is further reduced using an optimal multi-kernel selection method for mean embedding matching. DAN can learn transferable features with statistical guarantees, and can scale linearly using an unbiased estimate of the kernel embedding. Extensive empirical evidence shows that the proposed architecture yields state-of-the-art image classification error rates on standard domain adaptation benchmarks.
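
The abstract's idea of matching mean embeddings in a reproducing kernel Hilbert space with a linear-time unbiased estimate can be illustrated with a small NumPy sketch. It is not the authors' implementation. It assumes the standard linear-time estimator of the squared maximum mean discrepancy (MMD), which averages a four-term kernel statistic over disjoint pairs of source and target samples, and it assumes a mixture of Gaussian kernels with fixed, uniform weights in place of the optimal multi-kernel selection the abstract mentions. The function names, bandwidths, and weights are hypothetical choices made for illustration.

```python
import numpy as np


def mixed_gaussian_kernel(a, b, bandwidths, weights):
    """Row-wise convex combination of Gaussian kernels k(a_i, b_i)."""
    sq_dist = np.sum((a - b) ** 2, axis=1)  # shape (m,)
    # One Gaussian kernel per bandwidth: shape (num_kernels, m).
    per_kernel = np.exp(-sq_dist[None, :] / (2.0 * bandwidths[:, None] ** 2))
    return weights @ per_kernel  # shape (m,)


def linear_mk_mmd(source, target, bandwidths, weights=None):
    """Linear-time unbiased estimate of the squared multi-kernel MMD.

    Assumption: uniform kernel weights unless `weights` is given; the
    optimal weight selection described in the abstract is not implemented.
    """
    source = np.asarray(source, dtype=float)
    target = np.asarray(target, dtype=float)
    bandwidths = np.asarray(bandwidths, dtype=float)
    if weights is None:
        weights = np.full(len(bandwidths), 1.0 / len(bandwidths))
    weights = np.asarray(weights, dtype=float)

    # Use an even number of samples so they split into disjoint pairs.
    n = (min(len(source), len(target)) // 2) * 2
    if n < 2:
        raise ValueError("need at least two samples from each domain")
    s1, s2 = source[0:n:2], source[1:n:2]
    t1, t2 = target[0:n:2], target[1:n:2]

    def k(a, b):
        return mixed_gaussian_kernel(a, b, bandwidths, weights)

    # Four-term statistic per quad-tuple (s1, s2, t1, t2); its mean is
    # the estimate, computed in O(n) kernel evaluations.
    g = k(s1, s2) + k(t1, t2) - k(s1, t2) - k(s2, t1)
    return float(np.mean(g))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    src = rng.normal(size=(2000, 5))
    tgt_same = rng.normal(size=(2000, 5))
    tgt_shifted = rng.normal(loc=3.0, size=(2000, 5))
    print("same distribution:   ", linear_mk_mmd(src, tgt_same, [1.0, 2.0, 4.0]))
    print("shifted distribution:", linear_mk_mmd(src, tgt_shifted, [1.0, 2.0, 4.0]))
```

Because the estimate is unbiased, it can come out slightly negative when the two samples are drawn from the same distribution. In a network of the kind the abstract describes, a quantity like this would be computed on the hidden representations of each task-specific layer and added to the classification loss as a penalty.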

Keywords

Computer science, Artificial intelligence, Domain adaptation, Convolutional neural network, Kernel (algebra), Embedding, Pattern recognition (psychology), Deep learning, Domain (mathematical analysis), Task (project management), Machine learning, Artificial neural network, Reproducing kernel Hilbert space, Feature (linguistics), Theoretical computer science, Mathematics, Hilbert space

Publication Info

Year: 2015
Type: preprint
Citations: 1222
Access: Closed

Citation Metrics

1222 (OpenAlex)

Cite This

Mingsheng Long, Yue Cao, Jianmin Wang et al. (2015). Learning Transferable Features with Deep Adaptation Networks. arXiv (Cornell University).