Meta-Transfer Learning for Few-Shot Learning

Qianru Sun; Yaoyao Liu; Tat‐Seng Chua; Bernt Schiele

doi:10.1109/cvpr.2019.00049

Abstract

Meta-learning has been proposed as a framework to address the challenging few-shot learning setting. The key idea is to leverage a large number of similar few-shot tasks in order to learn how to adapt a base-learner to a new task for which only a few labeled samples are available. As deep neural networks (DNNs) tend to overfit using a few samples only, meta-learning typically uses shallow neural networks (SNNs), thus limiting its effectiveness. In this paper we propose a novel few-shot learning method called meta-transfer learning (MTL) which learns to adapt a deep NN for few shot learning tasks. Specifically, "meta" refers to training multiple tasks, and "transfer" is achieved by learning scaling and shifting functions of DNN weights for each task. In addition, we introduce the hard task (HT) meta-batch scheme as an effective learning curriculum for MTL. We conduct experiments using (5-class, 1-shot) and (5-class, 5-shot) recognition tasks on two challenging few-shot learning benchmarks: miniImageNet and Fewshot-CIFAR100. Extensive comparisons to related works validate that our meta-transfer learning approach trained with the proposed HT meta-batch scheme achieves top performance. An ablation study also shows that both components contribute to fast convergence and high accuracy.

Keywords

Computer scienceArtificial intelligenceMeta learning (computer science)Transfer of learningMachine learningLeverage (statistics)Multi-task learningOverfittingDeep learningArtificial neural networkTask (project management)

Affiliated Institutions

Related Publications

A Survey on Multi-Task Learning

Yu Zhang , Qiang Yang

Multi-Task Learning (MTL) is a learning paradigm in machine learning and its aim is to leverage useful information contained in multiple related tasks to help improve the genera...

2021 IEEE Transactions on Knowledge and Da... 1864 citations

Pedestrian detection aided by deep learning semantic tasks

Yonglong Tian , Ping Luo , Xiaogang Wang +1 more

Deep learning methods have achieved great successes in pedestrian detection, owing to its ability to learn discriminative features from raw pixels. However, they treat pedestria...

2015 418 citations

Fully Convolutional Networks for Semantic Segmentation

Evan Shelhamer , Jonathan Long , Trevor Darrell

Convolutional networks are powerful visual models that yield hierarchies of features. We show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, im...

2016 IEEE Transactions on Pattern Analysis... 10715 citations

MegDet: A Large Mini-Batch Object Detector

Chao Peng , Tete Xiao , Zeming Li +5 more

The development of object detection in the era of deep learning, from R-CNN [11], Fast/Faster R-CNN [10, 31] to recent Mask R-CNN [14] and RetinaNet [24], mainly come from novel...

2018 316 citations

Heterogeneous Graph Neural Network

Chuxu Zhang , Dongjin Song , Chao Huang +2 more

Representation learning in heterogeneous graphs aims to pursue a meaningful vector representation for each node so as to facilitate downstream applications such as link predicti...

2019 1375 citations

Publication Info

Year: 2019
Type: article
Pages: 403-412
Citations: 1224
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Meta-Transfer Learning for Few-Shot Learning

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1224

OpenAlex

Cite This

APA Style

                            
                                    Qianru Sun, 
                                
                                    Yaoyao Liu, 
                                
                                    Tat‐Seng Chua
                                
                                et al.
                            
                            (2019). 
                            Meta-Transfer Learning for Few-Shot Learning. 
                            
                            , 403-412.
                            https://doi.org/10.1109/cvpr.2019.00049

Identifiers

DOI: 10.1109/cvpr.2019.00049