Hi all! Attached is a graph of my latest attempt at transfer learning from a MobileNetV1 model already trained for 1 million steps, fine-tuned for 50,000 additional steps on my new dataset. The training-loss chart is very volatile and never converges or improves past a loss of ~0.3. My parameters are as follows:
Learning Rate=0.045
Label Smoothing=0.1
Learning Rate Decay Factor=0.98
Number of Epochs Per Decay=2.5
Moving Average Decay=0.9999
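For context, these parameter names match the flags of TF-Slim's `train_image_classifier.py`, so I'm assuming that's the script in use; a hedged sketch of the invocation (the dataset/checkpoint paths and scope names below are placeholders, not the actual values):

```shell
# Sketch only: paths, dataset_name, and scopes are hypothetical placeholders.
python train_image_classifier.py \
  --train_dir=/tmp/train_logs \
  --dataset_name=my_dataset \
  --dataset_dir=/tmp/my_dataset \
  --model_name=mobilenet_v1 \
  --checkpoint_path=/tmp/checkpoints/mobilenet_v1.ckpt \
  --checkpoint_exclude_scopes=MobilenetV1/Logits \
  --learning_rate=0.045 \
  --label_smoothing=0.1 \
  --learning_rate_decay_factor=0.98 \
  --num_epochs_per_decay=2.5 \
  --moving_average_decay=0.9999 \
  --max_number_of_steps=50000
```

Note that 0.045 is the from-scratch ImageNet learning rate; for fine-tuning, a much smaller value (e.g. on the order of 0.001-0.01) is commonly tried, which may be relevant to the volatility.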
Does anyone have suggestions or ideas about what's causing this behavior, or why training is not converging?