NAME
train.pl - Train a sparse wide and deep classification model
SYNOPSIS
--log-interval  number of batches to wait between logging training status (default: 100)
--optimizer     optimizer to use (default: 'adam')
--cuda          train on GPU with CUDA (default: 0)
--num-epoch     number of epochs to train (default: 10)
--batch-size    number of examples per batch (default: 100)
--lr            learning rate (default: 0.001)
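
The option list above could be parsed along the following lines. This is only a minimal sketch, not the script's actual code: it assumes the core Getopt::Long module, the helper name parse_args is hypothetical, and the argument types (integer for --cuda, float for --lr, and so on) are inferred from the listed defaults rather than confirmed by the source.

```perl
use strict;
use warnings;
use Getopt::Long qw(GetOptionsFromArray);

# Hypothetical helper: parse an array of command-line arguments and
# return a hash reference of options, pre-filled with the defaults
# from the option list.
sub parse_args {
    my ($argv) = @_;
    my %opt = (
        'log-interval' => 100,
        'optimizer'    => 'adam',
        'cuda'         => 0,
        'num-epoch'    => 10,
        'batch-size'   => 100,
        'lr'           => 0.001,
    );
    # Option types are assumptions: =i integer, =s string, =f float.
    GetOptionsFromArray(
        $argv, \%opt,
        'log-interval=i',
        'optimizer=s',
        'cuda=i',
        'num-epoch=i',
        'batch-size=i',
        'lr=f',
    ) or die "invalid command-line options\n";
    return \%opt;
}

# Override two options; the rest keep their defaults.
my $opt = parse_args([qw(--cuda 1 --lr 0.01)]);
printf "optimizer=%s cuda=%d lr=%g batch-size=%d\n",
    @{$opt}{qw(optimizer cuda lr batch-size)};
```

Under these assumptions, an invocation such as `perl train.pl --cuda 1 --lr 0.01` would train on the GPU with a learning rate of 0.01 and leave every other option at its default.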