Online multi-object tracking by providing a comprehensive multi-modal structure based on deep learning