model:use the heavy model (8.6M parameters) or the lighter one with 2M parameters
