* [Tensor] activation is an attr of ColoTensor * [Tensor] add optimizer * only detach parameters in context * polish code