The current profiler is messy and we have to reorganize these code. Memory and speed profiler for both PatrickStar and PyTorch.