* Move regression tests to evaluation/ * use pythnon instead of docker in the script * add model para * change python to python3 * bug fix