* changed builtin clock to wall_clock64 * updated npkit_Trace_generator to the new version of npkit
* Improve clock calibration in NPKit * Improve gfx macro * Fix macro
* apply npkit * fix bug * add npkit in readme