self-adaptive latency pruning