Tag inference latency optimization