You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Change fallback alert from DEBUG level to WARNING level and only do once.
* Add debug information for profiling cache right after warmup phase.
* Change the level of exception message during tactic profiling from ERROR to WARNING. And all exception details are pushed to DEBUG.
* Other trivial refinements.
Signed-off-by: Yukun He <[email protected]>
f"[Autotuner]: profiling chosen runner: {runners[runner_id]}{tactic} for {cache_key}"
428
+
f"[Autotuner] Profiling runner={runners[runner_id]}, tactic={tactic} for cache_key={cache_key}."
430
429
)
430
+
else:
431
+
logger.warning(
432
+
f"[Autotuner] No valid runner/tactic was found for custom_op={custom_op}, input_shapes={input_shapes}. "
433
+
f"At least one valid (runner, tactic) pair is required. "
434
+
f"If get_valid_tactics is intended to return empty list, please ensure that this profile is not valid for the custom_op "
435
+
f"and should not occurs during the inference stage, or fallback tactic is implemented. Otherwise, the the tuning process will crash."
436
+
)
437
+
438
+
# If failed profiling tactics occurs, log the error.
439
+
ifnew_tuning_failuer_occured:
440
+
logger.error(
441
+
f"[Autotuner] New tuning error occurs: Total failed profiling tactics occurs: {self.stats.failed_profiling_count[custom_op]} for custom_op={custom_op}. "
442
+
f"This will not block the tuning process. "
443
+
f"Please set TLLM_LOG_LEVEL=WARNING to find out when the tactic profiling fails. "
444
+
f"Set TLLM_LOG_LEVEL=DEBUG to get more details of the failures."
445
+
)
431
446
432
447
# Get the best runner and tactic from cache
433
448
# If no valid tactic is found, the fallback runner and tactic will be used
0 commit comments