Interactive visualization of performance and quality metrics across LLMs and precision types
This dashboard compares several Large Language Models (LLMs) across three precision types: INT4, INT8, and FP16. It reports performance metrics (latency and throughput) alongside quality metrics such as BLEU score, ROUGE scores, and other text quality measurements.
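As a rough illustration of the two performance metrics named above, the sketch below times a text-generation callable and derives mean latency and token throughput. It is not the dashboard's own measurement code: `measure_latency_throughput`, `fake_generate`, and the whitespace-based token count are all assumptions made for the example, and a real benchmark would call an actual model and typically add warm-up runs.

```python
import time
from statistics import mean
from typing import Callable, Dict, List


def measure_latency_throughput(
    generate: Callable[[str], List[str]],
    prompts: List[str],
) -> Dict[str, float]:
    """Time a hypothetical generate(prompt) -> list-of-tokens callable."""
    if not prompts:
        raise ValueError("prompts must not be empty")

    latencies: List[float] = []
    total_tokens = 0
    for prompt in prompts:
        start = time.perf_counter()
        tokens = generate(prompt)
        latencies.append(time.perf_counter() - start)
        total_tokens += len(tokens)

    total_time = sum(latencies)
    return {
        # Average wall-clock seconds per request.
        "mean_latency_s": mean(latencies),
        # Generated tokens per second over all requests.
        "throughput_tokens_per_s": (
            total_tokens / total_time if total_time > 0 else float("inf")
        ),
    }


def fake_generate(prompt: str) -> List[str]:
    """Stand-in for a real model call (assumption for illustration only)."""
    time.sleep(0.001)  # simulate a small amount of inference time
    return prompt.split()  # treat whitespace-separated words as tokens


if __name__ == "__main__":
    stats = measure_latency_throughput(fake_generate, ["a b c", "d e"])
    print(stats)
```

Running the same function once per model and precision type would yield one row per combination, which is the kind of table a comparison like this one plots.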