The new Aegaeon system can serve dozens of large language models using a fraction of the GPUs previously required, ...