Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI 13 hours ago aws.amazon.com - machine-learning