Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI
aws.amazon.com - machine-learning