How to deploy and fine-tune DeepSeek models on AWS
… I have been trying to deploy deepseek-ai/DeepSeek-R1-Distill-Qwen-32B on inferentia with a context window higher than 4096 let's say MAX TOTAL TOKENS=8192 , but it seems there is no pre-compiled model for that. …