Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers
… Happy to share my demo if useful: https://www.linkedin.com/posts/dr-mm-alam-93991120b demofirst-aichips-edgeai-activity-7381674484098883584-0Rwn/?utm source=share&utm medium=member desktop&rcm=ACoAADVZuP0BheDJgKL8dWk-bNo7Yd4zhsOnNL4 PyTorch now natively supports Flash Attention. …