Paper page - A Causal Language Modeling Detour Improves Encoder Continued Pretraining
…Plan to release the code? I would like to try this with other models for domain adaptation 😃 · Hi @stefan-it, thank you very much! I will try to release it ASAP, until…
