This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | |||
sequence_to_sequence_learning [2023/12/25 04:22] 135.23.195.80 [Recommended Reading] |
sequence_to_sequence_learning [2023/12/25 06:43] (current) burkov [Recommended Reading] |
||
---|---|---|---|
Line 10: | Line 10: | ||
* [[http://ruder.io/deep-learning-nlp-best-practices/|Deep Learning for NLP Best Practices]] by Sebastian Ruder (2017) | * [[http://ruder.io/deep-learning-nlp-best-practices/|Deep Learning for NLP Best Practices]] by Sebastian Ruder (2017) | ||
* [[https://arxiv.org/abs/1910.10683|Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer]] by Raffel at all (2019) (the T5 paper) | * [[https://arxiv.org/abs/1910.10683|Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer]] by Raffel at all (2019) (the T5 paper) | ||
+ | * [[https://arxiv.org/abs/2210.11416|Scaling Instruction-Finetuned Language Models]] by Raffel at all (2019) (the Flan-T5 paper) | ||
===== Tutorials ===== | ===== Tutorials ===== | ||
* [[https://blog.keras.io/a-ten-minute-introduction-to-sequence-to-sequence-learning-in-keras.html|A ten-minute introduction to sequence-to-sequence learning in Keras]] by Francois Chollet (2017), Oriol Vinyals and Quoc Le | * [[https://blog.keras.io/a-ten-minute-introduction-to-sequence-to-sequence-learning-in-keras.html|A ten-minute introduction to sequence-to-sequence learning in Keras]] by Francois Chollet (2017), Oriol Vinyals and Quoc Le |