This shows you the differences between two versions of the page.
sequence_to_sequence_learning [2023/07/14 00:13] 181.115.209.67 added space |
sequence_to_sequence_learning [2023/12/25 06:43] (current) burkov [Recommended Reading] |
||
---|---|---|---|
Line 7: | Line 7: | ||
===== Recommended Reading ===== | ===== Recommended Reading ===== | ||
- | * [[https://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks.pdf|Sequence to Sequence Learning with Neural Networks]] by Ilya Sutskever, Oriol Vinyals, and Quoc Le (2014) | + | * [[https://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks.pdf|Sequence to Sequence Learning with Neural Networks]] by Ilya Sutskever, Oriol Vinyals, and Quoc Le (2014). |
* [[http://ruder.io/deep-learning-nlp-best-practices/|Deep Learning for NLP Best Practices]] by Sebastian Ruder (2017) | * [[http://ruder.io/deep-learning-nlp-best-practices/|Deep Learning for NLP Best Practices]] by Sebastian Ruder (2017) | ||
+ | * [[https://arxiv.org/abs/1910.10683|Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer]] by Raffel et al. (2019) (the T5 paper) | ||
+ | * [[https://arxiv.org/abs/2210.11416|Scaling Instruction-Finetuned Language Models]] by Chung et al. (2022) (the Flan-T5 paper) | ||
===== Tutorials ===== | ===== Tutorials ===== | ||
* [[https://blog.keras.io/a-ten-minute-introduction-to-sequence-to-sequence-learning-in-keras.html|A ten-minute introduction to sequence-to-sequence learning in Keras]] by Francois Chollet (2017) | * [[https://blog.keras.io/a-ten-minute-introduction-to-sequence-to-sequence-learning-in-keras.html|A ten-minute introduction to sequence-to-sequence learning in Keras]] by Francois Chollet (2017) |