Differences

This shows you the differences between two versions of the page.

--- sequence_to_sequence_learning [2023/12/25 04:22]
135.23.195.80 [Recommended Reading]
+++ sequence_to_sequence_learning [2023/12/25 06:43] (current)
burkov [Recommended Reading]
@@ Line 10: / Line 10: @@
   * [[http://ruder.io/deep-learning-nlp-best-practices/|Deep Learning for NLP Best Practices]] by Sebastian Ruder (2017)
   * [[https://arxiv.org/abs/1910.10683|Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer]] by Raffel at all (2019) (the T5 paper)
+  * [[https://arxiv.org/abs/2210.11416|Scaling Instruction-Finetuned Language Models]] by Raffel at all (2019) (the Flan-T5 paper)
 ===== Tutorials =====
   * [[https://blog.keras.io/a-ten-minute-introduction-to-sequence-to-sequence-learning-in-keras.html|A ten-minute introduction to sequence-to-sequence learning in Keras]] by Francois Chollet (2017), Oriol Vinyals and Quoc Le

The Hundred-Page Machine Learning Book