sequence_to_sequence_learning

Differences

This shows you the differences between two versions of the page.

sequence_to_sequence_learning [2023/12/25 04:22]
135.23.195.80 [Recommended Reading]
sequence_to_sequence_learning [2023/12/25 06:43] (current)
burkov [Recommended Reading]
Line 10:
   * [[http://ruder.io/deep-learning-nlp-best-practices/|Deep Learning for NLP Best Practices]] by Sebastian Ruder (2017)
   * [[https://arxiv.org/abs/1910.10683|Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer]] by Raffel et al. (2019) (the T5 paper)
 +  * [[https://arxiv.org/abs/2210.11416|Scaling Instruction-Finetuned Language Models]] by Chung et al. (2022) (the Flan-T5 paper)
  
 ===== Tutorials =====
  
   * [[https://blog.keras.io/a-ten-minute-introduction-to-sequence-to-sequence-learning-in-keras.html|A ten-minute introduction to sequence-to-sequence learning in Keras]] by Francois Chollet (2017), Oriol Vinyals and Quoc Le