CS224N
Published on 2023-02-13
CS224N Lecture 10: Transformers and Pretraining
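Subword Modeling So far, we have trained our models under the basic assumption of a finite vocabulary. When we encounter words that do not appear in the "dictionary", we …
Published on 2023-02-13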
CS224N Lecture 9: Self-Attention and Transformers
Issues with RNN models Linear interaction distance: words that sh …
Published on 2023-02-06
CS224N Lecture 7: Translation, Seq2Seq, Attention
For languages for which there isn’t much parallel data available, comm …
Published on 2023-02-06
CS224N Lecture 6: Simple and LSTM RNNs
Training an RNN Language Model At each step, the model has the pr …
Published on 2023-01-16
CS224N Lecture 4: Syntactic Structure and Dependency Parsing
Context-Free Grammars (CFGs) Also called constituency or phrases …
Published on 2023-01-15
CS224N Lecture 3: Neural net learning: Gradients and algorithmically
This chapter mainly covers computing derivatives and the concepts and steps of the backpropagation algorithm Remember: Stochastic Gradi …
Published on 2023-01-15
CS224N Lecture 2: Word Vectors, Word Senses, and Neural Classifiers
Review: Main idea of word2vec Start with random word vectors Iter …
Published on 2023-01-07
CS224N Assignment 1: Exploring Word Vectors
Assignment 1 Word Vectors Part 1 Count-Based Word Vectors Constru …