* [Statistical Machine Translation](http://mt-class.org) - a Machine Translation course with great assignments and slides.
* [Natural Language Processing SFU](http://www.cs.sfu.ca/~anoop/teaching/CMPT-413-Spring-2014/) - course by [Prof Anoop Sarkar](https://www.cs.sfu.ca/~anoop/) on Natural Language Processing. Good notes and some good YouTube lectures on HMMs.

## Deep Learning for NLP
[Stanford Natural Language Processing](https://class.coursera.org/nlp/lecture/preview)
Intro NLP course with videos. It does not cover deep learning, but it is a good primer on traditional NLP.

[Stanford CS 224D: Deep Learning for NLP class](http://cs224d.stanford.edu/syllabus.html)
[Richard Socher](https://scholar.google.com/citations?user=FaOcyfMAAAAJ&hl=en). (2015) Class with videos and slides.

[A Primer on Neural Network Models for Natural Language Processing](http://u.cs.biu.ac.il/~yogo/nnlp.pdf)
Yoav Goldberg. October 2015. No new material, but a useful 75-page summary of the state of the art.

## Codes
* [Online named entity recognition method for microtexts in social networking services: A case study of Twitter](http://arxiv.org/pdf/1301.2857.pdf)
### Word Vectors (partly from [DL4NLP](https://github.com/andrewt3000/DL4NLP))
Resources about word vectors, aka word embeddings, and distributed representations for words.
Word vectors are numeric representations of words that are often used as input to deep learning systems. Training these vectors on large amounts of unlabeled text before they are used in a downstream task is sometimes called pretraining.
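
As a quick, hand-rolled illustration (not taken from any of the links below), here is a minimal sketch of loading pretrained word2vec vectors and querying them with gensim's `KeyedVectors` interface; the file name is a placeholder for vectors downloaded separately, and exact keyword arguments vary slightly across gensim versions.

```python
from gensim.models import KeyedVectors

# Placeholder path: point this at pretrained vectors,
# e.g. the GoogleNews word2vec binary from the word2vec project page.
vectors = KeyedVectors.load_word2vec_format(
    "GoogleNews-vectors-negative300.bin", binary=True)

# Nearest neighbours in the embedding space.
print(vectors.most_similar("king", topn=3))

# The classic analogy: king - man + woman ~ queen.
print(vectors.most_similar(positive=["king", "woman"], negative=["man"], topn=1))
```
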
[Efficient Estimation of Word Representations in Vector Space](http://arxiv.org/pdf/1301.3781v3.pdf)
[Distributed Representations of Words and Phrases and their Compositionality](http://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf)
[Mikolov](https://scholar.google.com/citations?user=oBu8kMMAAAAJ&hl=en) et al. 2013.
Generates word and phrase vectors. Performs well on word similarity and analogy tasks and includes the [Word2Vec source code](https://code.google.com/p/word2vec/). Subsamples frequent words (i.e. frequent words like "the" are skipped periodically to speed up training and to improve the vectors of less frequent words).
[Word2Vec tutorial](http://tensorflow.org/tutorials/word2vec/index.html) in [TensorFlow](http://tensorflow.org/)
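
For intuition on that subsampling step, here is a tiny sketch of the discard probability from the paper; the threshold `t` and the example frequencies are illustrative values, not anything prescribed above.

```python
import math

def discard_prob(word_frequency, t=1e-5):
    # word2vec subsampling: P(discard w) = 1 - sqrt(t / f(w)),
    # where f(w) is the word's relative frequency in the corpus.
    return max(0.0, 1.0 - math.sqrt(t / word_frequency))

print(discard_prob(0.05))   # a very frequent word like "the" is dropped most of the time
print(discard_prob(1e-6))   # a rare word is never dropped
```
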
[Deep Learning, NLP, and Representations](http://colah.github.io/posts/2014-07-NLP-RNNs-Representations/)
Chris Olah (2014). Blog post explaining word2vec.

[GloVe: Global vectors for word representation](http://nlp.stanford.edu/projects/glove/glove.pdf)
Pennington, Socher, Manning. 2014. Creates word vectors and relates word2vec to matrix factorizations. The [evaluation section led to some controversy](http://rare-technologies.com/making-sense-of-word2vec/) raised by [Yoav Goldberg](https://plus.google.com/114479713299850783539/posts/BYvhAbgG8T2).
[GloVe source code and training data](http://nlp.stanford.edu/projects/glove/)
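
For reference, the weighted least-squares objective GloVe minimizes, in the paper's notation (X_ij counts co-occurrences of words i and j, and f is a weighting function that caps very frequent pairs):

```latex
J = \sum_{i,j=1}^{V} f(X_{ij}) \left( w_i^{\top} \tilde{w}_j + b_i + \tilde{b}_j - \log X_{ij} \right)^{2}
```
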
* [word2vec](http://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf) - on creating vectors to represent language, useful for RNN inputs
* [sense2vec](http://arxiv.org/abs/1511.06388) - on word sense disambiguation
* [Infinite Dimensional Word Embeddings](http://arxiv.org/abs/1511.05392) - new
* [Skip Thought Vectors](http://arxiv.org/abs/1506.06726) - sentence representation method
* [Adaptive skip-gram](http://arxiv.org/abs/1502.07257) - similar approach, with adaptive properties

### Thought Vectors (from [DL4NLP](https://github.com/andrewt3000/DL4NLP))
Thought vectors are numeric representations of sentences, paragraphs, and documents. The following papers are listed in order of publication date; each one supersedes the previous as the state of the art in sentiment analysis.

[Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.383.1327&rep=rep1&type=pdf)
Socher et al. 2013. Introduces the Recursive Neural Tensor Network. Uses a parse tree.

[Distributed Representations of Sentences and Documents](http://cs.stanford.edu/~quocle/paragraph_vector.pdf)
[Le](https://scholar.google.com/citations?user=vfT6-XIAAAAJ), Mikolov. 2014. Introduces Paragraph Vector, also known as paragraph2vec or doc2vec: each document gets its own vector, trained jointly with word vectors by averaging or concatenating it with them to predict surrounding words. Doesn't use a parse tree.
Implemented in [gensim](https://github.com/piskvorky/gensim/). See the [doc2vec tutorial](http://rare-technologies.com/doc2vec-tutorial/).
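
A minimal gensim sketch of the above; the toy corpus and hyperparameters are made up for illustration, and the attribute names follow recent gensim releases (older versions use `size` and `model.docvecs` instead of `vector_size` and `model.dv`).

```python
from gensim.models.doc2vec import Doc2Vec, TaggedDocument

# Toy corpus; in practice you would stream real, tokenized documents.
corpus = [
    TaggedDocument(words=["deep", "learning", "for", "nlp"], tags=["doc0"]),
    TaggedDocument(words=["classical", "statistical", "nlp", "methods"], tags=["doc1"]),
]

model = Doc2Vec(corpus, vector_size=50, min_count=1, epochs=40)

print(model.dv["doc0"][:5])                                         # learned paragraph vector for doc0
print(model.infer_vector(["neural", "models", "for", "nlp"])[:5])   # vector for an unseen document
```
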
[Deep Recursive Neural Networks for Compositionality in Language](http://www.cs.cornell.edu/~oirsoy/files/nips14drsv.pdf)
Irsoy & Cardie. 2014. Uses Deep Recursive Neural Networks. Uses a parse tree.
179
+
180
+
[Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks](https://aclweb.org/anthology/P/P15/P15-1150.pdf)
Tai et al. 2015. Introduces Tree LSTM. Uses a parse tree.

[Semi-supervised Sequence Learning](http://arxiv.org/abs/1511.01432)
Dai, Le 2015. "With pretraining, we are able to train long short term memory recurrent networks up to a few hundred timesteps, thereby achieving strong performance in many text classification tasks, such as IMDB, DBpedia and 20 Newsgroups."

## Machine Translation
[Neural Machine Translation by jointly learning to align and translate](http://arxiv.org/pdf/1409.0473v6.pdf)
Bahdanau, Cho, Bengio 2014. "comparable to the existing state-of-the-art phrase-based system on the task of English-to-French translation." Introduces an attention mechanism.
[English to French Demo](http://104.131.78.120/)
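
The core of that attention mechanism, in the paper's notation: the previous decoder state s_{i-1} scores every encoder annotation h_j, and the context vector c_i used to emit the next target word is their softmax-weighted sum.

```latex
e_{ij} = a(s_{i-1}, h_j), \qquad
\alpha_{ij} = \frac{\exp(e_{ij})}{\sum_{k=1}^{T_x} \exp(e_{ik})}, \qquad
c_i = \sum_{j=1}^{T_x} \alpha_{ij} h_j
```
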
[Sequence to Sequence Learning with Neural Networks](http://arxiv.org/pdf/1409.3215v3.pdf)
Sutskever, Vinyals, Le 2014. ([nips presentation](http://research.microsoft.com/apps/video/?id=239083)). Uses LSTM RNNs to generate translations. "Our main result is that on an English to French translation task from the WMT’14 dataset, the translations produced by the LSTM achieve a BLEU score of 34.8."
[seq2seq tutorial](http://tensorflow.org/tutorials/seq2seq/index.html) in [TensorFlow](http://tensorflow.org/).
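
To make the encode-then-decode idea concrete, here is a toy, untrained sketch in plain numpy: a single-layer tanh RNN with random weights and greedy decoding, a deliberate simplification of the paper's deep LSTMs that is only meant to show the data flow.

```python
import numpy as np

rng = np.random.default_rng(0)
V, H = 12, 16                                # toy vocabulary and hidden sizes
E   = rng.normal(scale=0.1, size=(V, H))     # embedding table
Wxh = rng.normal(scale=0.1, size=(H, H))
Whh = rng.normal(scale=0.1, size=(H, H))
Why = rng.normal(scale=0.1, size=(H, V))

def rnn_step(x, h):
    return np.tanh(x @ Wxh + h @ Whh)

def encode(src_ids):
    h = np.zeros(H)
    for i in src_ids:                        # read the source sentence left to right
        h = rnn_step(E[i], h)
    return h                                 # fixed-size summary of the source

def decode(h, bos=0, eos=1, max_len=10):
    out, prev = [], bos
    for _ in range(max_len):
        h = rnn_step(E[prev], h)
        probs = np.exp(h @ Why)
        probs /= probs.sum()
        prev = int(probs.argmax())           # greedy decoding
        if prev == eos:
            break
        out.append(prev)
    return out

print(decode(encode([3, 5, 7])))             # gibberish ids until the weights are trained
```
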
### Single Exchange Dialogs (from [DL4NLP](https://github.com/andrewt3000/DL4NLP))
[A Neural Network Approach to Context-Sensitive Generation of Conversational Responses](http://arxiv.org/pdf/1506.06714v1.pdf)
Sordoni et al. 2015. Generates responses to tweets.
Uses the [Recurrent Neural Network Language Model (RLM) architecture of (Mikolov et al., 2010)](http://www.fit.vutbr.cz/research/groups/speech/publi/2010/mikolov_interspeech2010_IS100722.pdf). Source code: [RNNLM Toolkit](http://www.rnnlm.org/)

[Neural Responding Machine for Short-Text Conversation](http://arxiv.org/pdf/1503.02364v2.pdf)
Shang et al. 2015. Uses Neural Responding Machine. Trained on Weibo dataset. Achieves one-round conversations with 75% appropriate responses.

[A Neural Conversational Model](http://arxiv.org/abs/1506.05869)
Vinyals, [Le](https://scholar.google.com/citations?user=vfT6-XIAAAAJ) 2015. Uses LSTM RNNs to generate conversational responses. Uses the [seq2seq framework](http://tensorflow.org/tutorials/seq2seq/index.html). Seq2Seq was originally designed for machine translation and it "translates" a single sentence, up to around 79 words, to a single-sentence response, and has no memory of previous dialog exchanges. Used in Google's [Smart Reply feature for Inbox](http://googleresearch.blogspot.co.uk/2015/11/computer-respond-to-this-email.html).

### Memory and Attention Models (from [DL4NLP](https://github.com/andrewt3000/DL4NLP))
[Reasoning, Attention and Memory (RAM) workshop at NIPS 2015 (slides included)](http://www.thespermwhale.com/jaseweston/ram/)

[Memory Networks](http://arxiv.org/pdf/1410.3916v10.pdf) Weston et al. 2014, and
[End-To-End Memory Networks](http://arxiv.org/pdf/1503.08895v4.pdf) Sukhbaatar et al. 2015.
Memory networks are implemented in [MemNN](https://github.com/facebook/MemNN). Attempts to solve the task of reasoning with attention and memory.
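
A rough sketch of the single-hop memory addressing these papers build on: score an embedded question against embedded memories, softmax over the slots, and read out a weighted sum. The dimensions and random data here are placeholders, not anything from the papers.

```python
import numpy as np

def memory_read(query, memories):
    # One dot-product match score per stored memory, softmax over slots,
    # then return the attention-weighted read vector.
    scores = memories @ query
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ memories, weights

memories = np.random.randn(5, 8)      # 5 embedded sentences, 8-dimensional
query = np.random.randn(8)            # embedded question
read_vector, attention = memory_read(query, memories)
print(attention)                      # which memories the model would attend to
```
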
[Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks](http://arxiv.org/pdf/1502.05698v7.pdf)
Weston 2015. Classifies QA tasks (single factoid, yes/no, etc.). Extends memory networks.
[Evaluating prerequisite qualities for learning end to end dialog systems](http://arxiv.org/pdf/1511.06931.pdf)
Dodge et al. 2015. Tests Memory Networks on four tasks, including a Reddit dialog task.
See [Jason Weston lecture on MemNN](https://www.youtube.com/watch?v=Xumy3Yjq4zk)