Skip to content

samiroid/usr2vec

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Usr2Vec

Code for learning user representations as described in the paper Modelling Context with User Embeddings for Sarcasm Detection in Social Media paper

requirements:

Running the code

  1. pretrain word embeddings using [gensim] (https://radimrehurek.com/gensim/models/word2vec.html) with the hierarchical softmax option (see the [documention] (https://radimrehurek.com/gensim/models/word2vec.html) on how to do this--tl;dr set the flag hs=1). Save the embeddings in binary format.
  2. clone or download the [my_utils] (https://github.com/samiroid/utils) module
  3. edit file setup.sh to change the paths to my_utils and the word embeddings; run setup.sh
  4. edit file build_data.sh to change the paths to the word embeddings and the file containing the user's tweets; run build_data.sh
  5. edit file run.sh to change the paths to the word embeddings (binary format) and the ouput user embeddings; run run.sh
  6. kick-back and relax :)

About

Learning User Representations

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published