A context-dependent Typeahead prototype

A Typeahead (autocomplete) server in C++ (tapp) that suggests queries based on a combination of global scores and local similarity to a given context. The context is a set of words, which can be extracted from the previous query, for instance. The similarity between a query and the context is the inner product of the vector representations of their bags of words. In this code, words are represented by dense vectors computed by a word2vec package. Tf-idf vectors of the bags of words can be used instead, which may be faster because they are sparse.
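The scoring idea above can be sketched as follows. This is a minimal illustration, not the code used in tapp: it assumes a bag of words is represented by the sum of its word vectors, and that the global score and the similarity are combined linearly with a weight `alpha`. The names `bag_vector`, `inner_product`, `combined_score`, and `alpha` are hypothetical, and plain `std::vector` stands in for the armadillo types.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>
#include <vector>

using Vec = std::vector<double>;
using Embeddings = std::unordered_map<std::string, Vec>;

// Assumption: a bag of words is represented by the sum of its word vectors.
// Words without an embedding are skipped.
Vec bag_vector(const std::vector<std::string>& words,
               const Embeddings& emb, std::size_t dim) {
    Vec sum(dim, 0.0);
    for (const auto& w : words) {
        auto it = emb.find(w);
        if (it == emb.end()) continue;
        assert(it->second.size() == dim);
        for (std::size_t i = 0; i < dim; ++i) sum[i] += it->second[i];
    }
    return sum;
}

// Similarity between a query and the context: plain inner product.
double inner_product(const Vec& a, const Vec& b) {
    assert(a.size() == b.size());
    double s = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) s += a[i] * b[i];
    return s;
}

// Assumption: global score and local similarity are mixed linearly;
// alpha is a hypothetical tuning weight.
double combined_score(double global_score, double similarity, double alpha) {
    return global_score + alpha * similarity;
}
```

With sparse tf-idf vectors, the same inner product would only need to visit the words shared by the query and the context, which is where the speed-up would come from.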

The HTTP server is based on lib-face. The Typeahead server uses a naive tree data structure (a trie) whose nodes link to their child nodes through hash maps. Space usage could be reduced somewhat by using a ternary search tree instead.
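A tree with hash-map links to child nodes might look roughly like the sketch below. It is an illustration under assumptions, not the structure in src: the names `TrieNode`, `Trie`, `insert`, and `complete` are hypothetical, and completions are simply collected and sorted by global score, with no context similarity involved.

```cpp
#include <algorithm>
#include <memory>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// One node per prefix; children are reached through a hash map keyed by character.
struct TrieNode {
    std::unordered_map<char, std::unique_ptr<TrieNode>> children;
    bool is_query = false;  // true if a stored query ends at this node
    double score = 0.0;     // global score of that query
};

class Trie {
public:
    using Result = std::vector<std::pair<std::string, double>>;

    void insert(const std::string& query, double score) {
        TrieNode* node = &root_;
        for (char c : query) {
            auto& child = node->children[c];
            if (!child) child = std::make_unique<TrieNode>();
            node = child.get();
        }
        node->is_query = true;
        node->score = score;
    }

    // Returns every stored query starting with prefix, highest score first.
    Result complete(const std::string& prefix) const {
        Result out;
        const TrieNode* node = &root_;
        for (char c : prefix) {
            auto it = node->children.find(c);
            if (it == node->children.end()) return out;
            node = it->second.get();
        }
        std::string buf = prefix;
        collect(node, buf, out);
        std::sort(out.begin(), out.end(),
                  [](const auto& a, const auto& b) { return a.second > b.second; });
        return out;
    }

private:
    // Depth-first walk that rebuilds each query string in buf.
    static void collect(const TrieNode* node, std::string& buf, Result& out) {
        if (node->is_query) out.emplace_back(buf, node->score);
        for (const auto& kv : node->children) {
            buf.push_back(kv.first);
            collect(kv.second.get(), buf, out);
            buf.pop_back();
        }
    }

    TrieNode root_;
};
```

Each node here pays for a whole hash map even when it has a single child. A ternary search tree replaces that map with three pointers per node, which is the space saving mentioned above.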

Dependencies:

libuv and the joyent http-parser (already included in deps)
Armadillo library for matrix computation. You may need to comment out the line #define ARMA_USE_WRAPPER in your include_path/armadillo_bits/config.hpp.
OpenBLAS for fast matrix computation with Armadillo (change -L/opt/OpenBLAS/lib in the Makefile to your OpenBLAS location)
Boost (program_options) for command-line arguments

Data files:

query index file (not included), one query per line, with lines sorted in decreasing order of score, in the following format: <query_string><tab><comma_separated_list_of_tags_for_this_query><tab><score>
pretrained word2vec vectors: https://github.com/mmihaltz/word2vec-GoogleNews-vectors
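
As an illustration of the query index format, one line could be parsed as in the sketch below. The `QueryEntry` struct and the `parse_index_line` function are hypothetical names, not part of tapp, and the sketch assumes the score is a decimal number that `std::stod` can read.

```cpp
#include <cstddef>
#include <sstream>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical in-memory form of one index line.
struct QueryEntry {
    std::string query;
    std::vector<std::string> tags;
    double score = 0.0;
};

// Parses "<query_string>\t<tag1,tag2,...>\t<score>".
// Throws std::invalid_argument if a field is missing or the score is not a number.
QueryEntry parse_index_line(const std::string& line) {
    const std::size_t npos = std::string::npos;
    const std::size_t t1 = line.find('\t');
    const std::size_t t2 = (t1 == npos) ? npos : line.find('\t', t1 + 1);
    if (t2 == npos) {
        throw std::invalid_argument("expected three tab-separated fields");
    }

    QueryEntry entry;
    entry.query = line.substr(0, t1);

    // Split the middle field on commas, dropping empty tags.
    std::istringstream tag_stream(line.substr(t1 + 1, t2 - t1 - 1));
    for (std::string tag; std::getline(tag_stream, tag, ',');) {
        if (!tag.empty()) entry.tags.push_back(tag);
    }

    entry.score = std::stod(line.substr(t2 + 1));
    return entry;
}
```

Because the file is already sorted by decreasing score, a loader can read it top to bottom and stop early if only the highest-scoring queries are needed.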
