Distributed Negative Sampling For Word Embeddings

Publication

Feb 6, 2017

Abstract

Word2Vec recently popularized dense vector word representations as fixed-length features for machine learning algorithms and is in widespread use today. In this paper we investigate one of its core components, Negative Sampling, and propose efficient distributed algorithms that allow us to scale to vocabulary sizes of more than 1 billion unique words and corpus sizes of more than 1 trillion words.

Venue:

Thirty-First AAAI Conference on Artificial Intelligence (AAAI 2017)

Type:

Conference/Workshop Paper

Authors:

Zygimantas Straznickas
Rolina Wu
Kostas Tsioutsiouliklis

BibTeX

@inproceedings{
  author = {Zygimantas Straznickas and Rolina Wu and Kostas Tsioutsiouliklis},
  title = {Distributed Negative Sampling For Word Embeddings},
  booktitle = {Proceedings of Thirty-First AAAI Conference on Artificial Intelligence},
  year = {2017}
}

