Efficient Estimation of Word Representations in Vector Space
- Tomas MikolovKai ChenG. CorradoJ. Dean
- 16 January 2013
Computer Science
Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed and it is shown that these vectors provide state-of-the-art performance on the authors' test set for measuring syntactic and semantic word similarities.
Distributed Representations of Words and Phrases and their Compositionality
- Tomas MikolovI. SutskeverKai ChenG. CorradoJ. Dean
- 16 October 2013
Computer Science
This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.
Distilling the Knowledge in a Neural Network
- Geoffrey E. HintonO. VinyalsJ. Dean
- 9 March 2015
Computer Science
This work shows that it can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model and introduces a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse.
TensorFlow: A system for large-scale machine learning
- Martín AbadiP. Barham Xiaoqiang Zhang
- 27 May 2016
Computer Science
The TensorFlow dataflow model is described and the compelling performance that TensorFlow achieves for several real-world applications is demonstrated.
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
- Martín AbadiAshish Agarwal Xiaoqiang Zheng
- 14 March 2016
Computer Science, Engineering
The TensorFlow interface and an implementation of that interface that is built at Google are described, which has been used for conducting research and for deploying machine learning systems into production across more than a dozen areas of computer science and other fields.
Bigtable: a distributed storage system for structured data
- Fay W. ChangJ. Dean R. Gruber
- 6 November 2006
Computer Science
The simple data model provided by Bigtable is described, which gives clients dynamic control over data layout and format, and the design and implementation of Bigtable are described.
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
- Yonghui WuM. Schuster J. Dean
- 26 September 2016
Computer Science, Linguistics
GNMT, Google's Neural Machine Translation system, is presented, which attempts to address many of the weaknesses of conventional phrase-based translation systems and provides a good balance between the flexibility of "character"-delimited models and the efficiency of "word"-delicited models.
Large Scale Distributed Deep Networks
- J. DeanG. Corrado A. Ng
- 3 December 2012
Computer Science
This paper considers the problem of training a deep network with billions of parameters using tens of thousands of CPU cores and develops two algorithms for large-scale distributed training, Downpour SGD and Sandblaster L-BFGS, which increase the scale and speed of deep network training.
DeViSE: A Deep Visual-Semantic Embedding Model
- Andrea FromeG. Corrado Tomas Mikolov
- 5 December 2013
Computer Science
This paper presents a new deep visual-semantic embedding model trained to identify visual objects using both labeled image data as well as semantic information gleaned from unannotated text and shows that the semantic information can be exploited to make predictions about tens of thousands of image labels not observed during training.
The Case for Learned Index Structures
- Tim KraskaAlex BeutelEd H. ChiJ. DeanN. Polyzotis
- 4 December 2017
Computer Science
SIGMOD Conference
The idea of replacing core components of a data management system through learned models has far reaching implications for future systems designs and that this work provides just a glimpse of what might be possible.
...
...