Theano: A Python framework for fast computation of mathematical expressions
- Rami Al-RfouGuillaume Alain Ying Zhang
- 9 May 2016
Computer Science, Mathematics
The performance of Theano is compared against Torch7 and TensorFlow on several machine learning models and recently-introduced functionalities and improvements are discussed.
Adversarial Learning for Neural Dialogue Generation
- Jiwei LiWill MonroeTianlin ShiSébastien JeanAlan RitterDan Jurafsky
- 23 January 2017
Computer Science
This work applies adversarial training to open-domain dialogue generation, training a system to produce sequences that are indistinguishable from human-generated dialogue utterances, and investigates models for adversarial evaluation that uses success in fooling an adversary as a dialogue evaluation metric, while avoiding a number of potential pitfalls.
On Using Very Large _target Vocabulary for Neural Machine Translation
- Sébastien JeanKyunghyun ChoR. MemisevicYoshua Bengio
- 5 December 2014
Computer Science
It is shown that decoding can be efficiently done even with the model having a very large _target vocabulary by selecting only a small subset of the whole _target vocabulary.
Does Neural Machine Translation Benefit from Larger Context?
- Sébastien JeanStanislas LaulyOrhan FiratKyunghyun Cho
- 17 April 2017
Computer Science, Linguistics
We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation…
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
- Jonathan ShenPatrick Nguyen Pat Rondon
- 21 February 2019
Computer Science
This document outlines the underlying design of Lingvo and serves as an introduction to the various pieces of the framework, while also offering examples of advanced features that showcase the capabilities of the Framework.
Combining modality specific deep neural networks for emotion recognition in video
- Samira Ebrahimi KahouC. Pal Zhenzhou Wu
- 9 December 2013
Computer Science
In this paper we present the techniques used for the University of Montréal's team submissions to the 2013 Emotion Recognition in the Wild Challenge. The challenge is to classify the emotions…
EmoNets: Multimodal deep learning approaches for emotion recognition in video
- Samira Ebrahimi KahouXavier Bouthillier Yoshua Bengio
- 5 March 2015
Computer Science
This paper explores multiple methods for the combination of cues from these modalities into one common classifier, which achieves a considerably greater accuracy than predictions from the strongest single-modality classifier.
Neural Machine Translation Systems for WMT ’ 15
- Sébastien JeanOrhan FiratKyunghyun ChoR. MemisevicYoshua Bengio
- 2015
Computer Science, Linguistics
The Montreal Institute for Learning Algorithms (MILA) submission to WMT’15 is to evaluate this new approach to NMT on a greater variety of language pairs, using the RNNsearch architecture, which adds an attention mechanism to the encoderdecoder.
Adaptive Scheduling for Multi-Task Learning
- Sébastien JeanOrhan FiratMelvin Johnson
- 13 September 2019
Computer Science
This work first considers existing non-adaptive techniques, then moves on to adaptive schedules that over-sample tasks with poorer results compared to their respective baseline, and considers implicit schedules, learning to scale learning rates or gradients of individual tasks instead.
Embedding Word Similarity with Neural Machine Translation
- Felix HillKyunghyun ChoSébastien JeanColine DevinYoshua Bengio
- 19 December 2014
Computer Science, Linguistics
Embeddings from translation models outperform those learned by monolingual models at tasks that require knowledge of both conceptual similarity and lexical-syntactic role, and are indicated that translation-based embeddings should be used in applications that require concepts to be organised according to similarity and/or lexical function.
...
...