The article proposes a new, simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. The model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles, by over 2 BLEU. The Transformer also generalizes well to other tasks.
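The core operation the Transformer is built from is scaled dot-product attention. A minimal sketch in NumPy, with illustrative shapes (not the paper's exact multi-head implementation):

```python
# Minimal sketch of scaled dot-product attention:
# Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # similarity of each query to each key
    # row-wise softmax (subtract the max for numerical stability)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V               # each output is a weighted sum of values

Q = np.random.rand(3, 4)  # 3 queries, d_k = 4
K = np.random.rand(5, 4)  # 5 keys
V = np.random.rand(5, 4)  # 5 values
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (3, 4): one output vector per query
```

Each output row is a convex combination of the value rows, with weights set by query-key similarity.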
Learning to Compare: Relation Network for Few-Shot Learning
A general framework for few-shot learning. The method, called the Relation Network (RN), is trained end-to-end from scratch. Besides improving performance on few-shot learning, the framework extends easily to zero-shot learning.
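The idea can be sketched as follows: embed a support example and a query example, concatenate the embeddings, and let a small learned "relation module" output a similarity score. The weights and layer shapes below are toy illustrations, not the paper's architecture:

```python
# Sketch of the Relation Network idea: a learned relation module
# scores how well a query matches a support example.
import numpy as np

rng = np.random.default_rng(0)

def embed(x, W):
    """Toy embedding module: one linear layer followed by ReLU."""
    return np.maximum(0, W @ x)

def relation_score(support, query, W_embed, W_rel):
    """Concatenate the two embeddings and map them to a score in [0, 1]."""
    pair = np.concatenate([embed(support, W_embed), embed(query, W_embed)])
    return 1.0 / (1.0 + np.exp(-(W_rel @ pair)))  # sigmoid

# Toy weights; in the real method both modules are learned end-to-end.
W_embed = rng.standard_normal((8, 16))
W_rel = rng.standard_normal(16)
s = relation_score(rng.standard_normal(16), rng.standard_normal(16),
                   W_embed, W_rel)
print(0.0 <= s <= 1.0)  # True
```

At test time, a query is assigned the class of the support example (or class prototype) with the highest relation score.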
Enriching Word Vectors with Subword Information
Popular models that learn word representations ignore the morphology of words by assigning a distinct vector to each word. The article proposes a new approach based on the skipgram model, where each word is represented as a bag of character n-grams. A vector representation is associated with each character n-gram, and words are represented as the sum of these representations.
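A minimal sketch of the subword scheme: extract the character n-grams of a word (with boundary markers `<` and `>`) and sum their vectors. The toy random vectors stand in for the learned ones, and the real model's hashing and vocabulary details are omitted:

```python
# Sketch of the bag-of-character-n-grams word representation.
import numpy as np

def char_ngrams(word, n_min=3, n_max=6):
    """All character n-grams of '<word>', including boundary symbols."""
    w = f"<{word}>"
    return [w[i:i + n] for n in range(n_min, n_max + 1)
            for i in range(len(w) - n + 1)]

dim = 8
rng = np.random.default_rng(42)
vectors = {}  # n-gram -> vector, filled lazily with toy random values

def word_vector(word):
    grams = char_ngrams(word)
    for g in grams:
        vectors.setdefault(g, rng.standard_normal(dim))
    return sum(vectors[g] for g in grams)  # word = sum of n-gram vectors

print(char_ngrams("where", 3, 3))  # ['<wh', 'whe', 'her', 'ere', 're>']
print(word_vector("where").shape)  # (8,)
```

Because vectors are shared across all words containing an n-gram, rare and unseen words still get meaningful representations from their subword pieces.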