Research Assistant
AI-powered research paper analysis, summarization, and literature review
Computer Science2017
87,432 citations
Attention Is All You Need
Vaswani et al.
We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely.
NLP Deep Learning Transformers
Computer Science2016
145,000 citations
Deep Residual Learning for Image Recognition
He et al.
We present a residual learning framework to ease the training of networks that are substantially deeper than those used previously.
Computer Vision CNN ResNet
Computer Science2018
62,000 citations
BERT: Pre-training of Deep Bidirectional Transformers
Devlin et al.
We introduce BERT, a new language representation model designed to pre-train deep bidirectional representations from unlabeled text.
NLP BERT Language Models