GitHub / marcomoldovan / hierarchical-language-modeling
We address the task of learning contextualized word, sentence, and document representations with a hierarchical language model that stacks Transformer-based encoders first at the sentence level and then at the document level and performs masked token prediction.
Stars: 6
Forks: 0
Open issues: 5
License: MIT
Language: Jupyter Notebook
Size: 6.83 MB
Created at: almost 5 years ago
Updated at: almost 2 years ago
Pushed at: almost 2 years ago
Last synced at: over 1 year ago
Topics: attention-mechanism, deep-learning, document-embedding, document-retrieval, information-retrieval, language-model, machine-learning, natural-language-processing, natural-language-understanding, pytorch, representation-learning, sentence-embeddings, transfer-learning, transformer, word-embeddings
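
The description above mentions stacking a sentence-level encoder under a document-level encoder and performing masked token prediction. The following is a minimal PyTorch sketch of that general idea, not the repository's actual code. The class name `HierarchicalEncoder`, the mean pooling, the way document context is added back to the tokens, the `MASK_ID` value, and all sizes are assumptions chosen for illustration. Positional encodings and padding masks are omitted for brevity.

```python
import torch
import torch.nn as nn


class HierarchicalEncoder(nn.Module):
    """Illustrative only: a sentence-level encoder stacked under a document-level encoder."""

    def __init__(self, vocab_size=1000, d_model=64, n_heads=4, n_layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)

        def make_encoder():
            layer = nn.TransformerEncoderLayer(
                d_model, n_heads, dim_feedforward=4 * d_model, batch_first=True
            )
            return nn.TransformerEncoder(layer, num_layers=n_layers)

        self.sentence_encoder = make_encoder()  # contextualizes tokens within one sentence
        self.document_encoder = make_encoder()  # contextualizes sentences within one document
        self.mlm_head = nn.Linear(d_model, vocab_size)

    def forward(self, token_ids):
        # token_ids: (batch, n_sentences, n_tokens)
        b, s, t = token_ids.shape
        x = self.embed(token_ids.reshape(b * s, t))        # (b*s, t, d)
        words = self.sentence_encoder(x)                   # (b*s, t, d)
        # Assumption: mean pooling turns token states into one vector per sentence.
        sent = words.mean(dim=1).reshape(b, s, -1)         # (b, s, d)
        sent_ctx = self.document_encoder(sent)             # (b, s, d)
        doc = sent_ctx.mean(dim=1)                         # (b, d)
        # Assumption: document-aware sentence context is added back to each token.
        words = words.reshape(b, s, t, -1) + sent_ctx.unsqueeze(2)
        logits = self.mlm_head(words)                      # (b, s, t, vocab)
        return logits, sent_ctx, doc


MASK_ID = 1  # hypothetical id reserved for the mask token

torch.manual_seed(0)
model = HierarchicalEncoder()
tokens = torch.randint(2, 1000, (2, 3, 8))   # 2 documents, 3 sentences, 8 tokens each
mask = torch.rand(tokens.shape) < 0.15       # mask roughly 15% of positions
mask[0, 0, 0] = True                         # guarantee at least one masked position
inputs = tokens.masked_fill(mask, MASK_ID)

logits, sent_repr, doc_repr = model(inputs)
# Masked token prediction: cross-entropy computed on the masked positions only.
loss = nn.functional.cross_entropy(logits[mask], tokens[mask])
```

In this sketch the three outputs correspond to the three levels named in the description: per-token logits built from contextualized word states, one vector per sentence, and one vector per document.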