An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: document-representation

AkbiHiba/Enhancing_IR_using_Query_Clarificati

Work done with a teammate as part of the graduate PLDAC course at Sorbonne University

Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

machine-intelligence-laboratory/TopicNet

Interface for easier topic modelling.

Language: Python - Size: 10.5 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 139 - Forks: 17

jaeeun-n/hilbert-contrastive-learning

Hyperbolic Contrastive Learning for Document Representations - A Multi-View Approach with Paragraph-level Similarities

Language: Python - Size: 155 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 3 - Forks: 0

hank110/bagofconcepts

Python implementation of bag-of-concepts

Language: Python - Size: 5.34 MB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 20 - Forks: 1

omni-us/pagexml

Library in C++ and a Python wrapper for dealing with Page XML files

Language: C++ - Size: 6.7 MB - Last synced at: 19 days ago - Pushed at: about 1 month ago - Stars: 13 - Forks: 2

OneOffTech/parse-document-model-python

Define models to represent a textual document, e.g. a PDF, preserving the hierarchy of the content.

Language: Python - Size: 20.5 KB - Last synced at: 24 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

psaegert/pmtrendviz

Unsupervised Discovery Of Trends In Biomedical Research Based On The PubMed Baseline Repository

Language: Python - Size: 500 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

laddie132/LW-PT

Dataset and code for "Label-Wise Document Pre-Training for Multi-Label Text Classification" (NLPCC 2020)

Language: Python - Size: 109 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0