Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / thunlp 117 repositories
Natural Language Processing Lab at Tsinghua University
thunlp/LEGENT
Open Platform for Embodied Agents
Language: Python - Size: 1.7 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 117 - Forks: 5
thunlp/OpenHowNet
Core Data of HowNet and OpenHowNet Python API
Language: Python - Size: 259 MB - Last synced: 1 day ago - Pushed: over 2 years ago - Stars: 592 - Forks: 89
thunlp/OpenNRE
An Open-Source Package for Neural Relation Extraction (NRE)
Language: Python - Size: 261 MB - Last synced: 2 days ago - Pushed: 4 months ago - Stars: 4,256 - Forks: 1,055
thunlp/OpenKE
An Open-Source Package for Knowledge Embedding (KE)
Language: Python - Size: 281 MB - Last synced: 2 days ago - Pushed: 4 months ago - Stars: 3,725 - Forks: 984
thunlp/FewRel
A Large-Scale Few-Shot Relation Extraction Dataset
Language: Python - Size: 24.1 MB - Last synced: 1 day ago - Pushed: about 2 years ago - Stars: 718 - Forks: 164
thunlp/Chinese_Rumor_Dataset
Chinese rumor data
Size: 53.5 MB - Last synced: 2 days ago - Pushed: almost 4 years ago - Stars: 680 - Forks: 134
thunlp/LAE
Datasets and code for "Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training"
Language: Jupyter Notebook - Size: 26.5 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 0 - Forks: 0
thunlp/XQA
Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset"
Language: Python - Size: 43 KB - Last synced: 2 days ago - Pushed: over 2 years ago - Stars: 85 - Forks: 16
thunlp/BERT-KPE
Language: Python - Size: 8.43 MB - Last synced: 3 days ago - Pushed: over 1 year ago - Stars: 438 - Forks: 78
thunlp/Prompt-Transferability
On Transferability of Prompt Tuning for Natural Language Processing
Language: Python - Size: 629 MB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 87 - Forks: 11
thunlp/GEAR
Source code for ACL 2019 paper "GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification"
Language: Python - Size: 9.4 MB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 97 - Forks: 25
thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Language: Python - Size: 42 MB - Last synced: 9 days ago - Pushed: 9 months ago - Stars: 939 - Forks: 76
thunlp/PEVL
Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”
Language: Python - Size: 12.7 MB - Last synced: 8 days ago - Pushed: over 1 year ago - Stars: 45 - Forks: 6
thunlp/MatPlotAgent
Language: Python - Size: 19.9 MB - Last synced: 23 days ago - Pushed: about 2 months ago - Stars: 28 - Forks: 3
thunlp/TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
Language: Python - Size: 173 KB - Last synced: 25 days ago - Pushed: about 1 month ago - Stars: 1,450 - Forks: 192
thunlp/PLMpapers
Must-read Papers on pre-trained language models.
Size: 1.68 MB - Last synced: 24 days ago - Pushed: over 1 year ago - Stars: 3,288 - Forks: 436
thunlp/InfLLM
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Language: Python - Size: 262 KB - Last synced: 25 days ago - Pushed: 26 days ago - Stars: 196 - Forks: 16
thunlp/JointNRE
Joint Neural Relation Extraction with Text and KGs
Language: Python - Size: 259 KB - Last synced: 23 days ago - Pushed: over 1 year ago - Stars: 186 - Forks: 36
thunlp/SememePSO-Attack
Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization"
Language: Python - Size: 58.7 MB - Last synced: 25 days ago - Pushed: about 3 years ago - Stars: 85 - Forks: 14
thunlp/ERNIE
Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"
Language: Python - Size: 1.45 MB - Last synced: 17 days ago - Pushed: 4 months ago - Stars: 1,401 - Forks: 270
thunlp/THULAC-Java
An Efficient Lexical Analyzer for Chinese
Language: Java - Size: 332 KB - Last synced: 21 days ago - Pushed: over 6 years ago - Stars: 324 - Forks: 114
thunlp/THULAC-Python
An Efficient Lexical Analyzer for Chinese
Language: Python - Size: 78.1 KB - Last synced: 29 days ago - Pushed: over 2 years ago - Stars: 1,962 - Forks: 334
thunlp/ChatEval
Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"
Language: Python - Size: 101 MB - Last synced: 3 months ago - Pushed: 4 months ago - Stars: 160 - Forks: 9
thunlp/THULAC
An Efficient Lexical Analyzer for Chinese
Language: C++ - Size: 93.8 KB - Last synced: 29 days ago - Pushed: 11 months ago - Stars: 769 - Forks: 170
thunlp/LLaVA-UHD
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Language: Python - Size: 2.21 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 59 - Forks: 2
thunlp/OpenAttack
An Open-Source Package for Textual Adversarial Attack.
Language: Python - Size: 4.65 MB - Last synced: 30 days ago - Pushed: 10 months ago - Stars: 650 - Forks: 121
thunlp/KRLPapers
Must-read papers on knowledge representation learning (KRL) / knowledge embedding (KE)
Language: TeX - Size: 85 KB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 1,527 - Forks: 249
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
Language: Python - Size: 14.4 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 4,132 - Forks: 427
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Language: Python - Size: 5.47 MB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 2,087 - Forks: 107
thunlp/Knowledge-Plugin
Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models"
Language: Python - Size: 2.6 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 46 - Forks: 3
thunlp/ToolLearningPapers
Size: 13.4 MB - Last synced: about 2 months ago - Pushed: 9 months ago - Stars: 766 - Forks: 42
thunlp/explore-and-evaluate
Code for EMNLP2020 paper "Exploring and Evaluating Attributes, Values, and Structures for Entity Alignment".
Language: Python - Size: 26.4 KB - Last synced: 2 days ago - Pushed: about 2 years ago - Stars: 27 - Forks: 7
thunlp/EREN
Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1
Language: Python - Size: 1.72 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 4 - Forks: 0
thunlp/DeltaPapers
Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.
Size: 36.1 KB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 257 - Forks: 17
thunlp/GNNPapers
Must-read papers on graph neural networks (GNN)
Size: 301 KB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 15,439 - Forks: 2,979
thunlp/MuGNN
Source code for ACL2019 paper "Multi-Channel Graph Neural Network for Entity Alignment".
Language: Python - Size: 136 MB - Last synced: 2 days ago - Pushed: over 3 years ago - Stars: 61 - Forks: 11
thunlp/OpenCLaP
Open Chinese Language Pre-trained Model Zoo
Size: 17.6 KB - Last synced: about 1 month ago - Pushed: about 4 years ago - Stars: 972 - Forks: 144
thunlp/Few-NERD
Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"
Language: Python - Size: 67.4 KB - Last synced: about 1 month ago - Pushed: 8 months ago - Stars: 373 - Forks: 56
thunlp/Fast-TransX
An efficient implementation of TransE and its extended models for Knowledge Representation Learning
Language: C++ - Size: 9.78 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 397 - Forks: 109
thunlp/S3Delta
Code for the paper "Sparse Structure Search for Delta Tuning"
Language: Jupyter Notebook - Size: 587 KB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 6 - Forks: 2
thunlp/DebugBench
The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".
Language: Python - Size: 18.2 MB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 39 - Forks: 1
thunlp/BMCourse
The repo for Tsinghua summer course: Interdisciplinary Seminar on Big Models
Language: Python - Size: 101 MB - Last synced: about 2 months ago - Pushed: almost 2 years ago - Stars: 258 - Forks: 49
thunlp/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
Size: 194 KB - Last synced: about 2 months ago - Pushed: 10 months ago - Stars: 3,874 - Forks: 367
thunlp/CANE
Source code and datasets of "CANE: Context-Aware Network Embedding for Relation Modeling"
Language: Python - Size: 1.66 MB - Last synced: about 2 months ago - Pushed: about 5 years ago - Stars: 190 - Forks: 77
thunlp/KnowledgeablePromptTuning
kpt code
Language: Python - Size: 211 KB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 196 - Forks: 21
thunlp/THUOCL
THUOCL (THU Open Chinese Lexicon): a Chinese word lexicon
Size: 1.34 MB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 778 - Forks: 189
thunlp/NRLPapers
Must-read papers on network representation learning (NRL) / network embedding (NE)
Language: TeX - Size: 229 KB - Last synced: about 2 months ago - Pushed: almost 4 years ago - Stars: 2,520 - Forks: 652
thunlp/Seq2Seq-Prompt
Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"
Language: Python - Size: 1.52 MB - Last synced: 22 days ago - Pushed: over 1 year ago - Stars: 24 - Forks: 4
thunlp/OpenNE
An Open-Source Package for Network Embedding (NE)
Language: Python - Size: 114 MB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 1,668 - Forks: 489
thunlp/CSSReview
This repository contains the paper list of CSS.
Size: 68.4 KB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 25 - Forks: 0
thunlp/EmbodiedAIxLLMPapers
Papers on integrating large language models with embodied AI
Size: 46.9 KB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 6 - Forks: 2
thunlp/OpenMatch 📦
An Open-Source Package for Information Retrieval.
Language: Python - Size: 61 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 443 - Forks: 47
thunlp/HATT-Proto
Code and dataset of AAAI2019 paper Hybrid Attention-Based Prototypical Networks for Noisy Few-Shot Relation Classification
Language: Python - Size: 5.36 MB - Last synced: 2 months ago - Pushed: over 5 years ago - Stars: 180 - Forks: 35
thunlp/SCPapers
Must-read Papers on Sememe Computation
Size: 1.12 MB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 191 - Forks: 37
thunlp/attribute_charge
The source code of our COLING'18 paper "Few-Shot Charge Prediction with Discriminative Legal Attributes".
Language: Python - Size: 16.6 KB - Last synced: about 2 months ago - Pushed: over 5 years ago - Stars: 123 - Forks: 29
thunlp/sememe_prediction
Codes for Lexical Sememe Prediction via Word Embeddings and Matrix Factorization (IJCAI 2017).
Language: Python - Size: 39.4 MB - Last synced: 25 days ago - Pushed: over 4 years ago - Stars: 60 - Forks: 25
thunlp/THUCTC
An Efficient Chinese Text Classifier
Language: Java - Size: 1.67 MB - Last synced: 2 months ago - Pushed: over 5 years ago - Stars: 196 - Forks: 67
thunlp/AutoForm
Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"
Language: Python - Size: 4.73 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
thunlp/LegalPapers
Must-read Papers on Legal Intelligence
Size: 26.4 KB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 443 - Forks: 60
thunlp/Ouroboros
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting
Language: Python - Size: 101 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 10 - Forks: 4
thunlp/COS960
COS960: A Chinese Word Similarity Dataset of 960 Word Pairs
Language: Python - Size: 20.5 KB - Last synced: 25 days ago - Pushed: almost 5 years ago - Stars: 36 - Forks: 2
thunlp/NREPapers
Must-read papers on neural relation extraction (NRE)
Language: TeX - Size: 58.6 KB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 1,017 - Forks: 154
thunlp/WantWords
An open-source online reverse dictionary.
Language: JavaScript - Size: 14.5 MB - Last synced: 3 months ago - Pushed: about 2 years ago - Stars: 6,912 - Forks: 620
thunlp/KB2E
Knowledge Graph Embeddings including TransE, TransH, TransR and PTransE
Language: C++ - Size: 63.6 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 1,373 - Forks: 456
thunlp/TransNet
Source code and datasets of IJCAI2017 paper "TransNet: Translation-Based Network Representation Learning for Social Relation Extraction".
Language: Jupyter Notebook - Size: 153 MB - Last synced: about 2 months ago - Pushed: about 6 years ago - Stars: 103 - Forks: 38
thunlp/NSC
Neural Sentiment Classification
Language: Python - Size: 40 KB - Last synced: 3 months ago - Pushed: about 6 years ago - Stars: 287 - Forks: 96
thunlp/RCPapers
Must-read papers on Machine Reading Comprehension
Size: 16.6 KB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 894 - Forks: 172
thunlp/FalseQA
Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"
Language: Python - Size: 630 KB - Last synced: about 2 months ago - Pushed: 11 months ago - Stars: 19 - Forks: 0
thunlp/Muffin
Language: Python - Size: 5.93 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 29 - Forks: 2
thunlp/NRE
Neural Relation Extraction, including CNN, PCNN, CNN+ATT, PCNN+ATT
Language: C++ - Size: 234 MB - Last synced: 4 months ago - Pushed: almost 4 years ago - Stars: 813 - Forks: 317
thunlp/Sememe-SC
Source code and data for ACL 2019 paper "Modeling Semantic Compositionality with Sememe Knowledge"
Language: Python - Size: 55.9 MB - Last synced: 25 days ago - Pushed: almost 4 years ago - Stars: 35 - Forks: 7
thunlp/UnifiedInstructionTuning
Language: Python - Size: 1.09 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 2 - Forks: 0
thunlp/CPT
Colorful Prompt Tuning for Pre-trained Vision-Language Models
Language: Python - Size: 19.2 MB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 43 - Forks: 1
thunlp/VisualDS
Language: Python - Size: 9.91 MB - Last synced: 4 months ago - Pushed: about 2 years ago - Stars: 25 - Forks: 3
thunlp/OpenQA
The source code of ACL 2018 paper "Denoising Distantly Supervised Open-Domain Question Answering".
Language: Python - Size: 71.3 KB - Last synced: 4 days ago - Pushed: over 5 years ago - Stars: 206 - Forks: 49
thunlp/Moderate-fitting
Language: Python - Size: 8.85 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 3 - Forks: 0
thunlp/IKRL
Image-embodied Knowledge Representation Learning (IJCAI-2017)
Language: C++ - Size: 346 KB - Last synced: 4 months ago - Pushed: over 2 years ago - Stars: 43 - Forks: 10
thunlp/TKRL
Representation Learning of Knowledge Graphs with Hierarchical Types (IJCAI-2016)
Language: C++ - Size: 17.5 MB - Last synced: 4 months ago - Pushed: about 5 years ago - Stars: 79 - Forks: 27
thunlp/RECIPE
Language: Python - Size: 2.48 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
thunlp/TLNN
Source code for EMNLP-IJCNLP 2019 paper "Event Detection with Trigger-Aware Lattice Neural Network".
Language: Python - Size: 73.2 KB - Last synced: 5 months ago - Pushed: over 4 years ago - Stars: 75 - Forks: 17
thunlp/CKRL
Does William Shakespeare REALLY Write Hamlet? Knowledge Representation Learning with Confidence (AAAI-2018)
Language: C++ - Size: 12.2 MB - Last synced: 4 months ago - Pushed: about 5 years ago - Stars: 44 - Forks: 11
thunlp/WebCPM 📦
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
Language: HTML - Size: 4.01 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 909 - Forks: 77
thunlp/Chinese_NRE
Source code for ACL 2019 paper "Chinese Relation Extraction with Multi-Grained Information and External Linguistic Knowledge"
Language: Python - Size: 1.5 MB - Last synced: 5 months ago - Pushed: about 4 years ago - Stars: 263 - Forks: 42
thunlp/DeepTHULAC
A High-Performance Lexical Analyzer for Chinese
Language: Python - Size: 139 KB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 18 - Forks: 1
thunlp/Modularity-Analysis
Repo for ACL2023 Findings paper "Emergent Modularity in Pre-trained Transformers"
Language: Python - Size: 1.29 MB - Last synced: 6 months ago - Pushed: 11 months ago - Stars: 13 - Forks: 0
thunlp/OOP-THU
OOP Course Material & QA
Size: 19.7 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 141 - Forks: 18
thunlp/LLM-generated-text-detection
Language: Python - Size: 6.49 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 4 - Forks: 0
thunlp/PL-Marker
Source code for "Packed Levitated Marker for Entity and Relation Extraction"
Language: Python - Size: 1.96 MB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 229 - Forks: 36
thunlp/KernelGAT
The source codes for Fine-grained Fact Verification with Kernel Graph Attention Network.
Language: Python - Size: 2.26 MB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 166 - Forks: 36
thunlp/LegalPLMs
Source code and checkpoints for legal pre-trained language models.
Language: Python - Size: 30.3 KB - Last synced: 6 months ago - Pushed: about 3 years ago - Stars: 147 - Forks: 22
thunlp/NLP-THU
NLP Course Material & QA
Size: 69.3 KB - Last synced: 6 months ago - Pushed: about 2 years ago - Stars: 158 - Forks: 22
thunlp/DocRED
Dataset and codes for ACL 2019 DocRED: A Large-Scale Document-Level Relation Extraction Dataset.
Language: Python - Size: 55.7 KB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 572 - Forks: 107
thunlp/CAIL2018
Language: Python - Size: 41 KB - Last synced: 6 months ago - Pushed: almost 6 years ago - Stars: 109 - Forks: 45
thunlp/SOS4NLP
Survey of Surveys for Natural Language Processing (SOS4NLP)
Size: 162 KB - Last synced: 6 months ago - Pushed: almost 3 years ago - Stars: 327 - Forks: 40
thunlp/HMEAE
Source code for EMNLP-IJCNLP 2019 paper "HMEAE: Hierarchical Modular Event Argument Extraction".
Language: Python - Size: 44.9 KB - Last synced: 7 months ago - Pushed: about 2 years ago - Stars: 84 - Forks: 21
thunlp/MetaAdaptRank
Code and data of the ACL 2021 paper: Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision
Language: Python - Size: 55.3 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 7 - Forks: 2
thunlp/PathNRE
Source code and dataset of EMNLP2017 paper "Incorporating Relation Paths in Neural Relation Extraction".
Language: C++ - Size: 245 KB - Last synced: 7 months ago - Pushed: over 4 years ago - Stars: 41 - Forks: 11
thunlp/SelectiveMasking
Source code for "Train No Evil: Selective Masking for Task-Guided Pre-Training"
Language: Python - Size: 2.01 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 67 - Forks: 15
thunlp/MNRE
The code and data for ACL2017 paper "Neural Relation Extraction with Multi-lingual Attention"
Language: C++ - Size: 20.5 KB - Last synced: 7 months ago - Pushed: about 7 years ago - Stars: 45 - Forks: 17
thunlp/TensorFlow-Summarization
Language: Python - Size: 851 KB - Last synced: 6 months ago - Pushed: over 6 years ago - Stars: 391 - Forks: 112