GitHub / Alibaba-NLP 15 Repositories
Alibaba-NLP/WebAgent
WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor & WebShaper https://arxiv.org/pdf/2507.02592
Language: Python - Size: 121 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 4,628 - Forks: 346

Alibaba-NLP/ZeroSearch
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Language: Python - Size: 4.98 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1,051 - Forks: 99

Alibaba-NLP/ViDoRAG
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents
Language: Python - Size: 15.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 493 - Forks: 36

Alibaba-NLP/VRAG
Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"
Language: Python - Size: 28.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 197 - Forks: 10

Alibaba-NLP/LaRA
The code for LaRA Benchmark
Language: Python - Size: 23.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 33 - Forks: 1

Alibaba-NLP/MaskSearch
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
Language: Python - Size: 1.25 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 16 - Forks: 0

Alibaba-NLP/CDQA
CDQA: Chinese Dynamic Question Answering Benchmark
Language: Python - Size: 4.08 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 15 - Forks: 0

Alibaba-NLP/OmniSearch
Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
Language: Python - Size: 13.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 302 - Forks: 21

Alibaba-NLP/CoFE-RAG
Language: Python - Size: 2.24 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 33 - Forks: 2

Alibaba-NLP/SeqGPT
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
Language: Python - Size: 715 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 223 - Forks: 11

Alibaba-NLP/EcomGPT
An Instruction-tuned Large Language Model for E-commerce
Language: Python - Size: 4.89 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 241 - Forks: 14

Alibaba-NLP/ACE
[ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction
Language: Python - Size: 1.68 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 306 - Forks: 46

Alibaba-NLP/CHRONOS
Repo for Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"
Language: Python - Size: 15.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 118 - Forks: 10

Alibaba-NLP/MultilangStructureKD
[ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling
Language: Python - Size: 1.16 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 71 - Forks: 9

Alibaba-NLP/Vec-RA-ODQA
Source code of paper "Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts"
Language: Python - Size: 13 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Alibaba-NLP/RankingGPT
Code for paper "RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement"
Language: Python - Size: 10 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 2

Alibaba-NLP/IBKD
This is the official repository for the IBKD knowledge distillation method, as described in the paper.
Language: Python - Size: 92.8 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 2

Alibaba-NLP/Multi-CPR
[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
Language: Python - Size: 234 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 150 - Forks: 17

Alibaba-NLP/HLATR
Hybrid List Aware Transformer Reranking
Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 1

Alibaba-NLP/PoincareProbe Fork of FranxYao/PoincareProbe
Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces
Size: 5.94 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Alibaba-NLP/ProtoRE
Code for 'Prototypical Representation Learning for Relation Extraction'.
Language: Python - Size: 95.7 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 30 - Forks: 6

Alibaba-NLP/HiAGM
Hierarchy-Aware Global Model for Hierarchical Text Classification
Language: Python - Size: 259 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 191 - Forks: 42

Alibaba-NLP/KB-NER
Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.
Language: Python - Size: 1.73 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 158 - Forks: 17

Alibaba-NLP/AISHELL-NER
[ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech
Size: 6.54 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 1

Alibaba-NLP/MANNER
[ACL 2023] MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition
Language: Python - Size: 624 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

Alibaba-NLP/CLNER
[ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
Language: Python - Size: 1.76 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 77 - Forks: 13

Alibaba-NLP/StructuralKD
[ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor
Language: Python - Size: 48.3 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 1

Alibaba-NLP/Partially-Observed-TreeCRFs Fork of FranxYao/Partially-Observed-TreeCRFs
Implementation of AAAI 21 paper: Nested Named Entity Recognition with Partially Observed TreeCRFs
Size: 188 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Alibaba-NLP/MuVER
[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations
Language: Python - Size: 24.4 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 26 - Forks: 0

Alibaba-NLP/EBM-Net
Codes for the EMNLP'2020 paper "Predicting Clinical Trial Results by Implicit Evidence Integration".
Language: Python - Size: 27.3 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 14 - Forks: 1

Alibaba-NLP/AIN
Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"
Language: Python - Size: 116 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 2

Alibaba-NLP/DAAT-CWS
Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation
Language: Python - Size: 24.8 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 23 - Forks: 6

Alibaba-NLP/Triaffine-nested-ner Fork of GanjinZero/Triaffine-nested-ner
[ACL 2022 Findings] Fusing Heterogeneous Factors with Triaffine Mechanism for Nested Named Entity Recognition
Size: 72.3 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Alibaba-NLP/ICD-MSMN Fork of GanjinZero/ICD-MSMN
[ACL 2022] Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding
Size: 1000 Bytes - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Alibaba-NLP/MarCo-Dialog
Language: Python - Size: 90 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

Alibaba-NLP/Alibaba-TREC-PM
Codes and data for Alibaba's winning systems at the TREC Precision Medicine Track 2020.
Size: 23.4 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Alibaba-NLP/Gumbel-CRF Fork of FranxYao/Gumbel-CRF
Implementation of NeurIPS 20 paper: Latent Template Induction with Gumbel-CRFs
Size: 22.9 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
