An open API service providing repository metadata for many open source software ecosystems.

GitHub / Alibaba-NLP 15 Repositories

Alibaba-NLP/WebAgent

🌐 WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor & WebShaper https://arxiv.org/pdf/2507.02592

Language: Python - Size: 121 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 4,628 - Forks: 346

Alibaba-NLP/ZeroSearch

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Language: Python - Size: 4.98 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1,051 - Forks: 99

Alibaba-NLP/ViDoRAG

ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

Language: Python - Size: 15.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 493 - Forks: 36

Alibaba-NLP/VRAG

Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"

Language: Python - Size: 28.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 197 - Forks: 10

Alibaba-NLP/LaRA

The code for LaRA Benchmark

Language: Python - Size: 23.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 33 - Forks: 1

Alibaba-NLP/MaskSearch

Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"

Language: Python - Size: 1.25 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 16 - Forks: 0

Alibaba-NLP/CDQA

CDQA: Chinese Dynamic Question Answering Benchmark

Language: Python - Size: 4.08 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 15 - Forks: 0

Alibaba-NLP/OmniSearch

Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Language: Python - Size: 13.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 302 - Forks: 21

Alibaba-NLP/CoFE-RAG

Language: Python - Size: 2.24 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 33 - Forks: 2

Alibaba-NLP/SeqGPT

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

Language: Python - Size: 715 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 223 - Forks: 11

Alibaba-NLP/EcomGPT

An Instruction-tuned Large Language Model for E-commerce

Language: Python - Size: 4.89 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 241 - Forks: 14

Alibaba-NLP/ACE

[ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction

Language: Python - Size: 1.68 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 306 - Forks: 46

Alibaba-NLP/CHRONOS

Repo for Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"

Language: Python - Size: 15.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 118 - Forks: 10

Alibaba-NLP/MultilangStructureKD

[ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling

Language: Python - Size: 1.16 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 71 - Forks: 9

Alibaba-NLP/Vec-RA-ODQA

Source code of paper Improving "Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts

Language: Python - Size: 13 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Alibaba-NLP/RankingGPT

code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》

Language: Python - Size: 10 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 2

Alibaba-NLP/IBKD

This is the official repository for the IBKD knowledge distillation method, as described in the paper .

Language: Python - Size: 92.8 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 2

Alibaba-NLP/Multi-CPR

[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

Language: Python - Size: 234 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 150 - Forks: 17

Alibaba-NLP/HLATR

Hybrid List Aware Transformer Reranking

Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 1

Alibaba-NLP/PoincareProbe Fork of FranxYao/PoincareProbe

Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces

Size: 5.94 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Alibaba-NLP/ProtoRE

Code for 'Prototypical Representation Learning for Relation Extraction'.

Language: Python - Size: 95.7 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 30 - Forks: 6

Alibaba-NLP/HiAGM

Hierarchy-Aware Global Model for Hierarchical Text Classification

Language: Python - Size: 259 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 191 - Forks: 42

Alibaba-NLP/KB-NER

Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.

Language: Python - Size: 1.73 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 158 - Forks: 17

Alibaba-NLP/AISHELL-NER

[ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech

Size: 6.54 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 1

Alibaba-NLP/MANNER

[ACL 2023] MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition

Language: Python - Size: 624 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

Alibaba-NLP/CLNER

[ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning

Language: Python - Size: 1.76 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 77 - Forks: 13

Alibaba-NLP/StructuralKD

[ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor

Language: Python - Size: 48.3 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 1

Alibaba-NLP/Partially-Observed-TreeCRFs Fork of FranxYao/Partially-Observed-TreeCRFs

Implementation of AAAI 21 paper: Nested Named Entity Recognition with Partially Observed TreeCRFs

Size: 188 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Alibaba-NLP/MuVER

[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations

Language: Python - Size: 24.4 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 26 - Forks: 0

Alibaba-NLP/EBM-Net

Codes for the EMNLP'2020 paper "Predicting Clinical Trial Results by Implicit Evidence Integration".

Language: Python - Size: 27.3 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 14 - Forks: 1

Alibaba-NLP/AIN

Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"

Language: Python - Size: 116 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 2

Alibaba-NLP/DAAT-CWS

Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation

Language: Python - Size: 24.8 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 23 - Forks: 6

Alibaba-NLP/Triaffine-nested-ner Fork of GanjinZero/Triaffine-nested-ner

[ACL 2022 Findings] Fusing Heterogeneous Factors with Triaffine Mechanism for Nested Named Entity Recognition

Size: 72.3 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Alibaba-NLP/ICD-MSMN Fork of GanjinZero/ICD-MSMN

[ACL 2022] Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding

Size: 1000 Bytes - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Alibaba-NLP/MarCo-Dialog

Language: Python - Size: 90 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

Alibaba-NLP/Alibaba-TREC-PM

Codes and data for Alibaba's winning systems at the TREC Precision Medicine Track 2020.

Size: 23.4 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Alibaba-NLP/Gumbel-CRF Fork of FranxYao/Gumbel-CRF

Implementation of NeurIPS 20 paper: Latent Template Induction with Gumbel-CRFs

Size: 22.9 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0