Atualizado 2 h ago

Organization

Pegada pública no GitHub de Tongyi Lab, Alibaba Group

@Alibaba-NLP

Ver perfil no GitHub

Our team at Tongyi Lab is dedicated to pioneer advancements in AI search technologies.

China

Repositórios públicos

25.454

Total de estrelas

1.658

Seguidores

A presença pública do Alibaba-NLP no GitHub é focada em tecnologias de busca em IA, com uma ampla gama de repositórios. Entre os projetos notáveis estão DeepResearch, ZeroSearch e VRAG, todos desenvolvidos em Python, refletindo o compromisso da equipe do Tongyi Lab, Alibaba Group, com a pesquisa em inteligência artificial.

Principais linguagens

Python 34

Repositórios públicos

DeepResearch

★19.381

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python

Atualizado 13 de jun. de 2026

ZeroSearch

★1.292

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python

Atualizado 13 de jun. de 2026

VRAG

★947

Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.

Python

Atualizado 12 de jun. de 2026

ViDoRAG

★664

[EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

Python

Atualizado 11 de jun. de 2026

OmniSearch

★430

Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Python

Atualizado 11 de jun. de 2026

ACE

★313

[ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction

Python

Atualizado 1 de jun. de 2026

CHRONOS

★300

Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"

Python

Atualizado 12 de jun. de 2026

EcomGPT

★275

An Instruction-tuned Large Language Model for E-commerce

Python

Atualizado 12 de jun. de 2026

qqr

★254

qqr is an RL training framework for open-ended agents.

Python

Atualizado 10 de jun. de 2026

HiAGM

★230

Hierarchy-Aware Global Model for Hierarchical Text Classification

Python

Atualizado 1 de jun. de 2026

SeqGPT

★227

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

Python

Atualizado 1 de jun. de 2026

Multi-CPR

★206

[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

Python

Atualizado 1 de jun. de 2026

KB-NER

★186

Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.

Python

Atualizado 22 de mai. de 2026

MaskSearch

★155

Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"

Python

Atualizado 6 de jun. de 2026

CLNER

★93

[ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning

Python

Atualizado 19 de mai. de 2026

MultilangStructureKD

★74

[ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling

Python

Atualizado 1 de jun. de 2026

E2Rank

★57

E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

Python

Atualizado 10 de jun. de 2026

LaRA

★51

The code for LaRA Benchmark

Python

Atualizado 8 de jun. de 2026

CoFE-RAG

★45

Nenhuma descrição fornecida para este repositório.

Python

Atualizado 7 de jun. de 2026

RankingGPT

★35

code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》

Python

Atualizado 9 de abr. de 2026

ProtoRE

★32

Code for 'Prototypical Representation Learning for Relation Extraction'.

Python

Atualizado 1 de jun. de 2026

MuVER

★32

[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations

Python

Atualizado 9 de abr. de 2026

AISHELL-NER

★25

[ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech

Linguagem Desconhecida

Atualizado 4 de jan. de 2026

DAAT-CWS

★23

Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation

Python

Atualizado 1 de jun. de 2026

MANNER

★20

[ACL 2023] MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition

Python

Atualizado 1 de jun. de 2026

HLATR

★20

Hybrid List Aware Transformer Reranking

Linguagem Desconhecida

Atualizado 9 de abr. de 2026

AIN

★20

Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"

Python

Atualizado 9 de abr. de 2026

CDQA

★18

CDQA: Chinese Dynamic Question Answering Benchmark

Python

Atualizado 9 de abr. de 2026

EBM-Net

★14

Codes for the EMNLP'2020 paper "Predicting Clinical Trial Results by Implicit Evidence Integration".

Python

Atualizado 27 de nov. de 2024

StructuralKD

★11

[ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor

Python

Atualizado 1 de jun. de 2026

WebDetective

★7

A new evaluation paradigm for deep search that identifies specific LLM failure sources, introduces challenging hint-free datasets with holistic evaluation, and offers a strong baseline incorporating memory and verification.

Python

Atualizado 1 de jun. de 2026

Vec-RA-ODQA

★6

Source code of paper Improving "Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts

Python

Atualizado 1 de jun. de 2026

IBKD

★3

This is the official repository for the IBKD knowledge distillation method, as described in the paper .

Python

Atualizado 1 de jun. de 2026

MarCo-Dialog

★3

Nenhuma descrição fornecida para este repositório.

Python

Atualizado 17 de mar. de 2022

VLLM-KB

★2

[EMNLP 2025] Code for "Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference"

Python

Atualizado 9 de abr. de 2026

Key-Point-Analysis

★1

Nenhuma descrição fornecida para este repositório.

Python

Atualizado 29 de ago. de 2024

Gumbel-CRF

★1

Implementation of NeurIPS 20 paper: Latent Template Induction with Gumbel-CRFs

Linguagem Desconhecida

Atualizado 24 de mar. de 2024

Partially-Observed-TreeCRFs

★1

Implementation of AAAI 21 paper: Nested Named Entity Recognition with Partially Observed TreeCRFs

Linguagem Desconhecida

Atualizado 28 de fev. de 2023

hilichurl

★0

Nenhuma descrição fornecida para este repositório.

Linguagem Desconhecida

Atualizado 13 de jan. de 2026

Triaffine-nested-ner

★0

[ACL 2022 Findings] Fusing Heterogeneous Factors with Triaffine Mechanism for Nested Named Entity Recognition

Linguagem Desconhecida

Atualizado 1 de mai. de 2022

ICD-MSMN

★0

[ACL 2022] Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding

Linguagem Desconhecida

Atualizado 29 de abr. de 2022

Alibaba-TREC-PM

★0

Codes and data for Alibaba's winning systems at the TREC Precision Medicine Track 2020.

Linguagem Desconhecida

Atualizado 28 de ago. de 2021

PoincareProbe

★0

Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces

Linguagem Desconhecida

Atualizado 7 de abr. de 2021

Perguntas frequentes

O que o Alibaba-NLP desenvolve no GitHub?

Alibaba-NLP desenvolve uma variedade de projetos relacionados a tecnologias de busca em inteligência artificial, incluindo repositórios como DeepResearch, ZeroSearch e VRAG, que são amplamente utilizados na comunidade de pesquisa.

Quais linguagens de programação o Alibaba-NLP utiliza?

O Alibaba-NLP utiliza principalmente Python em seus projetos. Essa escolha de linguagem permite a implementação de algoritmos complexos e a integração com bibliotecas de aprendizado de máquina.

Os repositórios do Alibaba-NLP são públicos?

Sim, todos os repositórios do Alibaba-NLP são públicos. Isso permite que outros desenvolvedores e pesquisadores acessem e contribuam para os projetos, promovendo a colaboração na área de inteligência artificial.

Essa exposição é intencional?

Monitore Tongyi Lab, Alibaba Group com o RepoGuard e receba alertas no momento em que um novo repositório público aparecer.

Monitore esta conta