Aggiornato 2 h ago

Organization

Impronta pubblica su GitHub di Tongyi Lab, Alibaba Group

@Alibaba-NLP

Visualizza profilo su GitHub

Our team at Tongyi Lab is dedicated to pioneer advancements in AI search technologies.

China

Repository pubblici

25.454

Stelle totali

1658

Follower

L'organizzazione Alibaba-NLP, parte del Tongyi Lab di Alibaba Group, ha una presenza significativa su GitHub con una vasta gamma di repository pubblici. Tra i progetti più noti ci sono DeepResearch e ZeroSearch, entrambi sviluppati in Python, che si concentrano su tecnologie di ricerca avanzate e sull'ottimizzazione delle capacità di ricerca degli LLM.

Lingue principali

Python 34

Repository pubblici

DeepResearch

★19.381

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python

Aggiornato 13 giu 2026

ZeroSearch

★1292

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python

Aggiornato 13 giu 2026

VRAG

★947

Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.

Python

Aggiornato 12 giu 2026

ViDoRAG

★664

[EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

Python

Aggiornato 11 giu 2026

OmniSearch

★430

Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Python

Aggiornato 11 giu 2026

ACE

★313

[ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction

Python

Aggiornato 1 giu 2026

CHRONOS

★300

Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"

Python

Aggiornato 12 giu 2026

EcomGPT

★275

An Instruction-tuned Large Language Model for E-commerce

Python

Aggiornato 12 giu 2026

qqr

★254

qqr is an RL training framework for open-ended agents.

Python

Aggiornato 10 giu 2026

HiAGM

★230

Hierarchy-Aware Global Model for Hierarchical Text Classification

Python

Aggiornato 1 giu 2026

SeqGPT

★227

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

Python

Aggiornato 1 giu 2026

Multi-CPR

★206

[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

Python

Aggiornato 1 giu 2026

KB-NER

★186

Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.

Python

Aggiornato 22 mag 2026

MaskSearch

★155

Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"

Python

Aggiornato 6 giu 2026

CLNER

★93

[ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning

Python

Aggiornato 19 mag 2026

MultilangStructureKD

★74

[ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling

Python

Aggiornato 1 giu 2026

E2Rank

★57

E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

Python

Aggiornato 10 giu 2026

LaRA

★51

The code for LaRA Benchmark

Python

Aggiornato 8 giu 2026

CoFE-RAG

★45

Nessuna descrizione fornita per questo repository.

Python

Aggiornato 7 giu 2026

RankingGPT

★35

code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》

Python

Aggiornato 9 apr 2026

ProtoRE

★32

Code for 'Prototypical Representation Learning for Relation Extraction'.

Python

Aggiornato 1 giu 2026

MuVER

★32

[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations

Python

Aggiornato 9 apr 2026

AISHELL-NER

★25

[ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech

Lingua sconosciuta

Aggiornato 4 gen 2026

DAAT-CWS

★23

Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation

Python

Aggiornato 1 giu 2026

MANNER

★20

[ACL 2023] MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition

Python

Aggiornato 1 giu 2026

HLATR

★20

Hybrid List Aware Transformer Reranking

Lingua sconosciuta

Aggiornato 9 apr 2026

AIN

★20

Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"

Python

Aggiornato 9 apr 2026

CDQA

★18

CDQA: Chinese Dynamic Question Answering Benchmark

Python

Aggiornato 9 apr 2026

EBM-Net

★14

Codes for the EMNLP'2020 paper "Predicting Clinical Trial Results by Implicit Evidence Integration".

Python

Aggiornato 27 nov 2024

StructuralKD

★11

[ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor

Python

Aggiornato 1 giu 2026

WebDetective

★7

A new evaluation paradigm for deep search that identifies specific LLM failure sources, introduces challenging hint-free datasets with holistic evaluation, and offers a strong baseline incorporating memory and verification.

Python

Aggiornato 1 giu 2026

Vec-RA-ODQA

★6

Source code of paper Improving "Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts

Python

Aggiornato 1 giu 2026

IBKD

★3

This is the official repository for the IBKD knowledge distillation method, as described in the paper .

Python

Aggiornato 1 giu 2026

MarCo-Dialog

★3

Nessuna descrizione fornita per questo repository.

Python

Aggiornato 17 mar 2022

VLLM-KB

★2

[EMNLP 2025] Code for "Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference"

Python

Aggiornato 9 apr 2026

Key-Point-Analysis

★1

Nessuna descrizione fornita per questo repository.

Python

Aggiornato 29 ago 2024

Gumbel-CRF

★1

Implementation of NeurIPS 20 paper: Latent Template Induction with Gumbel-CRFs

Lingua sconosciuta

Aggiornato 24 mar 2024

Partially-Observed-TreeCRFs

★1

Implementation of AAAI 21 paper: Nested Named Entity Recognition with Partially Observed TreeCRFs

Lingua sconosciuta

Aggiornato 28 feb 2023

hilichurl

★0

Nessuna descrizione fornita per questo repository.

Lingua sconosciuta

Aggiornato 13 gen 2026

Triaffine-nested-ner

★0

[ACL 2022 Findings] Fusing Heterogeneous Factors with Triaffine Mechanism for Nested Named Entity Recognition

Lingua sconosciuta

Aggiornato 1 mag 2022

ICD-MSMN

★0

[ACL 2022] Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding

Lingua sconosciuta

Aggiornato 29 apr 2022

Alibaba-TREC-PM

★0

Codes and data for Alibaba's winning systems at the TREC Precision Medicine Track 2020.

Lingua sconosciuta

Aggiornato 28 ago 2021

PoincareProbe

★0

Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces

Lingua sconosciuta

Aggiornato 7 apr 2021

Domande frequenti

Cosa costruisce Alibaba-NLP su GitHub?

Alibaba-NLP sviluppa progetti focalizzati sull'intelligenza artificiale e la ricerca, come DeepResearch e ZeroSearch. Questi repository affrontano sfide avanzate nel campo della ricerca e delle capacità linguistiche dei modelli di apprendimento.

Quali linguaggi di programmazione utilizza Alibaba-NLP?

Alibaba-NLP utilizza principalmente Python per lo sviluppo dei suoi progetti su GitHub. Questo linguaggio è particolarmente adatto per le applicazioni di intelligenza artificiale e machine learning.

I repository di Alibaba-NLP sono pubblici?

Sì, tutti i repository di Alibaba-NLP sono pubblici. Questo consente a sviluppatori e ricercatori di accedere ai progetti e contribuire al progresso delle tecnologie di ricerca avanzate.

Questa esposizione è intenzionata?

Monitora Tongyi Lab, Alibaba Group con RepoGuard e ricevi un avviso nel momento in cui appare un nuovo repository pubblico.

Monitora questo account