10 h ago को अपडेट किया गया

Organization

THUNLP का सार्वजनिक GitHub फुटप्रिंट

@thunlp

GitHub पर प्रोफ़ाइल देखें

Natural Language Processing Lab at Tsinghua University

FIT Building, Tsinghua University, Beijing

269

सार्वजनिक रिपोजिटरी

83,916

कुल सितारे

3,402

अनुयायी

THUNLP, Tsinghua University का Natural Language Processing Lab है, जिसका सार्वजनिक GitHub प्रोफ़ाइल एक व्यापक संग्रह प्रस्तुत करता है। इसमें Python, C++, TeX, Java, JavaScript और HTML जैसी प्रमुख भाषाओं का उपयोग किया गया है। THUNLP के शीर्ष प्रोजेक्ट्स में GNNPapers, WantWords, और OpenPrompt शामिल हैं, जो उनके अनुसंधान और विकास प्रयासों को दर्शाते हैं।

शीर्ष भाषाएँ

Python 74C++ 4TeX 3Java 2JavaScript 1HTML 1C 1TypeScript 1

सार्वजनिक रिपोजिटरी

GNNPapers

★16,792

Must-read papers on graph neural networks (GNN)

अज्ञात भाषा

अपडेट किया गया 13 जून 2026

WantWords

★7,109

An open-source online reverse dictionary.

JavaScript

अपडेट किया गया 12 जून 2026

OpenPrompt

★4,877

An Open-Source Framework for Prompt-Learning.

Python

अपडेट किया गया 11 जून 2026

OpenNRE

★4,466

An Open-Source Package for Neural Relation Extraction (NRE)

Python

अपडेट किया गया 10 जून 2026

PromptPapers

★4,315

Must-read papers on prompt-based tuning for pre-trained language models.

अज्ञात भाषा

अपडेट किया गया 7 जून 2026

OpenKE

★4,040

An Open-Source Package for Knowledge Embedding (KE)

Python

अपडेट किया गया 11 जून 2026

PLMpapers

★3,362

Must-read Papers on pre-trained language models.

अज्ञात भाषा

अपडेट किया गया 8 जून 2026

UltraChat

★2,864

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python

अपडेट किया गया 13 जून 2026

NRLPapers

★2,517

Must-read papers on network representation learning (NRL) / network embedding (NE)

TeX

अपडेट किया गया 10 जून 2026

THULAC-Python

★2,087

An Efficient Lexical Analyzer for Chinese

Python

अपडेट किया गया 9 जून 2026

OpenNE

★1,705

An Open-Source Package for Network Embedding (NE)

Python

अपडेट किया गया 26 मई 2026

TAADpapers

★1,574

Must-read Papers on Textual Adversarial Attack and Defense

Python

अपडेट किया गया 20 मई 2026

KRLPapers

★1,525

Must-read papers on knowledge representation learning (KRL) / knowledge embedding (KE)

TeX

अपडेट किया गया 21 मई 2026

KB2E

★1,423

Knowledge Graph Embeddings including TransE, TransH, TransR and PTransE

C++

अपडेट किया गया 30 मई 2026

ERNIE

★1,420

Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"

Python

अपडेट किया गया 26 मई 2026

THUOCL

★1,079

THUOCL（THU Open Chinese Lexicon）中文词库

अज्ञात भाषा

अपडेट किया गया 13 जून 2026

OpenDelta

★1,045

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python

अपडेट किया गया 26 मई 2026

NREPapers

★1,030

Must-read papers on neural relation extraction (NRE)

TeX

अपडेट किया गया 2 जून 2026

OpenCLaP

★984

Open Chinese Language Pre-trained Model Zoo

अज्ञात भाषा

अपडेट किया गया 8 मई 2026

ToolLearningPapers

★922

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

अज्ञात भाषा

अपडेट किया गया 2 जून 2026

WebCPM

★911

Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"

HTML

अपडेट किया गया 31 मई 2026

RCPapers

★889

Must-read papers on Machine Reading Comprehension

अज्ञात भाषा

अपडेट किया गया 26 मई 2026

LLMxMapReduce

★875

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 9 जून 2026

THULAC

★832

An Efficient Lexical Analyzer for Chinese

C++

अपडेट किया गया 1 जून 2026

Chinese_Rumor_Dataset

★782

中文谣言数据

अज्ञात भाषा

अपडेट किया गया 1 जून 2026

OpenAttack

★777

An Open-Source Package for Textual Adversarial Attack.

Python

अपडेट किया गया 8 जून 2026

FewRel

★746

A Large-Scale Few-Shot Relation Extraction Dataset

Python

अपडेट किया गया 26 मई 2026

OPD

★654

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Python

अपडेट किया गया 13 जून 2026

DocRED

★652

Dataset and codes for ACL 2019 DocRED: A Large-Scale Document-Level Relation Extraction Dataset.

Python

अपडेट किया गया 2 जून 2026

OpenHowNet

★637

Core Data of HowNet and OpenHowNet Python API

Python

अपडेट किया गया 29 मई 2026

ProactiveAgent

★609

A LLM-based Agent that predict its tasks proactively.

Python

अपडेट किया गया 12 जून 2026

TensorFlow-TransX

★513

An implementation of TransE and its extended models for Knowledge Representation Learning on TensorFlow

Python

अपडेट किया गया 6 मई 2026

CAIL

★509

Chinese AI & Law Challenge

अज्ञात भाषा

अपडेट किया गया 9 जून 2026

LegalPapers

★498

Must-read Papers on Legal Intelligence

अज्ञात भाषा

अपडेट किया गया 25 मई 2026

BERT-KPE

★447

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 24 मई 2026

OpenMatch

★442

An Open-Source Package for Information Retrieval.

Python

अपडेट किया गया 24 मई 2026

LLaVA-UHD

★424

LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs

Python

अपडेट किया गया 11 जून 2026

Fast-TransX

★405

An Efficient implementation of TransE and its extended models for Knowledge Representation Learning

C++

अपडेट किया गया 6 जून 2026

InfLLM

★404

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python

अपडेट किया गया 9 जून 2026

Few-NERD

★400

Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"

Python

अपडेट किया गया 26 मई 2026

TensorFlow-Summarization

★386

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 12 जून 2026

BMCourse

★371

The repo for Tsinghua summer course: Interdisciplinary Seminar on Big Models

Python

अपडेट किया गया 20 मई 2026

LEGENT

★341

Open Platform for Embodied Agents

Python

अपडेट किया गया 2 जून 2026

THULAC-Java

★339

An Efficient Lexical Analyzer for Chinese

Java

अपडेट किया गया 6 जून 2026

ChatEval

★335

Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"

Python

अपडेट किया गया 5 जून 2026

NSC

★287

Neural Sentiment Classification

Python

अपडेट किया गया 8 मई 2026

DeltaPapers

★284

Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.

अज्ञात भाषा

अपडेट किया गया 9 जून 2026

JustRL

★276

[ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Python

अपडेट किया गया 11 जून 2026

PL-Marker

★272

Source code for "Packed Levitated Marker for Entity and Relation Extraction"

Python

अपडेट किया गया 10 जून 2026

OpenBackdoor

★209

An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)

Python

अपडेट किया गया 18 मई 2026

SE-WRL

★196

Improved Word Representation Learning with Sememes

अपडेट किया गया 8 मई 2026

LegalPLMs

★194

Source code and checkpoints for legal pre-trained language models.

Python

अपडेट किया गया 5 जून 2026

Auto_CLIWC

★168

Code for Chinese LIWC Lexicon Expansion via Hierarchical Classification of Word Embeddings with Sememe Attention (AAAI18)

Python

अपडेट किया गया 26 मई 2026

DeepNote

★134

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 7 जून 2026

TritonBench

★133

TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators

Python

अपडेट किया गया 8 जून 2026

attribute_charge

★132

The source code of our COLING'18 paper "Few-Shot Charge Prediction with Discriminative Legal Attributes".

Python

अपडेट किया गया 1 मई 2026

LEVEN

★123

Source code and dataset for ACL2022 Findings Paper "LEVEN: A Large-Scale Chinese Legal Event Detection dataset"

Python

अपडेट किया गया 12 जून 2026

Ouroboros

★117

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)

Python

अपडेट किया गया 26 मई 2026

MatPlotAgent

★115

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 8 जून 2026

MultiRD

★110

Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"

Python

अपडेट किया गया 6 मई 2026

GEAR

★100

Source code for ACL 2019 paper "GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification"

Python

अपडेट किया गया 24 मई 2026

TopJudge

★100

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 30 अप्रैल 2026

Prompt-Transferability

★99

On Transferability of Prompt Tuning for Natural Language Processing

Python

अपडेट किया गया 26 मई 2026

KV-PLM

★89

Source code for "A Deep-learning System Bridging Molecule Structure and Biomedical Text with Comprehension Comparable to Human Professionals"

Python

अपडेट किया गया 3 जून 2026

DebugBench

★86

The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".

Python

अपडेट किया गया 21 मई 2026

ChartCoder

★79

[ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation

Python

अपडेट किया गया 28 अप्रैल 2026

Advbench

★77

Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP".

Python

अपडेट किया गया 6 मई 2026

NeuIRPapers

★74

Must-read Papers on Neural Information Retrieval

अज्ञात भाषा

अपडेट किया गया 29 मई 2026

MMDW

★73

Max-margin DeepWalk

Java

अपडेट किया गया 6 मई 2026

Optima

★72

Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"

Python

अपडेट किया गया 8 मई 2026

KARL

★68

KARL: Knowledge-Aware Reasoning and Reinforcement Learning for Knowledge-Intensive Visual Grounding

Python

अपडेट किया गया 14 मई 2026

CorefBERT

★67

Source code for EMNLP 2020 paper "Coreferential Reasoning Learning for Language Representation"

Python

अपडेट किया गया 24 मई 2026

H-Neurons

★66

The official implementation of the paper: H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs

Python

अपडेट किया गया 9 जून 2026

Adaptive-Note

★60

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 7 जून 2026

Delta-CoMe

★59

Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024

Python

अपडेट किया गया 2 मई 2026

EmbodiedEval

★58

Evaluate Multimodal LLMs as Embodied Agents

Python

अपडेट किया गया 11 जून 2026

FR-Spec

★55

[ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling

C++

अपडेट किया गया 29 मई 2026

duplex-model

★46

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

TypeScript

अपडेट किया गया 2 जून 2026

HiddenKiller

★45

Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"

Python

अपडेट किया गया 7 मई 2026

SubCharTokenization

★45

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 30 अप्रैल 2026

VERNet

★42

Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction

Python

अपडेट किया गया 29 मई 2026

EmbodiedAIxLLMPapers

★38

Papers on integrating large language models with embodied AI

अज्ञात भाषा

अपडेट किया गया 2 मई 2026

Seq1F1B

★37

Sequence-level 1F1B schedule for LLMs.

Python

अपडेट किया गया 23 अप्रैल 2026

hybrid-linear-attention

★36

Code and models for the paper: Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

Python

अपडेट किया गया 20 मई 2026

SparsingLaw

★32

The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".

Python

अपडेट किया गया 9 जून 2026

explore-and-evaluate

★31

Code for EMNLP2020 paper "Exploring and Evaluating Attributes, Values, and Structures for Entity Alignment".

Python

अपडेट किया गया 3 मई 2026

CokeBERT

★30

CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models

Python

अपडेट किया गया 24 मई 2026

Model_Emotion

★27

Neuron Activation

Python

अपडेट किया गया 1 मई 2026

LoRAFlow

★25

ACL 2024: LoRA-Flow Dynamic LoRA Fusion for Large Language Models in Generative Tasks

Python

अपडेट किया गया 12 मई 2026

VisualDS

★24

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 28 अप्रैल 2026

KG-Infused-RAG

★23

Official implementation for the paper "KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs"

Python

अपडेट किया गया 1 जून 2026

SchemaReinforcementLearning

★23

Learning to Generate STRUCTURED Output with Schema Reinforcement Learning

Python

अपडेट किया गया 28 अप्रैल 2026

NOSA

★17

The official implementation of NOSA

Python

अपडेट किया गया 11 जून 2026

hyperbolic_llm

★13

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 9 जून 2026

ClueAnchor

★12

[EMNLP 2025 Findings] ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation

Python

अपडेट किया गया 6 जून 2026

Chujian

★12

A large-scale dataset of Chu bamboo slip scripts and a multi-granularity tokenizer for ancient Chinese scripts

Python

अपडेट किया गया 27 मई 2026

SMP

★8

Single-Shot Meta-Pruning (SMP) for attention heads of Transformers

Python

अपडेट किया गया 29 अप्रैल 2026

DECO

★2

Source code for paper "DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices".

Python

अपडेट किया गया 23 मई 2026

CPMobius

★1

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 14 मई 2026

LexRel

★1

इस रिपोजिटरी के लिए कोई विवरण प्रदान नहीं किया गया।

Python

अपडेट किया गया 7 मई 2026

अक्सर पूछे जाने वाले प्रश्न

thunlp GitHub पर क्या बनाता है?

thunlp GitHub पर प्राकृतिक भाषा प्रसंस्करण से संबंधित कई परियोजनाएँ विकसित करता है, जिनमें GNNPapers और OpenPrompt जैसे प्रमुख प्रोजेक्ट शामिल हैं। ये प्रोजेक्ट अनुसंधान और ओपन-सोर्स विकास के लिए महत्वपूर्ण हैं।

thunlp कौन सी प्रोग्रामिंग भाषाएँ उपयोग करता है?

thunlp के प्रोजेक्ट्स में मुख्य रूप से Python, C++, TeX, Java, JavaScript और HTML जैसी प्रोग्रामिंग भाषाएँ शामिल हैं। ये भाषाएँ उनके अनुसंधान कार्य और टूल्स के विकास में महत्वपूर्ण भूमिका निभाती हैं।

क्या thunlp के रिपॉजिटरी सार्वजनिक हैं?

हाँ, thunlp के सभी रिपॉजिटरी सार्वजनिक हैं। ये रिपॉजिटरी ओपन-सोर्स हैं और किसी भी व्यक्ति द्वारा उपयोग और योगदान के लिए उपलब्ध हैं, जिससे ज्ञान और नवाचार को बढ़ावा मिलता है।

क्या यह एक्सपोजर इरादा है?

RepoGuard के साथ THUNLP की निगरानी करें और जैसे ही एक नया सार्वजनिक रिपोजिटरी बनता है, सूचित हों।

इस खाते की निगरानी करें