Code and documentation to train Stanford's Alpaca models, and generate the data.
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
GPT4 based personalized ArXiv paper assistant bot
Nenhuma descrição fornecida para este repositório.
Nenhuma descrição fornecida para este repositório.
Nenhuma descrição fornecida para este repositório.
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"
Fast ImageNet training code with FFCV
Monitore Tatsu's shared repositories com o RepoGuard e receba alertas no momento em que um novo repositório público aparecer.
Monitore esta conta