Vicuna LLM

From Wikipedia, the free encyclopedia

Vicuna LLM is an omnibus large language model used in AI research.[1] Its methodology is to let the public at large compare the accuracy of LLMs "in the wild" (an example of citizen science) and vote on their output, using a question-and-answer chat format. At the beginning of each round, two LLM chatbots drawn at random from a diverse pool of nine are presented anonymously; their identities are revealed only after the user votes on their answers. The user can either replay ("regenerate") a round or begin an entirely fresh one with new LLMs,[2] and can also choose which LLMs compete. Based on Llama 2,[3][4] Vicuna is an open-source project[5][6] and has itself become the subject of academic research in the burgeoning field.[7][8] A non-commercial public demo of the Vicuna-13b model is available through LMSYS.[9]
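The anonymous pairwise voting round described above can be illustrated with a minimal sketch in Python. The model pool, the ask_model helper, and all names below are hypothetical placeholders rather than LMSYS's actual implementation; the sketch only shows the order of events: random selection, anonymous presentation, a user vote, and then the reveal.

    import random

    # Hypothetical pool of nine chatbots; the real Chatbot Arena pool differs.
    MODEL_POOL = [f"model_{i}" for i in range(1, 10)]

    def ask_model(model_name: str, question: str) -> str:
        """Placeholder for querying a chatbot; a real system would call an API."""
        return f"[{model_name}'s answer to: {question}]"

    def battle_round(question: str) -> None:
        # Two models are drawn at random and shown anonymously as "A" and "B".
        model_a, model_b = random.sample(MODEL_POOL, 2)
        print("Model A:", ask_model(model_a, question))
        print("Model B:", ask_model(model_b, question))

        # The user votes before the identities are revealed.
        vote = input("Which answer is better? (A/B/tie): ").strip().upper()
        print(f"Identities: A = {model_a}, B = {model_b}; your vote: {vote}")

    if __name__ == "__main__":
        q = "Explain the difference between a list and a tuple in Python."
        battle_round(q)   # one anonymous round
        battle_round(q)   # "regenerating" simply runs a fresh round on the same question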

References

  1. ^ "Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality | LMSYS Org". lmsys.org.
  2. ^ "Vicuna LLM Commercially Available, New v1.5 Update Improves Context Length".
  3. ^ "lmsys/vicuna-13b-v1.5 · Hugging Face". huggingface.co.
  4. ^ "The LLM Index: Vicuna | Sapling". sapling.ai.
  5. ^ "FastChat". October 29, 2023 – via GitHub.
  6. ^ "How to Train and Deploy Vicuna and FastChat LLMs | Width.ai". www.width.ai.
  7. ^ Peng, Baolin; Li, Chunyuan; He, Pengcheng; Galley, Michel; Gao, Jianfeng (2023). "Instruction Tuning with GPT-4". arXiv:2304.03277 [cs.CL].
  8. ^ Zheng, Lianmin; Chiang, Wei-Lin; Sheng, Ying; Zhuang, Siyuan; Wu, Zhanghao; Zhuang, Yonghao; Lin, Zi; Li, Zhuohan; Li, Dacheng; Xing, Eric P.; Zhang, Hao; Gonzalez, Joseph E.; Stoica, Ion (2023). "Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena". arXiv:2306.05685 [cs.CL].
  9. ^ "Vicuna - Open-Source AI Chatbot".