New ParagraWe summarized all relevant large language models in this blog post. If you want to learn what a LLM is read the following article about What is a Large Language Model.
If you want to see the latest benchmarks to evaluate the best LLMs have a look at the Hugging Face Benchmark or at sapling.ai. There is also the FLASK evaluation framework (Fine-grained Language Model Evaluation based on Alignment SKill Sets), a fine-grained evaluation protocol that can be used for both model-based and human-based evaluation).
Source: FLASK evaluation model
LLaMA 2 by Meta (Facebook) gained popularity because it is an open-source model that can be completely used for free also for commercial use cases. LLaMA-13B outperforms GPT-3 (175B) on most benchmarks, and LLaMA65B is competitive with the best models, Chinchilla-70B and PaLM-540B. This paper gives all details.
ChatGPT is the first Generative AI Chatbot presented by OpenAI to the market in November 2022, it is fine-tuned from either GPT-3.5 or GPT-4 Large Language Models using Reinforcement Learning from Human Feedback (RLHF). It allows us to chat in a conversational way, supporting many tasks like the answer to questions, writing summaries, debugging codes, generating texts and more.
Falcon first released in October 2021 was developed by a research center in Abu Dabi. It is optimized for performance and efficiency and focusses on high-quality data. "It outperforms GPT-3 for only 75% of the training compute budget—and requires a fifth of the compute at inference time."
Bard by Google was released on February 2023. It is based on the LLM PaLM 2 (May 2023) and originated from LaMDA Large Language Model. In contrast to ChatGPT the search data is from the web and in real-time meaning, it is not based on data until 2021. PaLM is multimodal, but only for it's domain specific model Med-PaLM 2.
h2oGPT is part of the platform H2O.ai supporting a variety of models: GPT 3.5 turbo, LLaMA 2, Falcon. You can use it either online or locally or with a UI.
Claude 2 by Anthropic works like ChatGPT, updates in real-time and is a upcoming competitor to ChatGPT and Bard.
Dolly was developed by Databricks and released in March 2023. It has a size of 12 billion parameters, based on EleutherAI’s Pythia model and fine-tuned on 15.000 record instruction corpus generated among Databricks employees.
Find a great comparison charts below from helpful sources.
Source: https://blog.zhaw.ch/artificial-intelligence/2023/04/20/deploy-your-own-open-source-language-model/
Source: towardsdatascience.com - https://towardsdatascience.com/choosing-the-right-language-model-for-your-nlp-use-case-1288ef3c4929
Source: Comparison of models can be easily done via sapling.ai
🚀 AI Strategy, business and tech support
🚀 ChatGPT, Generative AI & Conversational AI (Chatbot)
🚀 Support with AI product development
🚀 AI Tools and Automation
talk(at)voicetechhub.com
Etzbergstrasse 37, 8405 Winterthur
©VOOCE GmbH 2019 - 2025 - All rights reserved.
SWISS MADE. SWISS ENGINEERING.