Top 10 LLMs you should know

6 min readMay 1, 2024

Welcome back, fellow geeks. Let’s check out what’s new in the field of AI this week.

What I have for you today:

Top 10 LLMs you should know, and how they’re used.
A curated list of free AI courses from Google and Amazon.
5 best performing ChatGPT prompting frameworks.
Weekly Pulse on freshest updates in AI world. Edition №9

Top 10 LLMs you should know, and how they’re used

An LLM, or large language model, is a general-purpose AI text generator. It’s what’s behind the scenes of all AI chatbots.

To produce correct and coherent output, LLMs are trained on huge corpuses of data.

There are dozens of LLMs, and you are most certainly lost in all these “gpts” and “lamas”.

Here’s the simple guide how to differentiate top 10 most significant, interesting, and popular LLMs, and where are they used.

1. GPT by OpenAI — 175B+ parameters, used by ChatGPT, Microsoft, Duolingo, Stripe, Zapier, Dropbox

2. Gemini by Google — parameter data is largely unknown, used by Google in some Android apps and Google AI portal, previously known as Bard

3. PaLM 2 by Google — 340B parameters, used in Google Docs, Gmail and other Google Suite apps

4. Llama by Meta — 7/8/13/70B parameters, used in Meta AI assistant, across Meta applications (Facebook, Instagram, Threads), open source

5. Claude by Anthropic — parameter data is unknown, used in Slack, Notion, Zoom

6. Grok by xAI — parameter data is unknown, used in X(Twitter), Grok Chatbot

7. Mixtral by Mistral AI — 39B active out of 141B parameters (SMoE), academic and commercial use, partnership with IBM, ChatGPT-like “Le Chat”, open source

8. Command, Rerank and Embed by Cohere — parameter data is unknown, used in Notion, Jasper, HyperWrite

9. StableLM by Stability AI — 3/7B parameters available, 15/30/65/175B in progress, part of StableDiffusion, open source

10. Vicuna by LMSYS Org — 7/13/33B parameters, used in LMSYS Chatbot Arena, open source

But why so many of them?

↳ OpenAI’s research and effort showed the practical side of LLMs, so others started doing the same.

↳ Many models could be retrained or adapted so companies could tailor those to their needs.

↳ Despite the difficulty to train an LLM, it becomes more affordable and takes less time (weeks or months).

↳ Investment landscape is plentiful in AI field.

Which other prominent LLMs are worth mentioning?

A curated list of free AI courses from Google and Amazon

Last week I wrote about Microsoft, today adding Google and Amazon to this list.

1. Google courses today (intro to advanced level, links in attached pdf):

Beginner level:

↳ Introduction to Large Language Models — 30 minutes

↳ Introduction to Generative AI — 45 minutes

↳ Introduction to Image Generation — 8 hours

↳ Transformer Models and BERT Model — 8 hours

Intermediate level:

↳ Attention Mechanism — 8 hours

↳ Encoder-Decoder Architecture — 8 hours

↳ Create Image Captioning Models — 8 hours

↳ Gemini for Google Cloud Learning Path — 8 activities, 45 minutes to 6 hours each

Advanced level:

↳ Generative AI for Developers Learning Path — 11 activities, 30 minutes to 8 hours each

2. Amazon courses (again, intro to advanced level, links in attached pdf):

Beginner level:

↳ Introduction to Machine Learning on AWS — 6 hours, beginner

↳ Generative AI Learning Plan for Decision Makers — 3 hours

↳ Amazon Transcribe Getting Started — 1.5 hours

Intermediate level:

↳ Low-Code Machine Learning on AWS — 4 hours

↳ Generative AI Learning Plan for Developers — 11 hours, Python-specific

↳ Machine Learning Learning Plan — 11 hours

↳ Foundations of Prompt Engineering — 4 hours

Advanced level:

↳ Building Language Models on AWS — 5.5 hours

Bookmark it!

Every week I’m going to add to this pack!

5 best performing ChatGPT prompting frameworks

Using ChatGPT correctly is simply about communicating well and managing your expectations. If you excel at both in the real world, you can manage AI tools to generate wealth in the 21st century.

Here are 5 of the most effective prompting schemes for ChatGPT:

1. R-T-F — Role, Task, Format

↳ “Act like a [insert the ROLE you want AI to take]. Give me a [insert TASK] in [insert FORMAT] format.”

2. T-A-G — Task, Action, Goal

↳ “I need to accomplish [insert the TASK description]. In the process, I will perform [insert ACTIONs or processes to be undertaken]. As a result, I must achieve [insert the desired GOAL].”

3. C-A-R-E — Context, Action, Result, Example

↳ “Here’s the background of the task: [insert CONTEXT information]. I want you to perform [insert ACTIONs for AI to undertake] and achieve [insert description of desired RESULT]. Here’s an example: [insert EXAMPLE clue as a concrete illustration].”

4. R-I-S-E — Role, Input, Steps, Expectation

↳ “Act as [insert ROLE AI should take]. Here’s what you need to know: [insert INPUT describing situation or resources available]. Perform these steps: [insert STEPS instructions], and give me a [insert EXPECTATION of the desired outcome].”

5. R-A-C-E — Role, Action, Context, Expectation

↳ “Act as a [insert ROLE AI should take]. I want you to perform [insert ACTIONs for AI to undertake]. Here’s the background of the task: [insert CONTEXT information], give me a [insert EXPECTATION of the desired outcome].”

There are dozens more, obviously. Which are working for you?

Weekly Pulse on freshest updates in AI world. Edition №9

Curious of latest AI? I read dozens of article, so you don’t have to.

In full on the latest advances in AI and its implications.

Edition #9: 8 compact models for the masses from Apple, Meta unlocks Metaverse’ open OS, Microsoft’s tiny Phi-3 model, Biotech Profluent firm edits human genome with AI, Meta’s smart Ray-Ban major upgrade, Llama 3 topples ChatGPT, and more.

8 compact models for the masses from Apple:

submits 8 new AI models to Hugging Face
models are called OpenELMs, openly available for use
smaller models, efficient for devices, not cloud
also better privacy — no cloud upload
unusual move for Apple’s typically closed software

Meta unlocks Metaverse’ open OS:

opening Meta Quest OS to 3rd-party hardware
introduces Meta Horizon OS for mixed reality
combines tracking, blending digital-physical
leverages Android AOSP for mobile foundations
includes app store, social features, companion app
aims to bridge virtual worlds across devices

Microsoft’s tiny Phi-3 model:

smaller, affordable
comes in Mini (3.8B), Small (7B), and Medium (14B)
Mini 3.8B beats Llama 3’s 7B model on benchmarks
very compact: some users could run it on phones
smaller size usually means lower costs for users
good option for budget-conscious companies/users

Biotech firm Profluent edits human genome with AI:

GenAI aiding deep scientific research
reveals open-source AI-generated gene editor
enables precise editing of human genome
aims to create cures for currently untreatable diseases
technology available to license for ethical research
major step forward in using AI for scientific research

Meta’s smart Ray-Ban major upgrade:

smart glasses collection expanded
adding video calling with WhatsApp and Messenger
Meta AI with Vision for hands-free info
users can ask glasses about what they’re seeing
providing information without using hands

Llama 3 topples ChatGPT, experts claim:

Wharton prof says 8B Llama 3 beats free ChatGPT-3.5
Llama 3’s 70B model ranks 2nd only to GPT-4 Turbo on Lmsys
Llama 3 models are open-source, cheaper to run

One-liners:

Nvidia is acquiring the Israeli startup Run:ai, which built a Kubernetes-based GPU orchestrator, for approx. $700 million, highlighting the growing importance of Kubernetes in the generative AI era.
Inspired by AI-powered coding tools, former Microsoft engineer Igor Ostrovsky built Augment, an AI-powered coding platform that emerged from stealth with $252 million in funding from incl. former Google CEO Eric Schmidt.
The Rabbit R1, a new AI-powered mobile device from Humane, is impressing reviewers despite a rocky start for the company’s previous AI gadget, though it still has some limitations.
Elon Musk’s xAI, a dreaded competitor to OpenAI, is reportedly close to raising $6bn in funding.

Hey there!

With 15 years of experience in software development, system architecture, and team leadership, I’m passionate now about helping business leaders realise the power of AI.

I usually write for my LinkedIn. Follow me there for actionable guidance on AI, leadership and strategy!

Cheers!

Top 10 LLMs you should know

Top 10 LLMs you should know, and how they’re used

A curated list of free AI courses from Google and Amazon

5 best performing ChatGPT prompting frameworks

Written by Evgeny Krapivin – In full about AI

No responses yet