Shortest Masterclass on Understanding AI: A Journey from Comparing Cars to Understanding Neural Networks

Rajya Vardhan Mishra
2 min read · Aug 11, 2023


If I ask you, "Is a Maruti Swift more similar to a Hyundai i20 or a BMW X7?"

The answer is simple. But how do you and your brain arrive at it?

You mentally compare features between the Swift (our query) and the i20, and derive a similarity score.

Then you compare the features of the Swift and the X7, and again arrive at a similarity score.

These features could be Price, Engine CC, Brand Value, etc.

Now the i20 would have a very high similarity score (say 0.88) and the X7 a lower one (say 0.56), right?

Answer: "The Swift is similar to the i20."
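To make that mental comparison concrete, here is a toy Python sketch. Every feature value below is a made-up illustration, not a real spec, and the scoring rule is a naive stand-in for whatever your brain (or a real model) actually does:

```python
# Toy feature table: price (lakh INR), engine (CC), brand value (0 to 1).
# Every number here is a made-up illustration, not a real spec.
cars = {
    "Swift": [6.5, 1197, 0.6],
    "i20":   [7.5, 1197, 0.6],
    "X7":    [120.0, 2998, 0.9],
}

def naive_similarity(a, b):
    """Score in (0, 1]: the closer the feature values, the nearer to 1."""
    diffs = [abs(x - y) / max(abs(x), abs(y), 1e-9) for x, y in zip(a, b)]
    return 1 - sum(diffs) / len(diffs)

print(naive_similarity(cars["Swift"], cars["i20"]))  # ~0.96, very similar
print(naive_similarity(cars["Swift"], cars["X7"]))   # ~0.37, not so much
```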

Congratulations! Intuitively, you just understood some fundamental concepts of ML/AI:

  • Dimension: a unique feature or attribute, like Price, Engine CC, etc.
  • Parameter: An algorithm understands only numbers. Period. So a corpus of text passes through the many layers of a deep learning (DL) model like GPT. The model identifies features and patterns, and based on those, training determines what weights and biases to use in its internal equations. These learned values of weights and biases are called parameters. In our car example, our brain's neural network would have done something similar (see the sketch after the parameter counts below).

GPT-3 has 175 billion parameters.
GPT-4 reportedly has a freaking 1.7 trillion parameters (yes, trillion with a T). This explains why GPT models are so GPU-hungry all the time :-P
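To make "parameters" concrete, here is a toy sketch of a single artificial neuron. Its three weights and one bias are its four parameters; GPT-scale models simply have billions of these. All the numbers below are invented for illustration:

```python
# A single artificial "neuron": score = w1*x1 + w2*x2 + w3*x3 + bias.
# Its weights and bias are the parameters, i.e. the values learned in training.
weights = [0.4, -0.002, 2.1]  # one learned weight per feature (made up here)
bias = 0.5                    # one learned bias (also made up)

def neuron(features):
    return sum(w * x for w, x in zip(weights, features)) + bias

# Score a made-up car feature vector (price, engine CC, brand value):
print(neuron([6.5, 1197, 0.6]))  # 4 parameters -> one output number
```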

  • Embedding: A deep learning algorithm uses these billions or trillions of parameters to convert simple words into long arrays of floating-point numbers. Think of each array as coordinates in a multi-dimensional space. When you use these coordinates (formally called vectors), you can embed that text in the space. That's what an embedding is. So we have converted text into numbers. What do we do with these embeddings (numbers) now? Some geometry and trigonometry!
  • Cosine Similarity: We measure the similarity between two coordinates by drawing a line from the origin to each coordinate (side note: such a line is called a vector). Then we find the angle between those two lines and take the cosine of that angle. Why cosine? Because cos(0°) is 1, and an angle of 0° means the two lines point the same way. So similar texts have a smaller angle between their embeddings. In our car example, the Swift's embedding is near the embedding of the i20, far from the X7, and even farther from a chair! A runnable sketch follows below.
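Here is a minimal, runnable sketch of cosine similarity, assuming NumPy is installed. The three-number "embeddings" are hand-made stand-ins; a real model would output vectors with hundreds or thousands of dimensions:

```python
import numpy as np

def cosine_similarity(a, b):
    """cos(angle) between two vectors: 1.0 means they point the same way."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hand-made 3-dimensional "embeddings" (real ones have far more dimensions):
swift = [0.90, 0.80, 0.10]
i20   = [0.85, 0.75, 0.15]
x7    = [0.30, 0.90, 0.80]
chair = [0.01, 0.05, 0.90]

print(cosine_similarity(swift, i20))    # ~0.999, nearly identical direction
print(cosine_similarity(swift, x7))     # ~0.71, less similar
print(cosine_similarity(swift, chair))  # ~0.13, very different
```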

Congratulations! Now you understand how Netflix recommends shows (when will they show Squid Game S2 🤯), and how LLMs (like GPT and Bard) use this Swiss Army knife under the hood.

What’s next?

(Hint: Implement all of the above in just 3 lines of code using LangChain!!)

(Fun Fact: I was shocked when I saw the results of these lines)

Previous Post: How to Create your own ChatGPT in 2 lines?

Follow me on this adventure of #AI, #Langchain, and #LLM.
