Leadership needs us to do Gen AI, what do we do?


Chip Huyen (@chipro)
Jun ‘23
Agenda
1. Exploration
2. Building

2
Phase 1: Exploration
1. Set expectations
2. Minimize risks
3. Invest in things that last
4. Experiment

3
Set expectations
● Building some cool demos with LLMs -> easy
● Actually building a product with LLMs -> hard

● If you just want some cool demos to show customers that you’re ahead of the curve, go for it.
● If you just want your team to experiment and build out LLM muscle, go for it.
● If you want a product, set goals for what you expect that product will bring, and the resources you’re willing to invest.

4
There are a lot of things LLMs can do
Q: But can these things meaningfully transform your business?

A: Unclear

5
There are a lot of things LLMs can’t do NOW
Q: But will LLMs still be unable to do those things in the future?

A: Unclear

“When a distinguished but elderly scientist states that something is possible, he is almost certainly right. When he states that something is impossible, he is very probably wrong.”

- Arthur C. Clarke

6
We live in an era of change and uncertainty

In times of uncertainty, apply a decision-making framework to minimize regrets


(lessons from finance and reinforcement learning)

7
Minimize risks
1. Evaluate how disruptive gen AI is to your business
2. Figure out your data story
3. Avoid big, sweeping decisions

8
Evaluate how disruptive gen AI is to your business
1. If I don’t do anything, can competitors with gen AI make me obsolete?
a. Creative work: advertising, design, gaming, media, entertainment
b. A lot of document processing: legal, insurance, HR
2. If I don’t do anything, will I miss out on opportunities to boost revenue?
a. Customer support: chat, call centers
b. Search & recommendation
c. Productivity enhancement: automated note-taking, summarization, information aggregation
3. If there are opportunities, what advantages do I have to capture them?
a. Proprietary data
b. A100s lying around
c. Existing user base

9
Evaluate how disruptive gen AI is to your business
1. If I don’t do anything, competitors with gen AI can make me obsolete -> Go all in
a. Creative work: advertising, design, gaming, media, entertainment
b. A lot of document processing: legal, insurance, HR
2. If I don’t do anything, I’ll miss out on opportunities to boost revenue -> Build vs. buy decision
a. Customer support: chat, call centers
b. Search & recommendation
c. Productivity enhancement: automated note-taking, summarization, information aggregation
3. There are opportunities, and I have competitive advantages to capture them -> Make bets
a. Proprietary data
b. A lot of A100s lying around
c. Existing user base

10
Figure out your data story
1. Consolidate existing data across departments and sources
2. Update your data terms of use (see StackOverflow and Reddit)
3. Put guardrails around data quality + governance

Gen AI made it clear that data is essential to any company that wants to leverage AI.
Reach out if you want us to help you with your data story!

11
Avoid big, sweeping decisions
1. “Stop everything to figure out our generative AI.”
2. “Let’s buy as many A100s as we can.”

It’s okay to make big bets as long as you can back them up with evidence.
12
Invest in things that last

The future life expectancy of some non-perishable things, like a technology or an idea, is proportional to their current age

- Lindy’s Law

13
LLM fundamentals have been around for a while
● Language modeling (1951)
● Embeddings (2003)
● Vector databases:
○ Facebook’s Faiss (2017)
○ Google’s ScaNN (2020)
● Making data faster, cheaper, more accessible will always be important (Claypot)

Book cover photo from Kuenzig Books

14


Personal litmus test
Does this seem hacky to me?

● In-context learning vs. prompt engineering

15
Model architectures, tools, and techniques will certainly evolve
AI literacy will be less about how to build a transformer model from scratch, and more about how to use AI appropriately

16
Experiment
● Timebox your experiment
● Clarify the decisions you want to make by the end
● APIs are cheap and easy to experiment with (see the sketch below)
○ $100 and one weekend can take you a long way!!

17
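A minimal sketch of what a cheap weekend API experiment can look like, calling the OpenAI chat completions HTTP endpoint directly with requests. The model name, prompt, and API key handling are placeholders; swap in whichever provider you are evaluating.

```python
# Minimal API experiment: send one prompt, print the reply and token usage.
# Assumes an OPENAI_API_KEY environment variable; model and prompt are placeholders.
import os
import requests

API_URL = "https://api.openai.com/v1/chat/completions"

def ask(prompt: str, model: str = "gpt-3.5-turbo") -> str:
    resp = requests.post(
        API_URL,
        headers={"Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}"},
        json={"model": model, "messages": [{"role": "user", "content": prompt}]},
        timeout=30,
    )
    resp.raise_for_status()
    data = resp.json()
    print("total tokens:", data["usage"]["total_tokens"])  # keep an eye on spend
    return data["choices"][0]["message"]["content"]

if __name__ == "__main__":
    print(ask("Summarize this support ticket in one sentence: my order never arrived."))
```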
Understand LLM behaviors (dealbreakers??)
1. Ambiguous inputs + outputs
2. Hallucination vs. factuality
3. Privacy: how do you ensure LLMs don’t reveal your users’ PII?
4. Unstable infra: performance + latency
5. Inference cost
6. Forward & backward compatibility

See: Building LLM applications for production


18
Phase 2: Building
1. Understand the LLM stack
2. Implement:
a. Gather data
b. Choose a model
c. Get the most out of each layer of the stack before moving to the next
3. Evaluate

19
The LLM stack
● LLM part
○ Prompt engineering
○ Finetuning, distillation
○ Training a model from scratch
● Infra around LLM
○ Databases
○ Logs
○ Caching (see the sketch below)

20
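To illustrate the caching item above, a sketch of a prompt-level cache that serves repeated identical requests from memory instead of re-calling the model; call_llm is a hypothetical stub standing in for whatever API or local model you use.

```python
# Prompt-level cache sketch: identical prompts hit the model only once.
# call_llm is a made-up placeholder for a real API call or local inference.
import hashlib

_cache = {}

def call_llm(prompt: str) -> str:
    # Placeholder: swap in a real model call here.
    return f"(model response to: {prompt})"

def cached_completion(prompt: str) -> str:
    key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    if key not in _cache:
        _cache[key] = call_llm(prompt)  # only the first occurrence pays for inference
    return _cache[key]

if __name__ == "__main__":
    print(cached_completion("What is our refund policy?"))
    print(cached_completion("What is our refund policy?"))  # served from the cache
```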
Prompting
● Instruction: “You’re an unbiased professor. For each input, give it a score from 0 to 10.”
● { examples } + { input } -> Pretrained model -> { output } (see the sketch below)

Finetuning
● Pretrained model + { examples } -> Finetuned model
● { input } -> Finetuned model -> { output }
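As a concrete reading of the prompting flow above, a sketch of how the instruction, { examples }, and { input } might be assembled into a single prompt string; the example essays and scores are invented for illustration.

```python
# Sketch of the prompting template from the slide: instruction + scored examples + new input.
# All example data below is made up.
SYSTEM = "You're an unbiased professor. For each input, give it a score from 0 to 10."

EXAMPLES = [
    ("The essay argues both sides with clear evidence.", 8),
    ("The essay is off-topic and cites no sources.", 2),
]

def build_prompt(new_input: str) -> str:
    shots = "\n".join(f"Input: {text}\nScore: {score}" for text, score in EXAMPLES)
    return f"{SYSTEM}\n\n{shots}\n\nInput: {new_input}\nScore:"

if __name__ == "__main__":
    print(build_prompt("The essay has a strong thesis but weak sourcing."))
```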


RLHF
● Pretrained LLM
○ Data: low-quality text, e.g. Internet data (scale: >1 trillion tokens)
○ Training: language modeling, optimized for text completion
○ Examples: GPT-x, Gopher, Falcon, LLaMa, Pythia, Bloom, StableLM
● SFT model
○ Data: high-quality demonstration data, (prompt, response) pairs (scale: 10K - 100K)
○ Training: supervised finetuning, finetuned for dialogue
○ Examples: Dolly-v2, Falcon-Instruct
● Reward model
○ Data: human feedback, comparison data as (prompt, winning_response, losing_response) (scale: 100K - 1M comparisons)
○ Training: classification, trained to give a scalar score for (prompt, response) (loss sketched below)
● Final model
○ Data: prompts (scale: 10K - 100K prompts)
○ Training: reinforcement learning, optimized to generate responses that maximize scores from the reward model
○ Examples: InstructGPT, ChatGPT, Claude, StableVicuna

Scale figures as of May ’23. Open-sourced models were bolded on the original slide.

See: RLHF: Reinforcement Learning from Human Feedback
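As a hedged aside, not taken from the slide: in the standard InstructGPT-style setup, the reward model is fit on the comparison data with a pairwise loss that pushes the score of the winning response above the losing one:

```latex
% x = prompt, y_w = winning response, y_l = losing response, r_theta = reward model
\mathcal{L}(\theta) = -\,\mathbb{E}_{(x,\, y_w,\, y_l)}\left[ \log \sigma\!\left( r_\theta(x, y_w) - r_\theta(x, y_l) \right) \right]
```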
Choose a model size
A 7B param model can run on a Macbook (see the arithmetic sketch below):
● bfloat16 = 14GB memory
● int8 = 7GB memory

A 7B param model costs approx*:
● $1,000 to finetune
● $25,000 to train from scratch

[Chart: cost and performance vs. model size; 5 - 13B param models marked as the sweet spot, between models finetuned for specific tasks and larger general models]

* Highly dependent on how much data

23
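A quick back-of-the-envelope check on the memory figures above; this counts raw weights only and ignores activations, KV cache, and framework overhead.

```python
# Rough weight-only memory footprint: number of parameters * bytes per parameter.
def weight_memory_gb(n_params: float, bytes_per_param: float) -> float:
    return n_params * bytes_per_param / 1e9

print(weight_memory_gb(7e9, 2))  # bfloat16 (2 bytes/param) -> 14.0 GB
print(weight_memory_gb(7e9, 1))  # int8     (1 byte/param)  -> 7.0 GB
```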


Evaluate
● Tie evaluation to your OWN business metrics
● Build your own test set (see the sketch below)
● Beware of standardized evaluation: it’s still catching up with real use cases

24
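A minimal sketch of a do-it-yourself test set tied to a business-flavored metric; the cases, the required phrases, and answer_fn are all hypothetical placeholders.

```python
# Tiny custom eval harness: score your own test cases with a metric you care about.
# TEST_SET and answer_fn are made-up placeholders.
TEST_SET = [
    {"input": "Customer asks about a refund after 30 days.", "must_include": "refund policy"},
    {"input": "Customer reports a damaged item on arrival.", "must_include": "replacement"},
]

def answer_fn(prompt: str) -> str:
    # Placeholder: call your prompted or finetuned model here.
    return "Per our refund policy, purchases over 30 days old are not refundable."

def run_eval(answer) -> float:
    hits = sum(case["must_include"].lower() in answer(case["input"]).lower()
               for case in TEST_SET)
    return hits / len(TEST_SET)

if __name__ == "__main__":
    print(f"pass rate: {run_eval(answer_fn):.0%}")
```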
Takeaways
1. Set concrete goals
2. Data story is more important now than ever
3. Invest in things that last
4. Experiment with APIs, build with open-source
5. Understand LLM behaviors: which of them are dealbreakers for your use case?
6. Choose a model size that balances cost and performance
7. Always tie model evaluation to your business metrics
8. Have fun!

25
Thank you!
@chipro
linkedin.com/in/chiphuyen
[email protected]
