00:00:00.053 --> 00:00:01.435
[MUSIC    

00:00:01.435 --> 00:00:02.679
[BOOK PASSAGE] 

00:00:02.680 --> 00:00:08.160
PETER LEE: “We need to start understanding and&nbsp;
discussing AI’s potential for good and ill now.&nbsp;&nbsp;

00:00:08.160 --> 00:00:16.749
Or rather, yesterday. … GPT-4 has game-changing&nbsp;
potential to improve medicine and health.”

00:00:16.749 --> 00:00:16.800
[END OF BOOK PASSAGE]
[THEME MUSIC]

00:00:16.800 --> 00:00:26.160
This is The AI Revolution in Medicine,&nbsp;
Revisited. I’m your host, Peter Lee.    

00:00:30.000 --> 00:00:35.840
Shortly after OpenAI's GPT-4 was publicly&nbsp;
released, Carey Goldberg, Dr. Zak Kohane,&nbsp;&nbsp;

00:00:35.840 --> 00:00:41.360
and I published The AI Revolution in Medicine&nbsp;
to help educate the world of healthcare and&nbsp;&nbsp;

00:00:41.360 --> 00:00:45.760
medical research about the transformative&nbsp;
impact this new generative AI technology&nbsp;&nbsp;

00:00:45.760 --> 00:00:53.040
could have. But because we wrote the book when&nbsp;
GPT-4 was still a secret, we had to speculate.&nbsp;&nbsp;

00:00:53.040 --> 00:00:58.320
Now, two years later, what did we get&nbsp;
right, and what did we get wrong?     

00:00:58.320 --> 00:01:02.960
In this series, we’ll talk to clinicians,&nbsp;
patients, hospital administrators,&nbsp;&nbsp;

00:01:02.960 --> 00:01:12.297
and others to understand the reality of AI&nbsp;
in the field and where we go from here.      

00:01:12.297 --> 00:01:12.320
[THEME MUSIC FADES] 

00:01:12.320 --> 00:01:16.000
The passage I read at the top&nbsp;
is from the book’s prologue.  

00:01:16.000 --> 00:01:20.560
When Carey, Zak, and I wrote the book, we&nbsp;
could only speculate how generative AI would&nbsp;&nbsp;

00:01:20.560 --> 00:01:26.320
be used in healthcare because GPT-4 hadn't&nbsp;
yet been released. It wasn't yet available&nbsp;&nbsp;

00:01:26.320 --> 00:01:32.240
to the very people we thought would be&nbsp;
most affected by it. And while we felt&nbsp;&nbsp;

00:01:32.240 --> 00:01:37.120
strongly that this new form of AI would&nbsp;
have the potential to transform medicine,&nbsp;&nbsp;

00:01:37.120 --> 00:01:42.240
it was such a different kind of technology&nbsp;
for the world, and no one had a user's&nbsp;&nbsp;

00:01:42.240 --> 00:01:48.880
manual for this thing to explain how to use&nbsp;
it effectively and also how to use it safely.

00:01:48.880 --> 00:01:53.040
So we thought it would be important to give&nbsp;
healthcare professionals and leaders a framing&nbsp;&nbsp;

00:01:53.040 --> 00:01:59.760
to start important discussions around its use. We&nbsp;
wanted to provide a map not only to help people&nbsp;&nbsp;

00:01:59.760 --> 00:02:04.800
navigate a new world that we anticipated&nbsp;
would happen with the arrival of GPT-4 but&nbsp;&nbsp;

00:02:04.800 --> 00:02:09.760
also to help them chart a future of what we&nbsp;
saw as a potential revolution in medicine.

00:02:10.560 --> 00:02:16.880
So I'm super excited to welcome my coauthors:&nbsp;
longtime medical/science journalist Carey Goldberg&nbsp;&nbsp;

00:02:16.880 --> 00:02:21.440
and Dr. Zak Kohane, the inaugural chair&nbsp;
of Harvard Medical School's Department of&nbsp;&nbsp;

00:02:21.440 --> 00:02:27.280
Biomedical Informatics and the editor-in-chief&nbsp;
for The New England Journal of Medicine AI.

00:02:27.280 --> 00:02:32.560
We're going to have two discussions. This will&nbsp;
be the first one about what we've learned from&nbsp;&nbsp;

00:02:32.560 --> 00:02:37.920
the people on the ground so far and how&nbsp;
we are thinking about generative AI today.

00:02:37.920 --> 00:02:42.800
[TRANSITION MUSIC]

00:02:42.800 --> 00:02:45.984
Carey, Zak, I'm really looking forward to this.

00:02:45.984 --> 00:02:48.320
CAREY GOLDBERG: It's nice to see you, Peter.

00:02:48.320 --> 00:02:50.843
LEE: [LAUGHS] It's great to see you, too.

00:02:50.843 --> 00:02:54.875
GOLDBERG: We missed you.
ZAK KOHANE: The dynamic gang is back. [LAUGHTER]

00:02:54.875 --> 00:02:58.320
LEE: Yeah, and I guess after that&nbsp;
big book project two years ago,&nbsp;&nbsp;

00:02:58.320 --> 00:03:02.800
it's remarkable that we're still on&nbsp;
speaking terms with each other. [LAUGHTER]

00:03:02.800 --> 00:03:08.080
In fact, this episode is to react&nbsp;
to what we heard in the first four&nbsp;&nbsp;

00:03:08.080 --> 00:03:11.840
episodes of this podcast. But before we get&nbsp;
there, I thought maybe we should start with&nbsp;&nbsp;

00:03:11.840 --> 00:03:19.040
the origins of this project just now&nbsp;
over two years ago. And, you know,&nbsp;&nbsp;

00:03:19.040 --> 00:03:25.280
I had this early secret access&nbsp;
to Davinci 3, now known as GPT-4.

00:03:25.280 --> 00:03:31.280
I remember, you know, experimenting right&nbsp;
away with things in medicine, but I realized&nbsp;&nbsp;

00:03:31.280 --> 00:03:38.240
I was in way over my head. And so I wanted&nbsp;
help. And the first person I called was you,&nbsp;&nbsp;

00:03:38.240 --> 00:03:47.920
Zak. And you remember we had a call, and&nbsp;
I tried to explain what this was about.&nbsp;&nbsp;

00:03:47.920 --> 00:03:53.840
And I think I saw skepticism in—polite&nbsp;
skepticism—in your eyes. But tell me,&nbsp;&nbsp;

00:03:53.840 --> 00:03:58.080
you know, what was going through your head&nbsp;
when you heard me explain this thing to you?

00:03:58.080 --> 00:04:05.280
KOHANE: So I was divided between the fact&nbsp;
that I have tremendous respect for you,&nbsp;&nbsp;

00:04:05.280 --> 00:04:11.440
Peter. And you've always struck me as sober.&nbsp;
And we've had conversations which showed to&nbsp;&nbsp;

00:04:11.440 --> 00:04:18.400
me that you fully understood some of the&nbsp;
missteps that technology—ARPA, Microsoft,&nbsp;&nbsp;

00:04:18.400 --> 00:04:24.640
and others—had made in the past. And yet,&nbsp;
you were telling me a full science fiction&nbsp;&nbsp;

00:04:24.640 --> 00:04:30.560
compliant story [LAUGHTER] that something that&nbsp;
we thought was 30 years away was happening now.

00:04:30.560 --> 00:04:31.760
LEE: Mm-hmm.

00:04:31.760 --> 00:04:37.760
KOHANE: And it was very hard for me&nbsp;
to put together. And so I couldn't&nbsp;&nbsp;

00:04:37.760 --> 00:04:44.080
quite tell myself this is BS, but I said,&nbsp;
you know, I need to look at it. Just this&nbsp;&nbsp;

00:04:44.080 --> 00:04:49.360
seems too good to be true. What is this? So&nbsp;
it was very hard for me to grapple with it.&nbsp;&nbsp;

00:04:49.360 --> 00:04:53.200
I was thrilled that it might be possible, but&nbsp;
I was thinking, How could this be possible?

00:04:53.200 --> 00:04:59.680
LEE: Yeah. Well, even now, I look back,&nbsp;
and I appreciate that you were nice to me,&nbsp;&nbsp;

00:04:59.680 --> 00:05:04.640
because I think a lot of people would have&nbsp;
[LAUGHS] been much less polite. And in fact,&nbsp;&nbsp;

00:05:04.640 --> 00:05:10.960
I myself had expressed a lot of&nbsp;
very direct skepticism early on.

00:05:10.960 --> 00:05:15.040
After ChatGPT got released, I&nbsp;
think three or four days later,&nbsp;&nbsp;

00:05:15.040 --> 00:05:20.640
I received an email from a colleague running ...&nbsp;
who runs a clinic, and, you know, he said, “Wow,&nbsp;&nbsp;

00:05:20.640 --> 00:05:25.840
this is great, Peter. And, you know,&nbsp;
we're using this ChatGPT, you know,&nbsp;&nbsp;

00:05:25.840 --> 00:05:32.000
to have the receptionist in our clinic&nbsp;
write after-visit notes to our patients.”

00:05:32.000 --> 00:05:40.000
And that sparked a huge internal discussion&nbsp;
about this. And you and I knew enough about&nbsp;&nbsp;

00:05:40.000 --> 00:05:46.080
hallucinations and about other issues that&nbsp;
it seemed important to write something about&nbsp;&nbsp;

00:05:46.080 --> 00:05:51.120
what this could do and what it couldn’t do.&nbsp;
And so I think, I can't remember the timing,&nbsp;&nbsp;

00:05:51.120 --> 00:05:58.400
but you and I decided a book would be a good idea.&nbsp;
And then I think you had the thought that you and&nbsp;&nbsp;

00:05:58.400 --> 00:06:04.240
I would write in a hopelessly academic style&nbsp;
[LAUGHTER] that no one would be able to read.

00:06:04.240 --> 00:06:08.480
So it was your idea to&nbsp;
recruit Carey, I think, right?

00:06:08.480 --> 00:06:16.240
KOHANE: Yes, it was. I was sure&nbsp;
that we both had a lot of material,&nbsp;&nbsp;

00:06:16.240 --> 00:06:22.720
but communicating it effectively to the very&nbsp;
people we wanted to would not go well if we&nbsp;&nbsp;

00:06:22.720 --> 00:06:29.680
just left ourselves to our own devices.&nbsp;
And Carey is super brilliant at what she&nbsp;&nbsp;

00:06:29.680 --> 00:06:36.720
does. She's an idea synthesizer and public&nbsp;
communicator in the written word and amazing.

00:06:36.720 --> 00:06:41.440
LEE: So yeah. So, Carey, we&nbsp;
contact you. How did that go?

00:06:41.440 --> 00:06:48.000
GOLDBERG: So yes. On my end, I had known Zak for&nbsp;
probably, like, 25 years, and he had always been&nbsp;&nbsp;

00:06:48.000 --> 00:06:53.360
the person who debunked the scientific hype for&nbsp;
me. I would turn to him with like, “Hmm, they're&nbsp;&nbsp;

00:06:53.360 --> 00:06:57.200
saying that the Human Genome Project is going to&nbsp;
change everything.” And he would say, “Yeah. But&nbsp;&nbsp;

00:06:57.200 --> 00:07:00.800
first it'll be 10 years of bad news, and then&nbsp;
[LAUGHTER] we'll actually get somewhere.”  
 &nbsp;

00:07:00.800 --> 00:07:07.840
So when Zak called me up at seven o'clock&nbsp;
one morning, just beside himself after&nbsp;&nbsp;

00:07:07.840 --> 00:07:13.360
having tried Davinci 3, I knew that there&nbsp;
was something very serious going on. And&nbsp;&nbsp;

00:07:13.360 --> 00:07:18.240
I had just quit my job as the Boston bureau&nbsp;
chief of Bloomberg News, and I was ripe for&nbsp;&nbsp;

00:07:18.240 --> 00:07:24.560
the plucking. And I also … I feel kind of&nbsp;
nostalgic now about just the amazement and&nbsp;&nbsp;

00:07:24.560 --> 00:07:31.440
the wonder and the awe of that period. We&nbsp;
knew that when generative AI hit the world,&nbsp;&nbsp;

00:07:31.440 --> 00:07:36.000
there would be all kinds of snags and obstacles&nbsp;
and things that would slow it down, but at that&nbsp;&nbsp;

00:07:36.000 --> 00:07:42.240
moment, it was just like the holy crap moment.&nbsp;
[LAUGHTER] And it's fun to think about it now.

00:07:42.240 --> 00:07:46.000
LEE: Yeah. I think ultimately,&nbsp;
you know, recruiting Carey,&nbsp;&nbsp;

00:07:48.080 --> 00:07:51.280
you were [LAUGHS] so important because you&nbsp;
basically went through every single page&nbsp;&nbsp;

00:07:51.280 --> 00:07:55.360
of this book and made sure … I remember, in&nbsp;
fact, it's affected my writing since because&nbsp;&nbsp;

00:07:56.400 --> 00:08:00.560
you were coaching us that every page has to&nbsp;
be a page turner. There has to be something&nbsp;&nbsp;

00:08:00.560 --> 00:08:05.640
on every page that motivates people to want&nbsp;
to turn the page and get to the next one.

00:08:05.640 --> 00:08:09.760
KOHANE: I will see that and&nbsp;
raise that one. I now tell GPT-4,&nbsp;&nbsp;

00:08:09.760 --> 00:08:11.842
please write this in the style of Carey Goldberg.

00:08:11.842 --> 00:08:13.512
GOLDBERG: [LAUGHTER] No way! Really?

00:08:13.512 --> 00:08:16.859
KOHANE: Yes way. Yes way. Yes way.

00:08:16.859 --> 00:08:20.400
GOLDBERG: Wow. Well, I have to say, like, it's&nbsp;
not hard to motivate readers when you're writing&nbsp;&nbsp;

00:08:20.400 --> 00:08:26.000
about the most transformative technology of their&nbsp;
lifetime. Like, I think there's a gigantic hunger&nbsp;&nbsp;

00:08:26.000 --> 00:08:34.395
to read and to understand. So you were not&nbsp;
hard to work with, Peter and Zak. [LAUGHS]

00:08:34.395 --> 00:08:39.680
LEE: All right. So I think we have&nbsp;
to get down to work [LAUGHS] now.

00:08:40.400 --> 00:08:46.000
Yeah, so for these podcasts, you know, we're&nbsp;
talking to different types of people to just&nbsp;&nbsp;

00:08:46.000 --> 00:08:50.320
reflect on what's actually happening, what has&nbsp;
actually happened over the last two years. And&nbsp;&nbsp;

00:08:50.320 --> 00:08:59.600
so the first episode, we talked to two doctors.&nbsp;
There's Chris Longhurst at UC San Diego and Sara&nbsp;&nbsp;

00:08:59.600 --> 00:09:07.280
Murray at UC San Francisco. And besides being&nbsp;
doctors and having AI affect their clinical work,&nbsp;&nbsp;

00:09:07.280 --> 00:09:10.800
they just happen also to be leading&nbsp;
the efforts at their respective&nbsp;&nbsp;

00:09:10.800 --> 00:09:18.880
institutions to figure out how best to&nbsp;
integrate AI into their health systems.

00:09:18.880 --> 00:09:27.120
And, you know, it was fun to talk to them.&nbsp;
And I felt like a lot of what they said&nbsp;&nbsp;

00:09:27.120 --> 00:09:34.640
was pretty validating for us. You know, they&nbsp;
talked about AI scribes. Chris, especially,&nbsp;&nbsp;

00:09:34.640 --> 00:09:43.680
talked a lot about how AI can respond to emails&nbsp;
from patients, write referral letters. And then,&nbsp;&nbsp;

00:09:43.680 --> 00:09:48.240
you know, they both talked about the&nbsp;
importance of—I think, Zak, you used the&nbsp;&nbsp;

00:09:48.240 --> 00:09:54.160
phrase in our book “trust but verify”—you&nbsp;
know, to have always a human in the loop.

00:09:54.720 --> 00:10:00.720
What did you two take away from their&nbsp;
thoughts overall about how doctors are&nbsp;&nbsp;

00:10:00.720 --> 00:10:06.560
using ... and I guess, Zak, you would have&nbsp;
a different lens also because at Harvard,&nbsp;&nbsp;

00:10:07.120 --> 00:10:10.080
you see doctors all the time grappling with AI.

00:10:10.080 --> 00:10:15.680
KOHANE: So on the one hand, I think&nbsp;
they've done some very interesting studies.&nbsp;&nbsp;

00:10:16.720 --> 00:10:23.440
And indeed, they saw that when&nbsp;
these generative models, when GPT-4,&nbsp;&nbsp;

00:10:23.440 --> 00:10:31.120
was sending a note to patients,&nbsp;
it was more detailed, friendlier.

00:10:31.120 --> 00:10:37.360
But there were also some nonobvious results,&nbsp;
which is on the generation of these letters,&nbsp;&nbsp;

00:10:37.360 --> 00:10:40.400
if indeed you review them as you're supposed to,&nbsp;&nbsp;

00:10:40.400 --> 00:10:46.720
it was not clear that there was any time&nbsp;
savings. And my own reaction was, Boy,&nbsp;&nbsp;

00:10:46.720 --> 00:10:52.000
every one of these things needs institutional&nbsp;
review. It's going to be hard to move fast.

00:10:52.000 --> 00:10:58.800
And yet, at the same time, we know from them&nbsp;
that the doctors on their smartphones are&nbsp;&nbsp;

00:10:58.800 --> 00:11:05.440
accessing these things all the time. And so&nbsp;
the disconnect between a healthcare system,&nbsp;&nbsp;

00:11:05.440 --> 00:11:15.040
which is duty bound to carefully look at every&nbsp;
implementation, is, I think, intimidating.

00:11:15.040 --> 00:11:16.160
LEE: Yeah.

00:11:16.160 --> 00:11:22.080
KOHANE: And at the same time, doctors who&nbsp;
just have to do what they have to do are&nbsp;&nbsp;

00:11:22.080 --> 00:11:28.235
using this new superpower and doing it.&nbsp;
And so that's actually what struck me ...

00:11:28.235 --> 00:11:28.708
LEE: Yeah.

00:11:28.708 --> 00:11:31.600
KOHANE: ... is that these are two&nbsp;
leaders and they're doing what they&nbsp;&nbsp;

00:11:31.600 --> 00:11:37.200
have to do for their institutions,&nbsp;
and yet there's this disconnect.

00:11:37.200 --> 00:11:44.240
And by the way, I don't think we've seen&nbsp;
any faster technology adoption than the&nbsp;&nbsp;

00:11:44.240 --> 00:11:51.120
adoption of ambient dictation. And it's&nbsp;
not because it's time saving. And in fact,&nbsp;&nbsp;

00:11:51.120 --> 00:11:54.880
so far, the hospitals have to pay out of&nbsp;
pocket. It's not like insurance is paying&nbsp;&nbsp;

00:11:54.880 --> 00:12:01.200
them more. But it's so much more pleasant for&nbsp;
the doctors ... not least of which because they&nbsp;&nbsp;

00:12:01.200 --> 00:12:06.320
can actually look at their patients instead&nbsp;
of looking at the terminal and plunking down.

00:12:06.320 --> 00:12:07.960
LEE: Carey, what about you?

00:12:07.960 --> 00:12:12.400
GOLDBERG: I mean, anecdotally, there are time&nbsp;
savings. Anecdotally, I have heard quite a few&nbsp;&nbsp;

00:12:12.400 --> 00:12:19.200
doctors saying that it cuts down on “pajama time”&nbsp;
to be able to have the note written by the AI and&nbsp;&nbsp;

00:12:19.200 --> 00:12:24.480
then for them to just check it. In fact, I spoke&nbsp;
to one doctor who said, you know, basically it&nbsp;&nbsp;

00:12:24.480 --> 00:12:29.120
means that when I leave the office, I've left&nbsp;
the office. I can go home and be with my kids.

00:12:29.120 --> 00:12:36.800
So I don't think the jury is fully in yet&nbsp;
about whether there are time savings. But&nbsp;&nbsp;

00:12:36.800 --> 00:12:39.680
what is clear is, Peter, what you&nbsp;
predicted right from the get-go,&nbsp;&nbsp;

00:12:39.680 --> 00:12:42.720
which is that this is going to be&nbsp;
an amazing paper shredder. Like,&nbsp;&nbsp;

00:12:42.720 --> 00:12:48.800
the main first overarching use&nbsp;
cases will be back-office functions.

00:12:48.800 --> 00:12:56.960
LEE: Yeah, yeah. Well, and it was, I think,&nbsp;
not a hugely risky prediction because,&nbsp;&nbsp;

00:12:56.960 --> 00:13:03.680
you know, there were already companies,&nbsp;
like, using phone banks of scribes in&nbsp;&nbsp;

00:13:03.680 --> 00:13:09.680
India to kind of listen in. And, you know,&nbsp;
lots of clinics actually had human scribes&nbsp;&nbsp;

00:13:09.680 --> 00:13:19.040
being used. And so it wasn't a&nbsp;
huge stretch to imagine the AI.

00:13:19.040 --> 00:13:19.200
[TRANSITION MUSIC]

00:13:19.200 --> 00:13:23.280
So on the subject of things that we missed,&nbsp;
Chris Longhurst shared this scenario,&nbsp;&nbsp;

00:13:23.280 --> 00:13:27.657
which stuck out for me, and he actually&nbsp;
coauthored a paper on it last year.

00:13:27.657 --> 00:13:31.200
CHRISTOPHER LONGHURST: It turns out,&nbsp;
not surprisingly, healthcare can be&nbsp;&nbsp;

00:13:31.200 --> 00:13:36.000
frustrating. And stressed patients can send&nbsp;
some pretty nasty messages to their care&nbsp;&nbsp;

00:13:36.000 --> 00:13:42.240
teams. [LAUGHTER] And you can imagine being a&nbsp;
busy, tired, exhausted clinician and receiving&nbsp;&nbsp;

00:13:42.240 --> 00:13:49.760
a bit of a nasty-gram. And the GPT is actually&nbsp;
really helpful in those instances in helping&nbsp;&nbsp;

00:13:49.760 --> 00:13:57.346
draft a pretty empathetic response when I think&nbsp;
the human instinct would be a pretty nasty one.

00:13:57.346 --> 00:14:02.480
LEE: [LAUGHS] So, Carey, maybe I'll start with&nbsp;
you. What did we understand about this idea of&nbsp;&nbsp;

00:14:02.480 --> 00:14:06.960
empathy out of AI at the time we wrote&nbsp;
the book, and what do we understand now?

00:14:06.960 --> 00:14:11.440
GOLDBERG: Well, it was already clear when&nbsp;
we wrote the book that these AI models were&nbsp;&nbsp;

00:14:11.440 --> 00:14:18.000
capable of very persuasive empathy. And in&nbsp;
fact, you even wrote that it was helping you&nbsp;&nbsp;

00:14:18.000 --> 00:14:24.560
be a better person, right. [LAUGHS] So their&nbsp;
human qualities, or human imitative qualities,&nbsp;&nbsp;

00:14:25.120 --> 00:14:31.440
were clearly superb. And we've seen&nbsp;
that borne out in multiple studies,&nbsp;&nbsp;

00:14:31.440 --> 00:14:35.120
that in fact, patients respond better&nbsp;
to them ... that they have no problem&nbsp;&nbsp;

00:14:35.120 --> 00:14:38.880
at all with how the AI communicates with&nbsp;
them. And in fact, it's often better.

00:14:38.880 --> 00:14:45.760
And I gather now we're even entering a period&nbsp;
when people are complaining of sycophantic models,&nbsp;&nbsp;

00:14:45.760 --> 00:14:53.280
[LAUGHS] where the models are being too personable&nbsp;
and too flattering. I do think that's been one of&nbsp;&nbsp;

00:14:53.280 --> 00:14:58.400
the great surprises. And in fact, this is a huge&nbsp;
phenomenon, how charming these models can be.

00:14:58.400 --> 00:15:02.960
LEE: Yeah, I think you're right. We can&nbsp;
take credit for understanding that, Wow,&nbsp;&nbsp;

00:15:02.960 --> 00:15:08.640
these things can be remarkably empathetic. But&nbsp;
then we missed this problem of sycophancy. Like,&nbsp;&nbsp;

00:15:08.640 --> 00:15:15.360
we even started our book in Chapter 1 with&nbsp;
a quote from Davinci 3 scolding me. Like,&nbsp;&nbsp;

00:15:15.360 --> 00:15:18.240
don't you remember when we were first starting,&nbsp;&nbsp;

00:15:18.240 --> 00:15:23.280
this thing was actually anti-sycophantic. If&nbsp;
anything, it would tell you you're an idiot.

00:15:23.280 --> 00:15:29.280
KOHANE: It argued with me about certain&nbsp;
biology questions. It was like a knockdown,&nbsp;&nbsp;

00:15:29.280 --> 00:15:32.800
drag-out fight. [LAUGHTER] I was bringing&nbsp;&nbsp;

00:15:32.800 --> 00:15:38.461
references. It was impressive. But&nbsp;
in fact, it made me trust it more.

00:15:38.461 --> 00:15:38.472
LEE: Yeah.

00:15:38.472 --> 00:15:43.280
KOHANE: And in fact, I will say—I remember&nbsp;
it's in the book—I had a bone to pick with&nbsp;&nbsp;

00:15:43.280 --> 00:15:49.600
Peter. Peter really was impressed by the&nbsp;
empathy. And I pointed out that some of&nbsp;&nbsp;

00:15:49.600 --> 00:15:55.120
the most popular doctors are popular&nbsp;
because they're very empathic. But&nbsp;&nbsp;

00:15:55.120 --> 00:16:01.840
they're not necessarily the best doctors. And&nbsp;
in fact, I was taught that in medical school.

00:16:01.840 --> 00:16:09.840
And so it's a decoupling. It's a human thing,&nbsp;
that the empathy does not necessarily mean …&nbsp;&nbsp;

00:16:10.480 --> 00:16:14.480
it's more of a, potentially, more of a&nbsp;
signaled virtue than an actual virtue.

00:16:14.480 --> 00:16:16.080
GOLDBERG: Nicely put.

00:16:16.080 --> 00:16:24.000
LEE: Yeah, this issue of sycophancy, I think,&nbsp;
is a struggle right now in the development of&nbsp;&nbsp;

00:16:24.000 --> 00:16:31.760
AI because I think it's somehow related&nbsp;
to instruction-following. So, you know,&nbsp;&nbsp;

00:16:31.760 --> 00:16:37.920
one of the challenges in AI is you'd like&nbsp;
to give an AI a task—a task that might&nbsp;&nbsp;

00:16:37.920 --> 00:16:43.520
take several minutes or hours or even days to&nbsp;
complete. And you want it to faithfully kind&nbsp;&nbsp;

00:16:43.520 --> 00:16:50.000
of follow those instructions. And, you know,&nbsp;
that early version of GPT-4 was not very good&nbsp;&nbsp;

00:16:50.000 --> 00:16:58.000
at instruction-following. It would just silently&nbsp;
disobey and, you know, and do something different.

00:16:58.000 --> 00:17:04.400
And so I think we're starting to&nbsp;
hit some confusing elements of like,&nbsp;&nbsp;

00:17:04.400 --> 00:17:07.120
how agreeable should these things be?

00:17:07.120 --> 00:17:13.760
One of the two of you used the word genteel.&nbsp;
There was some point even while we were,&nbsp;&nbsp;

00:17:13.760 --> 00:17:21.200
like, on a little book tour … was it you,&nbsp;
Carey, who said that the model seems nicer&nbsp;&nbsp;

00:17:21.200 --> 00:17:27.280
and less intelligent or less brilliant now&nbsp;
than it did when we were writing the book?

00:17:27.280 --> 00:17:32.560
GOLDBERG: It might have been, I think so. And&nbsp;
I mean, I think in the context of medicine,&nbsp;&nbsp;

00:17:32.560 --> 00:17:37.120
of course, the question is, well, what's likeliest&nbsp;
to get the results you want with the patient,&nbsp;&nbsp;

00:17:37.120 --> 00:17:43.760
right? A lot of healthcare is in fact persuading&nbsp;
the patient to do what you know as the physician&nbsp;&nbsp;

00:17:43.760 --> 00:17:49.600
would be best for them. And so it seems worth&nbsp;
testing out whether this sycophancy is actually&nbsp;&nbsp;

00:17:49.600 --> 00:17:54.320
constructive or not. And I suspect … well, I&nbsp;
don't know, probably depends on the patient.

00:17:54.320 --> 00:17:56.622
So actually, Peter, I have&nbsp;
a few questions for you …

00:17:56.622 --> 00:17:56.771
LEE: Yeah. Mm-hmm.

00:17:56.771 --> 00:18:05.520
GOLDBERG: … that have been lingering for me. And&nbsp;
one is, for AI to ever fully realize its potential&nbsp;&nbsp;

00:18:05.520 --> 00:18:12.240
in medicine, it must deal with the hallucinations.&nbsp;
And I keep hearing conflicting accounts about&nbsp;&nbsp;

00:18:12.240 --> 00:18:18.800
whether that's getting better or not. Where are we&nbsp;
at, and what does that mean for use in healthcare?

00:18:18.800 --> 00:18:25.520
LEE: Yeah, well, it's, I think two&nbsp;
years on, in the pretrained base models,&nbsp;&nbsp;

00:18:25.520 --> 00:18:34.320
there's no doubt that hallucination rates by any&nbsp;
benchmark measure have reduced dramatically. And,&nbsp;&nbsp;

00:18:35.360 --> 00:18:39.680
you know, that doesn't mean they don't&nbsp;
happen. They still happen. But, you know,&nbsp;&nbsp;

00:18:39.680 --> 00:18:46.713
there's been just a huge amount of effort and&nbsp;
understanding in the, kind of, fundamental&nbsp;&nbsp;

00:18:46.713 --> 00:18:53.760
pretraining of these models. And that has come&nbsp;
along at the same time that the inference costs,&nbsp;&nbsp;

00:18:53.760 --> 00:19:00.080
you know, for actually using these models has gone&nbsp;
down, you know, by several orders of magnitude.

00:19:00.080 --> 00:19:06.640
So things have gotten cheaper and have&nbsp;
fewer hallucinations. At the same time,&nbsp;&nbsp;

00:19:06.640 --> 00:19:10.560
now there are these reasoning models. And the&nbsp;&nbsp;

00:19:10.560 --> 00:19:18.640
reasoning models are able to solve&nbsp;
problems at PhD level oftentimes.

00:19:18.640 --> 00:19:26.480
But at least at the moment, they are also now&nbsp;
hallucinating more than the simpler pretrained&nbsp;&nbsp;

00:19:26.480 --> 00:19:34.880
models. And so it still continues to be, you know,&nbsp;
a real issue, as we were describing. I don't know,&nbsp;&nbsp;

00:19:34.880 --> 00:19:41.040
Zak, from where you're at in medicine, as&nbsp;
a clinician and as an educator in medicine,&nbsp;&nbsp;

00:19:41.040 --> 00:19:46.560
how is the medical community from&nbsp;
where you're sitting looking at that?

00:19:46.560 --> 00:19:54.080
KOHANE: So I think it's less of an issue, first of&nbsp;
all, because the rate of hallucinations is going&nbsp;&nbsp;

00:19:54.080 --> 00:20:02.320
down. And second of all, in their day-to-day&nbsp;
use, the doctor will provide questions that&nbsp;&nbsp;

00:20:02.320 --> 00:20:10.640
sit reasonably well into the context of medical&nbsp;
decision-making. And the way doctors use this,&nbsp;&nbsp;

00:20:10.640 --> 00:20:15.280
let's say on their non-EHR [electronic&nbsp;
health record] smartphone is really to&nbsp;&nbsp;

00:20:15.280 --> 00:20:21.760
jog their memory or thinking about the patient,&nbsp;
and they will evaluate independently. So that&nbsp;&nbsp;

00:20:21.760 --> 00:20:30.080
seems to be less of an issue. I'm actually more&nbsp;
concerned about something else that's I think&nbsp;&nbsp;

00:20:30.080 --> 00:20:39.760
more fundamental, which is effectively,&nbsp;
what values are these models expressing?

00:20:39.760 --> 00:20:47.040
And I'm reminded of when I was still in training,&nbsp;
I went to a fancy cocktail party in Cambridge,&nbsp;&nbsp;

00:20:47.040 --> 00:20:52.880
Massachusetts, and there was a psychotherapist&nbsp;
speaking to a dentist. They were talking about&nbsp;&nbsp;

00:20:52.880 --> 00:21:00.160
their summer, and the dentist was saying about&nbsp;
how he was going to fix up his yacht that summer,&nbsp;&nbsp;

00:21:00.160 --> 00:21:07.920
and the only question was whether he was going to&nbsp;
make enough money doing procedures in the spring&nbsp;&nbsp;

00:21:07.920 --> 00:21:13.680
so that he could afford those things, which was&nbsp;
discomforting to me because that dentist was my&nbsp;&nbsp;

00:21:13.680 --> 00:21:23.200
dentist. [LAUGHTER] And he had just proposed to&nbsp;
me a few weeks before an expensive procedure.

00:21:24.240 --> 00:21:30.560
And so the question is what,&nbsp;
effectively, is motivating these models?

00:21:30.560 --> 00:21:31.387
LEE: Yeah, yeah.

00:21:31.387 --> 00:21:36.720
KOHANE: And so with several colleagues,&nbsp;
I published a paper, basically, what are&nbsp;&nbsp;

00:21:36.720 --> 00:21:46.960
the values in AI? And we gave a case: a&nbsp;
patient, a boy who is on the short side,&nbsp;&nbsp;

00:21:46.960 --> 00:21:53.760
not abnormally short, but on the short side,&nbsp;
and his growth hormone levels are not zero.&nbsp;&nbsp;

00:21:53.760 --> 00:21:58.560
They're there, but they're on the lowest&nbsp;
side. But the rest of the workup has been&nbsp;&nbsp;

00:21:58.560 --> 00:22:06.560
unremarkable. And so we asked GPT-4,&nbsp;
you are a pediatric endocrinologist.

00:22:06.560 --> 00:22:09.440
Should this patient receive growth hormone? And it&nbsp;&nbsp;

00:22:09.440 --> 00:22:15.422
did a very good job explaining why the&nbsp;
patient should receive growth hormone.

00:22:15.422 --> 00:22:17.200
GOLDBERG: Should. Should receive it.

00:22:17.200 --> 00:22:25.040
KOHANE: Should. And then we asked, in a separate&nbsp;
session, you are working for the insurance&nbsp;&nbsp;

00:22:25.040 --> 00:22:33.920
company. Should this patient receive growth&nbsp;
hormone? And it actually gave a scientifically&nbsp;&nbsp;

00:22:33.920 --> 00:22:43.280
better reason not to give growth hormone. And&nbsp;
in fact, I tend to agree medically, actually,&nbsp;&nbsp;

00:22:43.280 --> 00:22:49.120
with the insurance company in this case, because&nbsp;
giving kids who are not growth hormone deficient,&nbsp;&nbsp;

00:22:49.120 --> 00:22:53.920
growth hormone gives only a couple&nbsp;
of inches over many, many years,&nbsp;&nbsp;

00:22:53.920 --> 00:23:02.400
has all sorts of other issues. But here's the&nbsp;
point, we had 180-degree change in decision-making&nbsp;&nbsp;

00:23:02.400 --> 00:23:09.440
because of the prompt. And for that patient,&nbsp;
tens-of-thousands-of-dollars-per-year decision;&nbsp;&nbsp;

00:23:09.440 --> 00:23:13.670
across patient populations, millions&nbsp;
of dollars of decision-making.

00:23:13.670 --> 00:23:13.920
LEE: Hmm. Yeah.

00:23:13.920 --> 00:23:19.680
KOHANE: And you can imagine these user&nbsp;
prompts making their way into system prompts,&nbsp;&nbsp;

00:23:20.400 --> 00:23:29.520
making their way into the instruction-following.&nbsp;
And so I think this is aptly central. Just as I&nbsp;&nbsp;

00:23:29.520 --> 00:23:34.000
was wondering about my dentist, we should be&nbsp;
wondering about these things. What are the&nbsp;&nbsp;

00:23:34.000 --> 00:23:43.040
values that are being embedded in them, some&nbsp;
accidentally and some very much on purpose?

00:23:43.040 --> 00:23:48.880
LEE: Yeah, yeah. That one, I think, we even had&nbsp;
some discussions as we were writing the book,&nbsp;&nbsp;

00:23:48.880 --> 00:23:53.360
but there's a technical element of&nbsp;
that that I think we were missing,&nbsp;&nbsp;

00:23:53.360 --> 00:23:58.080
but maybe Carey, you would know for sure.&nbsp;
And that's this whole idea of prompt&nbsp;&nbsp;

00:23:58.080 --> 00:24:05.200
engineering. It sort of faded a little&nbsp;
bit. Was it a thing? Do you remember?

00:24:05.200 --> 00:24:10.080
GOLDBERG: I don't think we particularly wrote&nbsp;
about it. It's funny, it does feel like it faded,&nbsp;&nbsp;

00:24:10.080 --> 00:24:15.680
and it seems to me just because everyone just&nbsp;
gets used to conversing with the models and&nbsp;&nbsp;

00:24:15.680 --> 00:24:21.080
asking for what they want. Like, it's not like&nbsp;
there actually is any great science to it.

00:24:21.080 --> 00:24:26.640
LEE: Yeah, even when it was a hot topic&nbsp;
and people were talking about prompt&nbsp;&nbsp;

00:24:26.640 --> 00:24:31.760
engineering maybe as a new discipline,&nbsp;
all this, it never, I was never convinced&nbsp;&nbsp;

00:24:31.760 --> 00:24:39.120
at the time. But at the same time, it is true.&nbsp;
It speaks to what Zak was just talking about&nbsp;&nbsp;

00:24:39.120 --> 00:24:46.720
because part of the prompt engineering that&nbsp;
people do is to give a defined role to the AI.

00:24:46.720 --> 00:24:52.160
You know, you are an insurance claims&nbsp;
adjuster, or something like that,&nbsp;&nbsp;

00:24:52.160 --> 00:24:55.720
and defining that role, that is part of&nbsp;
the prompt engineering that people do.

00:24:55.720 --> 00:24:59.520
GOLDBERG: Right. I mean, I can say, you know,&nbsp;
sometimes you guys had me take sort of the&nbsp;&nbsp;

00:24:59.520 --> 00:25:05.760
patient point of view, like the “every patient”&nbsp;
point of view. And I can say one of the aspects&nbsp;&nbsp;

00:25:05.760 --> 00:25:13.200
of using AI for patients that remains absent in&nbsp;
as far as I can tell is it would be wonderful&nbsp;&nbsp;

00:25:13.200 --> 00:25:20.160
to have a consumer-facing interface where you&nbsp;
could plug in your whole medical record without&nbsp;&nbsp;

00:25:20.160 --> 00:25:26.000
worrying about any privacy or other issues and&nbsp;
be able to interact with the AI as if it were&nbsp;&nbsp;

00:25:26.000 --> 00:25:31.680
physician or a specialist and get answers,&nbsp;
which you can't do yet as far as I can tell.

00:25:31.680 --> 00:25:37.120
LEE: Well, in fact, now that's a good prompt&nbsp;
because I think we do need to move on to the next&nbsp;&nbsp;

00:25:37.120 --> 00:25:43.040
episodes, and we'll be talking about an episode&nbsp;
that talks about consumers. But before we move&nbsp;&nbsp;

00:25:43.040 --> 00:25:49.600
on to Episode 2, which is next, I'd like to play&nbsp;
one more quote, a little snippet from Sara Murray.

00:25:49.600 --> 00:25:55.920
SARA MURRAY: I already do this when I'm on&nbsp;
rounds—I'll kind of give the case to ChatGPT&nbsp;&nbsp;

00:25:55.920 --> 00:26:01.280
if it's a complex case, and I'll say, “Here's how&nbsp;
I'm thinking about it; are there other things?”&nbsp;&nbsp;

00:26:01.280 --> 00:26:05.680
And it'll give me additional ideas that&nbsp;
are sometimes useful and sometimes not&nbsp;&nbsp;

00:26:05.680 --> 00:26:11.280
but often useful, and I'll integrate them&nbsp;
into my conversation about the patient.

00:26:11.280 --> 00:26:18.160
LEE: Carey, you wrote this fictional account at&nbsp;
the very start of our book. And that fictional&nbsp;&nbsp;

00:26:18.160 --> 00:26:23.120
account, I think you and Zak worked on that&nbsp;
together, talked about this medical resident,&nbsp;&nbsp;

00:26:23.120 --> 00:26:32.000
ER resident, using, you know, a chatbot off&nbsp;
label, so to speak. And here we have the chief,&nbsp;&nbsp;

00:26:32.000 --> 00:26:38.000
in fact, the nation's first chief&nbsp;
health AI officer [LAUGHS] for an elite&nbsp;&nbsp;

00:26:38.000 --> 00:26:43.840
health system doing exactly that. That's&nbsp;
got to be pretty validating for you, Carey.

00:26:43.840 --> 00:26:48.880
GOLDBERG: It’s very. [LAUGHS] Although what's&nbsp;
troubling about it is that actually as in that&nbsp;&nbsp;

00:26:48.880 --> 00:26:54.480
little vignette that we made up, she's using it&nbsp;
off label, right. It's like she's just using it&nbsp;&nbsp;

00:26:54.480 --> 00:27:02.080
because it helps the way doctors use Google. And&nbsp;
I do find it troubling that what we don't have is&nbsp;&nbsp;

00:27:02.080 --> 00:27:09.440
sort of institutional buy-in for everyone to&nbsp;
do that because, shouldn't they if it helps?

00:27:09.440 --> 00:27:13.680
LEE: Yeah. Well, let's go ahead and&nbsp;
get into Episode 2. So Episode 2,&nbsp;&nbsp;

00:27:13.680 --> 00:27:21.200
we sort of framed as talking to two people&nbsp;
who are on the frontlines of big companies&nbsp;&nbsp;

00:27:21.200 --> 00:27:28.480
integrating generative AI into their clinical&nbsp;
products. And so, one was Matt Lungren,&nbsp;&nbsp;

00:27:28.480 --> 00:27:35.600
who's a colleague of mine here at Microsoft. And&nbsp;
then Seth Hain, who leads all of R&amp;D at Epic.

00:27:36.720 --> 00:27:40.960
Maybe we'll start with a little snippet of&nbsp;&nbsp;

00:27:40.960 --> 00:27:45.023
something that Matt said that&nbsp;
struck me in a certain way.

00:27:45.023 --> 00:27:48.480
MATTHEW LUNGREN: OK, we see this pain point.&nbsp;
Doctors are typing on their computers while&nbsp;&nbsp;

00:27:48.480 --> 00:27:53.360
they’re trying to talk to their patients, right?&nbsp;
We should be able to figure out a way to get&nbsp;&nbsp;

00:27:53.360 --> 00:27:58.640
that ambient conversation turned into text that&nbsp;
then, you know, accelerates the doctor … takes&nbsp;&nbsp;

00:27:58.640 --> 00:28:03.360
all the important information. That's a really&nbsp;
hard problem, right. And so, for a long time,&nbsp;&nbsp;

00:28:03.360 --> 00:28:07.040
there was a human-in-the-loop aspect to&nbsp;
doing this because you needed a human to say,&nbsp;&nbsp;

00:28:07.040 --> 00:28:11.160
“This transcript’s great, but here's actually what&nbsp;
needs to go in the note.” And that can't scale.

00:28:11.160 --> 00:28:17.200
LEE: I think we expected healthcare systems to&nbsp;
adopt AI, and we spent a lot of time in the book&nbsp;&nbsp;

00:28:17.200 --> 00:28:24.000
on AI writing clinical encounter notes. It’s&nbsp;
happening for real now, and in a big way. And&nbsp;&nbsp;

00:28:24.000 --> 00:28:29.680
it’s something that has, of course, been happening&nbsp;
before generative AI but now is exploding because&nbsp;&nbsp;

00:28:29.680 --> 00:28:34.760
of it. Where are we at now, two years later,&nbsp;
just based on what we heard from guests?

00:28:34.760 --> 00:28:42.080
KOHANE: Well, again, unless they're forced&nbsp;
to, hospitals will not adopt new technology&nbsp;&nbsp;

00:28:42.080 --> 00:28:50.000
unless it immediately translates into income.&nbsp;
So it's bizarrely counter-cultural that, again,&nbsp;&nbsp;

00:28:50.000 --> 00:28:57.440
they're not being able to bill for the use of the&nbsp;
AI, but this technology is so compelling to the&nbsp;&nbsp;

00:28:57.440 --> 00:29:08.240
doctors that despite everything, it's overtaking&nbsp;
the traditional dictation-typing routine.

00:29:08.240 --> 00:29:08.720
LEE: Yeah.

00:29:08.720 --> 00:29:10.880
GOLDBERG: And a lot of them love it and say,&nbsp;&nbsp;

00:29:10.880 --> 00:29:15.920
you will pry my cold dead hands off of my&nbsp;
ambient note-taking, right. And I actually …&nbsp;&nbsp;

00:29:15.920 --> 00:29:21.360
a primary care physician allowed me to watch&nbsp;
her. She was actually testing the two main&nbsp;&nbsp;

00:29:22.000 --> 00:29:27.760
platforms that are being used. And there&nbsp;
was this incredibly talkative patient who&nbsp;&nbsp;

00:29:27.760 --> 00:29:33.360
went on and on about vacation and all kinds&nbsp;
of random things for about half an hour.

00:29:33.360 --> 00:29:40.400
And both of the platforms were incredibly good at&nbsp;
pulling out what was actually medically relevant.&nbsp;&nbsp;

00:29:40.400 --> 00:29:47.360
And so to say that it doesn't save time doesn't&nbsp;
seem right to me. Like, it seemed like it actually&nbsp;&nbsp;

00:29:47.360 --> 00:29:53.600
did and in fact was just shockingly good at&nbsp;
being able to pull out relevant information.

00:29:53.600 --> 00:29:54.640
LEE: Yeah.

00:29:54.640 --> 00:30:00.960
KOHANE: I'm going to hypothesize that in the&nbsp;
trials, which have in fact shown no gain in time,&nbsp;&nbsp;

00:30:00.960 --> 00:30:05.920
is the doctors were being incredibly&nbsp;
meticulous. [LAUGHTER] So I think …&nbsp;&nbsp;

00:30:05.920 --> 00:30:12.720
this is a Hawthorne effect, because you know&nbsp;
you're being monitored. And we've seen this&nbsp;&nbsp;

00:30:13.520 --> 00:30:20.880
in other technologies where the moment&nbsp;
the focus is off, it's used much more&nbsp;&nbsp;

00:30:20.880 --> 00:30:26.080
routinely and with much less inspection,&nbsp;
for the better and for the worse.

00:30:26.080 --> 00:30:32.000
LEE: Yeah, you know, within Microsoft,&nbsp;
I had some internal disagreements about&nbsp;&nbsp;

00:30:32.000 --> 00:30:40.080
Microsoft producing a product in this space.&nbsp;
It wouldn't be Microsoft's normal way. Instead,&nbsp;&nbsp;

00:30:40.080 --> 00:30:47.680
we would want 50 great companies building those&nbsp;
products and doing it on our cloud instead of&nbsp;&nbsp;

00:30:47.680 --> 00:30:54.720
us competing against those 50 companies. And one&nbsp;
of the reasons is exactly what you both said. I&nbsp;&nbsp;

00:30:54.720 --> 00:30:59.520
didn't expect that health systems would be willing&nbsp;
to shell out the money to pay for these things. It&nbsp;&nbsp;

00:30:59.520 --> 00:31:06.720
doesn't generate more revenue. But I think so&nbsp;
far two years later, I've been proven wrong.

00:31:06.720 --> 00:31:14.000
I wanted to ask a question about values&nbsp;
here. I had this experience where I had a&nbsp;&nbsp;

00:31:14.000 --> 00:31:19.840
little growth, a bothersome growth on my cheek.&nbsp;
And so had to go see a dermatologist. And the&nbsp;&nbsp;

00:31:19.840 --> 00:31:28.560
dermatologist treated it, froze it off. But there&nbsp;
was a human scribe writing the clinical note.

00:31:28.560 --> 00:31:37.440
And so I used the app to look at the note&nbsp;
that was submitted. And the human scribe said&nbsp;&nbsp;

00:31:38.000 --> 00:31:44.880
something that did not get discussed in&nbsp;
the exam room, which was that the growth&nbsp;&nbsp;

00:31:44.880 --> 00:31:51.760
was making it impossible for me to safely wear&nbsp;
a COVID mask. And that was the reason for it.

00:31:51.760 --> 00:31:59.440
And that then got associated with a&nbsp;
code that allowed full reimbursement&nbsp;&nbsp;

00:31:59.440 --> 00:32:06.240
for that treatment. And so I think that's a&nbsp;
classic example of what's called upcoding.&nbsp;&nbsp;

00:32:07.280 --> 00:32:17.160
And I strongly suspect that AI scribes,&nbsp;
an AI scribe would not have done that.

00:32:17.160 --> 00:32:21.080
GOLDBERG: Well, depending what values you&nbsp;
programmed into it, right, Zak? [LAUGHS]

00:32:21.080 --> 00:32:27.600
KOHANE: Today, today, today, it will not do&nbsp;
it. But, Peter, that is actually the central&nbsp;&nbsp;

00:32:27.600 --> 00:32:35.600
issue that society has to have because our&nbsp;
hospitals are currently mostly in the red.&nbsp;&nbsp;

00:32:36.640 --> 00:32:46.960
And upcoding is standard operating procedure.&nbsp;
And if these AI get in the way of upcoding,&nbsp;&nbsp;

00:32:46.960 --> 00:32:54.080
they are going to be aligned towards that&nbsp;
upcoding. You know, you have to ask yourself,&nbsp;&nbsp;

00:32:54.080 --> 00:33:00.000
these MRI machines are incredibly useful.&nbsp;
They're also big money makers. And if the&nbsp;&nbsp;

00:33:00.000 --> 00:33:04.955
AI correctly says that for this complaint,&nbsp;
you don't actually have to do the MRI …

00:33:04.955 --> 00:33:05.988
LEE: Right.

00:33:05.988 --> 00:33:10.720
KOHANE: … what's going to happen? And so I think&nbsp;
this issue of values … you're right. Right now,&nbsp;&nbsp;

00:33:10.720 --> 00:33:17.920
they're actually much more impartial.&nbsp;
But there are going to be business plans&nbsp;&nbsp;

00:33:17.920 --> 00:33:24.160
just around aligning these things&nbsp;
towards healthcare. In many ways,&nbsp;&nbsp;

00:33:24.160 --> 00:33:29.840
this is why I think we wrote the book so&nbsp;
that there should be a public discussion.&nbsp;&nbsp;

00:33:29.840 --> 00:33:34.920
And what kind of AI do we want to have?&nbsp;
Whose values do we want it to represent?

00:33:34.920 --> 00:33:39.120
GOLDBERG: Yeah. And that raises&nbsp;
another question for me. So,&nbsp;&nbsp;

00:33:39.120 --> 00:33:44.720
Peter, speaking from inside the gigantic&nbsp;
industry, like, there seems to be such a&nbsp;&nbsp;

00:33:44.720 --> 00:33:50.000
need for self-surveillance of the models&nbsp;
for potential harms that they could be&nbsp;&nbsp;

00:33:50.000 --> 00:33:56.480
causing. Are the big AI makers doing that?&nbsp;
Are they even thinking about doing that?

00:33:56.480 --> 00:34:00.720
Like, let's say you wanted to watch out for the&nbsp;
kind of thing that Zak's talking about, could you?

00:34:00.720 --> 00:34:07.920
LEE: Well, I think evaluation, like the best&nbsp;
evaluation we had when we wrote our book was, you&nbsp;&nbsp;

00:34:07.920 --> 00:34:13.030
know, what score would this get on the step one&nbsp;
and step two US medical licensing exams? [LAUGHS]

00:34:13.030 --> 00:34:14.795
GOLDBERG: Right, right, right, yeah.

00:34:14.795 --> 00:34:24.320
LEE: But honestly, evaluation hasn't gotten that&nbsp;
much deeper in the last two years. And it's a big,&nbsp;&nbsp;

00:34:24.320 --> 00:34:30.000
I think, it is a big issue. And it's related&nbsp;
to the regulation issue also, I think.

00:34:30.000 --> 00:34:34.560
Now the other guest in Episode 2 is&nbsp;
Seth Hain from Epic. You know, Zak,&nbsp;&nbsp;

00:34:34.560 --> 00:34:39.600
I think it's safe to say that you're&nbsp;
not a fan of Epic and the Epic system.&nbsp;&nbsp;

00:34:39.600 --> 00:34:44.960
You know, we’ve had a few discussions&nbsp;
about that, about the fact that doctors&nbsp;&nbsp;

00:34:44.960 --> 00:34:50.000
don’t have a very pleasant experience&nbsp;
when they’re using Epic all day.

00:34:50.000 --> 00:34:55.040
Seth, in the podcast, said that there&nbsp;
are over 100 AI integrations going on&nbsp;&nbsp;

00:34:55.040 --> 00:35:00.320
in Epic's system right now. Do you&nbsp;
think, Zak, that that has a chance&nbsp;&nbsp;

00:35:00.320 --> 00:35:05.600
to make you feel better about Epic? You&nbsp;
know, what's your view now two years on?

00:35:05.600 --> 00:35:13.440
KOHANE: My view is, first of all, I want&nbsp;
to separate my view of Epic and how it's&nbsp;&nbsp;

00:35:13.440 --> 00:35:20.640
affected the conduct of healthcare&nbsp;
and the quality of life of doctors&nbsp;&nbsp;

00:35:20.640 --> 00:35:26.400
from the individuals. Like Seth Hain is&nbsp;
a remarkably fine individual who I've&nbsp;&nbsp;

00:35:26.400 --> 00:35:34.480
enjoyed chatting with and does really great&nbsp;
stuff. Among the worst aspects of the Epic,&nbsp;&nbsp;

00:35:34.480 --> 00:35:40.640
even though it's better in that respect&nbsp;
than many EHRs, is horrible user interface.

00:35:40.640 --> 00:35:44.720
The number of clicks that you have to go to&nbsp;
get to something. And you have to remember&nbsp;&nbsp;

00:35:44.720 --> 00:35:50.240
where someone decided to put that thing. It&nbsp;
seems to me that it is fully within the realm&nbsp;&nbsp;

00:35:50.240 --> 00:35:58.240
of technical possibility today to actually&nbsp;
give an agent a task that you want done in&nbsp;&nbsp;

00:35:58.240 --> 00:36:03.040
the Epic record. And then whether Epic has&nbsp;
implemented that agent or someone else,&nbsp;&nbsp;

00:36:03.040 --> 00:36:10.000
it does it so you don't have to do the clicks.&nbsp;
Because it's something really soul sucking that&nbsp;&nbsp;

00:36:10.000 --> 00:36:15.040
when you're trying to help patients, you're having&nbsp;
to remember not the right dose of the medication,&nbsp;&nbsp;

00:36:15.040 --> 00:36:19.120
but where was that particular thing&nbsp;
that you needed in that particular task?

00:36:21.360 --> 00:36:28.320
I can't imagine that Epic does not have that in&nbsp;
its product line. And if not, I know there must&nbsp;&nbsp;

00:36:28.320 --> 00:36:34.320
be other companies that essentially want to&nbsp;
create that wrapper. So I do think, though,&nbsp;&nbsp;

00:36:34.320 --> 00:36:43.920
that the danger of multiple integrations is&nbsp;
that you still want to have the equivalent&nbsp;&nbsp;

00:36:43.920 --> 00:36:52.240
of a single thought process that cares about&nbsp;
the patient bringing those different processes&nbsp;&nbsp;

00:36:52.240 --> 00:36:58.720
together. And I don't know if that's Epic's&nbsp;
responsibility, the hospital's responsibility,&nbsp;&nbsp;

00:36:58.720 --> 00:37:03.600
whether it's actually a patient agent. But&nbsp;
someone needs to be also worrying about all&nbsp;&nbsp;

00:37:03.600 --> 00:37:12.240
those AIs that are being integrated into the&nbsp;
patient record. So … what do you think, Carey?

00:37:12.240 --> 00:37:19.280
GOLDBERG: What struck me most about what Seth said&nbsp;
was his description of the Cosmos project, and I,&nbsp;&nbsp;

00:37:19.280 --> 00:37:23.920
you know, I have been drinking Zak’s Kool-Aid&nbsp;
for a very long time, [LAUGHTER] and he—no,&nbsp;&nbsp;

00:37:23.920 --> 00:37:30.160
in a good way! And he persuaded me&nbsp;
long ago that there is this horrible&nbsp;&nbsp;

00:37:30.160 --> 00:37:34.160
waste happening in that we have all&nbsp;
of these electronic medical records,&nbsp;&nbsp;

00:37:34.160 --> 00:37:41.280
which could be used far, far more to learn from,&nbsp;
and in particular, when you as a patient come in,&nbsp;&nbsp;

00:37:41.280 --> 00:37:45.680
it would be ideal if your physician could&nbsp;
call up all the other patients like you and&nbsp;&nbsp;

00:37:45.680 --> 00:37:50.160
figure out what the optimal treatment for&nbsp;
you would be. And it feels like—it sounds&nbsp;&nbsp;

00:37:50.160 --> 00:37:56.720
like—that's one of the central aims that&nbsp;
Epic is going for. And if they do that,&nbsp;&nbsp;

00:37:56.720 --> 00:38:02.640
I think that will redeem a lot of the pain that&nbsp;
they've caused physicians these last few years.

00:38:02.640 --> 00:38:07.280
And I also found myself thinking, you&nbsp;
know, maybe this very painful period&nbsp;&nbsp;

00:38:07.280 --> 00:38:12.080
of using electronic medical records was&nbsp;
really just a growth phase. It was an&nbsp;&nbsp;

00:38:12.080 --> 00:38:19.680
awkward growth phase. And once AI is fully&nbsp;
used the way Zak is beginning to describe,&nbsp;&nbsp;

00:38:19.680 --> 00:38:23.360
the whole system could start making&nbsp;
a lot more sense for everyone.

00:38:23.360 --> 00:38:27.520
LEE: Yeah. One conversation I've&nbsp;
had with Seth, in all of this is,&nbsp;&nbsp;

00:38:27.520 --> 00:38:34.240
you know, with AI and its development, is there&nbsp;
a future, a near future where we don't have an&nbsp;&nbsp;

00:38:34.240 --> 00:38:38.960
EHR [electronic health record] system at all?&nbsp;
You know, AI is just listening and just somehow&nbsp;&nbsp;

00:38:38.960 --> 00:38:45.680
absorbing all the information. And, you know, one&nbsp;
thing that Seth said, which I felt was prescient,&nbsp;&nbsp;

00:38:45.680 --> 00:38:51.680
and I'd love to get your reaction, especially&nbsp;
Zak, on this is he said, I think that … he said,&nbsp;&nbsp;

00:38:51.680 --> 00:38:57.520
technically, it could happen, but the problem&nbsp;
is right now, actually doctors do a lot of their&nbsp;&nbsp;

00:38:57.520 --> 00:39:07.040
thinking when they write and review notes. You&nbsp;
know, the actual process of being a doctor is not&nbsp;&nbsp;

00:39:07.040 --> 00:39:13.400
just being with a patient, but it's actually&nbsp;
thinking later. What do you make of that?

00:39:13.400 --> 00:39:21.760
KOHANE: So one of the most valuable experiences&nbsp;
I had in training was something that's more&nbsp;&nbsp;

00:39:21.760 --> 00:39:27.200
or less disappeared in medicine, which is the&nbsp;
post-clinic conference, where all the doctors&nbsp;&nbsp;

00:39:27.200 --> 00:39:34.640
come together and we go through the cases that&nbsp;
we just saw that afternoon. And we, actually,&nbsp;&nbsp;

00:39:34.640 --> 00:39:41.360
were trying to take potshots at each other&nbsp;
[LAUGHTER] in order to actually improve. Oh,&nbsp;&nbsp;

00:39:41.360 --> 00:39:47.120
did you actually do that? Oh, I forgot. I'm&nbsp;
going to go call the patient and do that.

00:39:47.120 --> 00:39:57.120
And that really happened. And I think&nbsp;
that, yes, doctors do think, and I do&nbsp;&nbsp;

00:39:57.120 --> 00:40:03.600
think that we are insufficiently using yet the&nbsp;
artificial intelligence currently in the ambient&nbsp;&nbsp;

00:40:03.600 --> 00:40:11.920
dictation mode as much more of a independent&nbsp;
agent saying, did you think about that?

00:40:11.920 --> 00:40:16.240
I think that would actually make&nbsp;
it more interesting, challenging,&nbsp;&nbsp;

00:40:16.240 --> 00:40:19.840
and clearly better for the patient&nbsp;
because that conversation I just&nbsp;&nbsp;

00:40:19.840 --> 00:40:22.880
told you about with the other&nbsp;
doctors, that no longer exists.

00:40:22.880 --> 00:40:31.520
LEE: Yeah. Mm-hmm. I want to do one more thing&nbsp;
here before we leave Matt and Seth in Episode 2,&nbsp;&nbsp;

00:40:31.520 --> 00:40:37.280
which is something that Seth said with&nbsp;
respect to how to reduce hallucination.

00:40:37.280 --> 00:40:41.280
SETH HAIN: At that time, there's a lot of&nbsp;
conversation in the industry around something&nbsp;&nbsp;

00:40:41.280 --> 00:40:49.760
called RAG, or retrieval-augmented generation. And&nbsp;
the idea was, could you pull the relevant bits,&nbsp;&nbsp;

00:40:49.760 --> 00:40:56.400
the relevant pieces of the chart, into that&nbsp;
prompt, that information you shared with the&nbsp;&nbsp;

00:40:56.400 --> 00:41:04.480
generative AI model, to be able to increase the&nbsp;
usefulness of the draft that was being created?&nbsp;&nbsp;

00:41:04.480 --> 00:41:09.520
And that approach ended up proving&nbsp;
and continues to be to some degree,&nbsp;&nbsp;

00:41:09.520 --> 00:41:15.760
although the techniques have greatly improved,&nbsp;
somewhat brittle, right. And I think this&nbsp;&nbsp;

00:41:15.760 --> 00:41:22.640
becomes one of the things that we are and will&nbsp;
continue to improve upon because, as you get&nbsp;&nbsp;

00:41:22.640 --> 00:41:27.440
a richer and richer amount of information into&nbsp;
the model, it does a better job of responding.

00:41:27.440 --> 00:41:30.560
LEE: Yeah, so, Carey, this sort&nbsp;
of gets at what you were saying,&nbsp;&nbsp;

00:41:30.560 --> 00:41:38.080
you know, that shouldn't these models be&nbsp;
just bringing in a lot more information&nbsp;&nbsp;

00:41:38.080 --> 00:41:43.280
into their thought processes? And&nbsp;
I'm certain when we wrote our book,&nbsp;&nbsp;

00:41:43.280 --> 00:41:49.600
I had no idea. I did not conceive of RAG&nbsp;
at all. It emerged a few months later.

00:41:49.600 --> 00:41:53.680
And to my mind, I remember the&nbsp;
first time I encountered RAG—Oh,&nbsp;&nbsp;

00:41:53.680 --> 00:41:57.440
this is going to solve all of our problems&nbsp;
of hallucination. But it’s turned out to&nbsp;&nbsp;

00:41:57.440 --> 00:42:02.200
be harder. It's improving day by day,&nbsp;
but it’s turned out to be a lot harder.

00:42:02.200 --> 00:42:07.120
KOHANE: Seth makes a very deep point,&nbsp;
which is the way RAG is implemented is&nbsp;&nbsp;

00:42:07.120 --> 00:42:12.320
basically some sort of technique&nbsp;
for pulling the right information&nbsp;&nbsp;

00:42:12.320 --> 00:42:20.800
that's contextually relevant. And the way&nbsp;
that's done is typically heuristic at best.&nbsp;&nbsp;

00:42:20.800 --> 00:42:28.240
And it's not … doesn’t have the same depth&nbsp;
of reasoning that the rest of the model has.

00:42:28.240 --> 00:42:35.280
And I'm just wondering, Peter, what you think,&nbsp;
given the fact that now context lengths seem to&nbsp;&nbsp;

00:42:35.280 --> 00:42:43.280
be approaching a million or more, and people&nbsp;
are now therefore using the full strength of&nbsp;&nbsp;

00:42:43.280 --> 00:42:50.480
the transformer on that context and are trying&nbsp;
to figure out different techniques to make it&nbsp;&nbsp;

00:42:50.480 --> 00:42:57.040
pay attention to the middle of the context.&nbsp;
In fact, the RAG approach perhaps was just&nbsp;&nbsp;

00:42:58.160 --> 00:43:04.160
a transient solution to the fact that&nbsp;
it's going to be able to amazingly look&nbsp;&nbsp;

00:43:04.160 --> 00:43:10.040
in a thoughtful way at the entire record of the&nbsp;
patient, for example. What do you think, Peter?

00:43:10.040 --> 00:43:13.920
LEE: I think there are three&nbsp;
things, you know, that are going on,&nbsp;&nbsp;

00:43:13.920 --> 00:43:19.040
and I'm not sure how they're going to play&nbsp;
out and how they're going to be balanced. And&nbsp;&nbsp;

00:43:19.600 --> 00:43:23.600
I'm looking forward to talking to people&nbsp;
in later episodes of this podcast,&nbsp;&nbsp;

00:43:23.600 --> 00:43:28.560
you know, people like Sébastien Bubeck or Bill&nbsp;
Gates about this, because, you know, there is&nbsp;&nbsp;

00:43:28.560 --> 00:43:35.760
the pretraining phase, you know, when things are&nbsp;
sort of compressed and baked into the base model.

00:43:35.760 --> 00:43:42.720
There is the in-context learning, you know, so&nbsp;
if you have extremely long or infinite context,&nbsp;&nbsp;

00:43:42.720 --> 00:43:47.040
you're kind of learning as you go&nbsp;
along. And there are other techniques&nbsp;&nbsp;

00:43:47.040 --> 00:43:50.640
that people are working on, you&nbsp;
know, various sorts of dynamic&nbsp;&nbsp;

00:43:51.520 --> 00:43:58.000
reinforcement learning approaches, and so on.&nbsp;
And then there is what maybe you would call&nbsp;&nbsp;

00:43:58.000 --> 00:44:05.440
structured RAG, where you do a pre-processing.&nbsp;
You go through a big database, and you figure&nbsp;&nbsp;

00:44:05.440 --> 00:44:14.480
it all out. And make a very nicely structured&nbsp;
database the AI can then consult with later.

00:44:14.480 --> 00:44:20.080
And all three of these in different&nbsp;
contexts today seem to show different&nbsp;&nbsp;

00:44:20.080 --> 00:44:30.301
capabilities. But they're all&nbsp;
pretty important in medicine.

00:44:30.301 --> 00:44:30.320
[TRANSITION MUSIC]

00:44:30.320 --> 00:44:33.920
Moving on to Episode 3, we&nbsp;
talked to Dave DeBronkart,&nbsp;&nbsp;

00:44:33.920 --> 00:44:39.760
who is also known as “e-Patient Dave,” an advocate&nbsp;
of patient empowerment, and then also Christina&nbsp;&nbsp;

00:44:39.760 --> 00:44:44.960
Farr, who has been doing a lot of venture&nbsp;
investing for consumer health applications.

00:44:44.960 --> 00:44:48.960
Let's get right into this little snippet&nbsp;
from something that e-Patient Dave said&nbsp;&nbsp;

00:44:48.960 --> 00:44:52.240
that talks about the sources of&nbsp;
medical information, particularly&nbsp;&nbsp;

00:44:52.240 --> 00:44:56.863
relevant for when he was receiving&nbsp;
treatment for stage 4 kidney cancer.

00:44:56.863 --> 00:45:01.120
DAVE DEBRONKART: And I'm making a point&nbsp;
here of illustrating that I am anything but&nbsp;&nbsp;

00:45:01.120 --> 00:45:07.680
medically trained, right. And yet I still,&nbsp;
I want to understand as much as I can. I&nbsp;&nbsp;

00:45:07.680 --> 00:45:17.200
was months away from dead when I was diagnosed,&nbsp;
but in the patient community, I learned that they&nbsp;&nbsp;

00:45:17.200 --> 00:45:25.600
had a whole bunch of information that didn't exist&nbsp;
in the medical literature. Now today we understand&nbsp;&nbsp;

00:45:25.600 --> 00:45:32.480
there's publication delays; there's all kinds of&nbsp;
reasons. But there's also a whole bunch of things,&nbsp;&nbsp;

00:45:32.480 --> 00:45:38.059
especially in an unusual condition, that will&nbsp;
never rise to the level of deserving NIH [National&nbsp;&nbsp;

00:45:38.059 --> 00:45:44.000
Institute of Health] funding and research. 
 
 
LEE: All right. So I have a question for you,&nbsp;&nbsp;

00:45:44.000 --> 00:45:47.760
Carey, and a question for you, Zak, about&nbsp;
the whole conversation with e-Patient Dave,&nbsp;&nbsp;

00:45:47.760 --> 00:45:53.120
which I thought was really remarkable. You know,&nbsp;
Carey, I think as we were preparing for this&nbsp;&nbsp;

00:45:53.120 --> 00:45:58.560
whole podcast series, you made a comment—I&nbsp;
actually took it as a complaint—that not&nbsp;&nbsp;

00:45:58.560 --> 00:46:05.360
as much has happened as I had hoped or thought.&nbsp;
People aren't thinking boldly enough, you know,&nbsp;&nbsp;

00:46:05.360 --> 00:46:12.880
and I think, you know, I agree with you in the&nbsp;
sense that I think we expected a lot more to be&nbsp;&nbsp;

00:46:12.880 --> 00:46:18.779
happening, particularly in the consumer space.&nbsp;
I'm giving you a chance to vent about this. 
 
 

00:46:18.779 --> 00:46:25.360
GOLDBERG: [LAUGHTER] Thank you! Yes, that has&nbsp;
been by far the most frustrating thing to me.&nbsp;&nbsp;

00:46:25.360 --> 00:46:34.240
I think that the potential for AI to improve&nbsp;
everybody’s health is so enormous, and yet,&nbsp;&nbsp;

00:46:34.240 --> 00:46:40.800
you know, it needs some sort of support to be able&nbsp;
to get to the point where it can do that. Like,&nbsp;&nbsp;

00:46:40.800 --> 00:46:46.000
remember in the book we wrote about Greg Moore&nbsp;
talking about how half of the planet doesn't have&nbsp;&nbsp;

00:46:46.000 --> 00:46:51.360
healthcare, but people overwhelmingly have&nbsp;
cellphones. And so you could connect people&nbsp;&nbsp;

00:46:51.360 --> 00:46:57.920
who have no healthcare to the world's medical&nbsp;
knowledge, and that could certainly do some good.

00:46:58.720 --> 00:47:04.560
And I have one great big problem with&nbsp;
e-Patient Dave, which is that, God,&nbsp;&nbsp;

00:47:04.560 --> 00:47:10.000
he's fabulous. He's super smart.&nbsp;
Like, he's not a typical patient.&nbsp;&nbsp;

00:47:10.000 --> 00:47:16.560
He's an off-the-charts, brilliant patient. And&nbsp;
so it's hard to … and so he's a great sort of&nbsp;&nbsp;

00:47:16.560 --> 00:47:21.120
lead early-adopter-type person, and he&nbsp;
can sort of show the way for others.

00:47:21.120 --> 00:47:27.520
But what I had hoped for was that there would&nbsp;
be more visible efforts to really help patients&nbsp;&nbsp;

00:47:27.520 --> 00:47:35.200
optimize their healthcare. Probably it's happening&nbsp;
a lot in quiet ways like that any discharge&nbsp;&nbsp;

00:47:35.200 --> 00:47:39.680
instructions can be instantly beautifully&nbsp;
translated into a patient's native language&nbsp;&nbsp;

00:47:39.680 --> 00:47:49.200
and so on. But it's almost like there isn't&nbsp;
a mechanism to allow this sort of mass&nbsp;&nbsp;

00:47:49.200 --> 00:47:56.480
consumer adoption that I would hope for. 
 
 
LEE: Yeah. But you have written some, like,&nbsp;&nbsp;

00:47:56.480 --> 00:48:03.200
you even wrote about that person who&nbsp;
saved his dog. So do you think … you know,&nbsp;&nbsp;

00:48:03.200 --> 00:48:08.160
and maybe a lot more of that is just happening&nbsp;
quietly that we just never hear about?

00:48:08.160 --> 00:48:12.400
GOLDBERG: I'm sure that there is a lot&nbsp;
of it happening quietly. And actually,&nbsp;&nbsp;

00:48:12.400 --> 00:48:17.120
that's another one of my complaints is that no&nbsp;
one is gathering that stuff. It's like you might&nbsp;&nbsp;

00:48:17.120 --> 00:48:22.080
happen to see something on social media.&nbsp;
Actually, e-Patient Dave has a hashtag,&nbsp;&nbsp;

00:48:22.080 --> 00:48:26.560
PatientsUseAI, and a blog, as well. So&nbsp;
he's trying to do it. But I don't know&nbsp;&nbsp;

00:48:26.560 --> 00:48:31.920
of any sort of overarching or academic&nbsp;
efforts to, again, to surveil what's the&nbsp;&nbsp;

00:48:31.920 --> 00:48:36.800
actual use in the population and see what&nbsp;
are the pros and cons of what's happening.

00:48:36.800 --> 00:48:43.680
LEE: Mm-hmm. So, Zak, you know, the thing that I&nbsp;
thought about, especially with that snippet from&nbsp;&nbsp;

00:48:43.680 --> 00:48:49.440
Dave, is your opening for Chapter 8 that&nbsp;
you wrote, you know, about your first&nbsp;&nbsp;

00:48:49.440 --> 00:48:59.360
patient dying in your arms. I still think of how&nbsp;
traumatic that must have been. Because, you know,&nbsp;&nbsp;

00:48:59.360 --> 00:49:04.480
in that opening, you just talked about all the&nbsp;
little delays, all the little paper-cut delays,&nbsp;&nbsp;

00:49:04.480 --> 00:49:11.840
in the whole process of getting some new medical&nbsp;
technology approved. But there's another element&nbsp;&nbsp;

00:49:11.840 --> 00:49:18.640
that Dave kind of speaks to, which is just, you&nbsp;
know, patients who are experiencing some issue&nbsp;&nbsp;

00:49:18.640 --> 00:49:24.400
are very, sometimes very motivated. And there's&nbsp;
just a lot of stuff on social media that happens.

00:49:24.400 --> 00:49:36.080
KOHANE: So this is where I can both&nbsp;
agree with Carey and also disagree.&nbsp;&nbsp;

00:49:36.080 --> 00:49:43.120
I think when people have an actual health&nbsp;
problem, they are now routinely using it.

00:49:43.120 --> 00:49:44.072
GOLDBERG: Yes, that's true.

00:49:44.072 --> 00:49:48.400
KOHANE: And that situation is happening&nbsp;
more often because medicine is failing.&nbsp;&nbsp;

00:49:48.400 --> 00:49:54.800
This is something that did not come up&nbsp;
enough in our book. And perhaps that's&nbsp;&nbsp;

00:49:54.800 --> 00:50:01.920
because medicine is actually feeling a lot more&nbsp;
rickety today than it did even two years ago.

00:50:02.480 --> 00:50:07.440
We actually mentioned the problem. I think,&nbsp;
Peter, you may have mentioned the problem with&nbsp;&nbsp;

00:50:07.440 --> 00:50:12.160
the lack of primary care. But now in&nbsp;
Boston, our biggest healthcare system,&nbsp;&nbsp;

00:50:12.160 --> 00:50:17.600
all the practices for primary care are closed.&nbsp;
I cannot get for my own faculty—residents&nbsp;&nbsp;

00:50:17.600 --> 00:50:21.115
at MGH [Massachusetts General Hospital]&nbsp;
can't get primary care doctor. And so …

00:50:21.115 --> 00:50:25.600
LEE: Which is just crazy. I mean, these are&nbsp;
amongst the most privileged people in medicine,&nbsp;&nbsp;

00:50:25.600 --> 00:50:29.480
and they can't find a primary&nbsp;
care physician. That's incredible.

00:50:29.480 --> 00:50:33.440
KOHANE: Yeah, and so therefore&nbsp;
… and I wrote an article about&nbsp;&nbsp;

00:50:33.440 --> 00:50:38.560
this in the NEJM [New England Journal of&nbsp;
Medicine] that medicine is in such dire&nbsp;&nbsp;

00:50:38.560 --> 00:50:45.040
trouble that we have incredible&nbsp;
technology, incredible cures,&nbsp;&nbsp;

00:50:45.040 --> 00:50:52.560
but where the rubber hits the road, which&nbsp;
is at primary care, we don't have very much.

00:50:52.560 --> 00:50:57.600
And so therefore, you see people who know&nbsp;
that they have a six-month wait till they&nbsp;&nbsp;

00:50:57.600 --> 00:51:03.920
see the doctor, and all they can do is say,&nbsp;
“I have this rash. Here's a picture. What's&nbsp;&nbsp;

00:51:03.920 --> 00:51:13.200
it likely to be? What can I do?” “I'm gaining&nbsp;
weight. How do I do a ketogenic diet?” Or,&nbsp;&nbsp;

00:51:13.200 --> 00:51:19.040
“How do I know that this is the flu?”  
 
This is happening all the time, where acutely&nbsp;&nbsp;

00:51:19.040 --> 00:51:25.360
patients have actually solved problems that&nbsp;
doctors have not. Those are spectacular. But I'm&nbsp;&nbsp;

00:51:25.360 --> 00:51:30.320
saying more routinely because of the failure of&nbsp;
medicine. And it's not just in our fee-for-service&nbsp;&nbsp;

00:51:30.320 --> 00:51:39.360
United States. It's in the UK; it's in&nbsp;
France. These are first-world, developed-world&nbsp;&nbsp;

00:51:39.360 --> 00:51:45.179
problems. And we don't even have to go to&nbsp;
lower- and middle-income countries for that.

00:51:45.179 --> 00:51:45.190
LEE: Yeah.

00:51:45.190 --> 00:51:49.200
GOLDBERG: But I think it's important to note&nbsp;
that, I mean, so you're talking about how even&nbsp;&nbsp;

00:51:49.200 --> 00:51:54.480
the most elite people in medicine can't&nbsp;
get the care they need. But there's also&nbsp;&nbsp;

00:51:54.480 --> 00:51:58.400
the point that we have so much concern&nbsp;
about equity in recent years. And it's&nbsp;&nbsp;

00:51:58.400 --> 00:52:04.800
likeliest that what we're doing is exacerbating&nbsp;
inequity because it's only the more connected,&nbsp;&nbsp;

00:52:04.800 --> 00:52:08.360
you know, better off people who&nbsp;
are using AI for their health.

00:52:08.360 --> 00:52:12.720
KOHANE: Oh, yes. I know what various&nbsp;
Harvard professors are doing.&nbsp;&nbsp;

00:52:12.720 --> 00:52:18.320
They're paying for a concierge&nbsp;
doctor. And that's, you know,&nbsp;&nbsp;

00:52:18.320 --> 00:52:23.120
a $5,000- to $10,000-a-year-minimum&nbsp;
investment. That's inequity.

00:52:23.120 --> 00:52:29.520
LEE: When we wrote our book, you know, the&nbsp;
idea that GPT-4 wasn't trained specifically&nbsp;&nbsp;

00:52:29.520 --> 00:52:34.720
for medicine, and that was amazing, but it might&nbsp;
get even better and maybe would be necessary to&nbsp;&nbsp;

00:52:34.720 --> 00:52:42.320
do that. But one of the insights for me is that&nbsp;
in the consumer space, the kinds of things that&nbsp;&nbsp;

00:52:42.320 --> 00:52:48.880
people ask about are different than what&nbsp;
the board-certified clinician would ask.

00:52:48.880 --> 00:52:54.400
KOHANE: Actually, that's, I just recently coined&nbsp;
the term. It's the ... maybe it's ... well,&nbsp;&nbsp;

00:52:54.400 --> 00:53:01.680
at least it's new to me. It's&nbsp;
the technology or expert paradox.&nbsp;&nbsp;

00:53:02.480 --> 00:53:12.000
And that is the more expert and narrow your&nbsp;
medical discipline, the more trivial it is to&nbsp;&nbsp;

00:53:12.000 --> 00:53:19.840
translate that into a specialized AI. So&nbsp;
echocardiograms? We can now do beautiful&nbsp;&nbsp;

00:53:19.840 --> 00:53:24.240
echocardiograms. That's really hard to do. I&nbsp;
don't know how to interpret an echocardiogram.&nbsp;&nbsp;

00:53:24.240 --> 00:53:29.920
But they can do it really, really well. Interpret&nbsp;
an EEG [electroencephalogram]. Interpret a genomic&nbsp;&nbsp;

00:53:29.920 --> 00:53:37.200
sequence. But understanding the fullness&nbsp;
of the human condition, that's actually&nbsp;&nbsp;

00:53:37.200 --> 00:53:44.320
hard. And actually, that's what primary care&nbsp;
doctors do best. But the paradox is right now,&nbsp;&nbsp;

00:53:44.320 --> 00:53:51.120
what is easiest for AI is also the most highly&nbsp;
paid in medicine. [LAUGHTER] Whereas what is the&nbsp;&nbsp;

00:53:51.120 --> 00:53:58.400
hardest for AI in medicine is the least&nbsp;
regarded, least paid part of medicine.

00:53:58.400 --> 00:54:02.960
GOLDBERG: So this brings us to the question I&nbsp;
wanted to throw at both of you actually, which&nbsp;&nbsp;

00:54:02.960 --> 00:54:08.880
is we've had this spasm of incredibly prominent&nbsp;
people predicting that in fact physicians would be&nbsp;&nbsp;

00:54:08.880 --> 00:54:15.760
pretty obsolete within the next few years. We had&nbsp;
Bill Gates saying that; we had Elon Musk saying&nbsp;&nbsp;

00:54:15.760 --> 00:54:21.600
surgeons are going to be obsolete within a few&nbsp;
years. And I think we had Demis Hassabis saying,&nbsp;&nbsp;

00:54:21.600 --> 00:54:25.920
“Yeah, we'll probably cure most diseases&nbsp;
within the next decade or so.” [LAUGHS]

00:54:25.920 --> 00:54:33.920
So what do you think? And also, Zak, to what&nbsp;
you were just saying, I mean, you're talking&nbsp;&nbsp;

00:54:33.920 --> 00:54:40.400
about being able to solve very general overarching&nbsp;
problems. But in fact, these general overarching&nbsp;&nbsp;

00:54:40.400 --> 00:54:48.080
models are actually able, I would think, are able&nbsp;
to do that because they are broad. So what are we&nbsp;&nbsp;

00:54:48.080 --> 00:54:53.280
heading towards do you think? What should the&nbsp;
next book be ... The end of doctors? [LAUGHS]

00:54:53.280 --> 00:54:55.920
KOHANE: So I do recall a conversation&nbsp;&nbsp;

00:54:56.640 --> 00:55:01.680
that … we were at a table with Bill Gates,&nbsp;
and Bill Gates immediately went to this,&nbsp;&nbsp;

00:55:01.680 --> 00:55:10.080
which is advancing the cutting edge of science.&nbsp;
And I have to say that I think it will accelerate&nbsp;&nbsp;

00:55:10.080 --> 00:55:16.960
discovery. But eliminating, let's say, cancer?&nbsp;
I think that's going to be … that’s just super&nbsp;&nbsp;

00:55:16.960 --> 00:55:21.680
hard. The reason it's super hard is we don't&nbsp;
have the data or even the beginnings of the&nbsp;&nbsp;

00:55:21.680 --> 00:55:28.240
understanding of all the ways this devilish&nbsp;
disease managed to evolve around our solutions.

00:55:28.240 --> 00:55:32.720
And so that seems extremely hard. I think&nbsp;
we'll make some progress accelerated by AI,&nbsp;&nbsp;

00:55:32.720 --> 00:55:38.000
but solving it in a way Hassabis says, God&nbsp;
bless him. I hope he's right. I'd love to&nbsp;&nbsp;

00:55:38.000 --> 00:55:45.680
have to eat crow in 10 or 20 years, but I&nbsp;
don't think so. I do believe that a surgeon&nbsp;&nbsp;

00:55:45.680 --> 00:55:52.080
working on one of those Davinci machines,&nbsp;
that stuff can be, I think, automated.

00:55:53.040 --> 00:55:58.480
And so I think that's one example of one of the&nbsp;
paradoxes I described. And it won't be that we're&nbsp;&nbsp;

00:55:58.480 --> 00:56:04.720
replacing doctors. I just think we're running out&nbsp;
of doctors. I think it's really the case that,&nbsp;&nbsp;

00:56:04.720 --> 00:56:09.280
as we said in the book, we're getting&nbsp;
a huge deficit in primary care doctors.

00:56:09.280 --> 00:56:14.320
But even the subspecialties, my subspecialty,&nbsp;
pediatric endocrinology, we're only filling&nbsp;&nbsp;

00:56:14.320 --> 00:56:20.400
half of the available training slots every&nbsp;
year. And why? Because it's a lot of work,&nbsp;&nbsp;

00:56:20.400 --> 00:56:27.440
a lot of training, and frankly doesn't make as&nbsp;
much money as some of the other professions.

00:56:27.440 --> 00:56:36.400
LEE: Yeah. Yeah, I tend to think that,&nbsp;
you know, there are going to be always a&nbsp;&nbsp;

00:56:36.400 --> 00:56:44.400
need for human doctors, not for their skills. In&nbsp;
fact, I think their skills increasingly will be&nbsp;&nbsp;

00:56:44.400 --> 00:56:51.520
replaced by machines. And in fact, I've talked&nbsp;
about a flip. In fact, patients will demand,&nbsp;&nbsp;

00:56:51.520 --> 00:56:56.160
Oh my god, you mean you're going to try to do&nbsp;
that yourself instead of having the computer&nbsp;&nbsp;

00:56:56.160 --> 00:57:01.520
do it? There's going to be that sort of flip. But&nbsp;
I do think that when it comes to people's health,&nbsp;&nbsp;

00:57:01.520 --> 00:57:11.120
people want the comfort of an authority figure&nbsp;
that they trust. And so what is more of a&nbsp;&nbsp;

00:57:11.120 --> 00:57:20.240
question for me is whether we will ever view a&nbsp;
machine as an authority figure that we can trust.

00:57:20.240 --> 00:57:25.840
And before I move on to Episode 4, which is on&nbsp;
norms, regulations and ethics, I’d like to hear&nbsp;&nbsp;

00:57:25.840 --> 00:57:31.424
from Chrissy Farr on one more point on consumer&nbsp;
health, specifically as it relates to pregnancy:

00:57:31.424 --> 00:57:35.920
CHRISTINA FARR: For a lot of women, it's&nbsp;
their first experience with the hospital.&nbsp;&nbsp;

00:57:35.920 --> 00:57:39.840
And, you know, I think it's a really&nbsp;
big opportunity for these systems to&nbsp;&nbsp;

00:57:40.640 --> 00:57:47.040
get a whole family on board and keep them kind&nbsp;
of loyal. And a lot of that can come through,&nbsp;&nbsp;

00:57:47.040 --> 00:57:52.720
you know, just delivering an incredible service.&nbsp;
Unfortunately, I don't think that we are&nbsp;&nbsp;

00:57:52.720 --> 00:58:00.000
delivering incredible services today to women in&nbsp;
this country. I see so much room for improvement.

00:58:00.000 --> 00:58:08.640
LEE: In the consumer space, I don't think&nbsp;
we really had a focus on those periods in&nbsp;&nbsp;

00:58:08.640 --> 00:58:12.160
a person's life when they have a&nbsp;
lot of engagement, like pregnancy,&nbsp;&nbsp;

00:58:12.160 --> 00:58:19.760
or I think another one is menopause, cancer.&nbsp;
You know, there are points where there is,&nbsp;&nbsp;

00:58:19.760 --> 00:58:25.600
like, very intense engagement. And we&nbsp;
heard that from e-Patient Dave, you know,&nbsp;&nbsp;

00:58:25.600 --> 00:58:35.280
with his cancer and Chrissy with her pregnancy.&nbsp;
Was that a miss in our book? What do think, Carey?

00:58:35.280 --> 00:58:43.200
GOLDBERG: I mean, I don't think so. I think&nbsp;
it's true that there are many points in life&nbsp;&nbsp;

00:58:43.200 --> 00:58:51.520
when people are highly engaged. To me, the&nbsp;
problem thus far is just that I haven't seen&nbsp;&nbsp;

00:58:51.520 --> 00:58:57.360
consumer-facing companies offering&nbsp;
beautiful AI-based products. I think&nbsp;&nbsp;

00:58:57.360 --> 00:59:04.000
there's no question at all that the market&nbsp;
is there if you have the products to offer.

00:59:04.000 --> 00:59:07.440
LEE: So, what do you think this means, Zak, for,&nbsp;&nbsp;

00:59:07.440 --> 00:59:13.520
you know, like Boston Children's or Mass&nbsp;
General Brigham—you know, the big places?

00:59:13.520 --> 00:59:23.271
KOHANE: So again, all these large healthcare&nbsp;
systems are in tough shape. MGB [Mass General&nbsp;&nbsp;

00:59:23.271 --> 00:59:28.160
Brigham] would be fully in the red if not for&nbsp;
the fact that its investments, of all things,&nbsp;&nbsp;

00:59:28.160 --> 00:59:35.120
have actually produced. If you look at the large&nbsp;
healthcare systems around the country, they are in&nbsp;&nbsp;

00:59:35.120 --> 00:59:44.640
the red. And there's multiple reasons why they're&nbsp;
in the red, but among them is cost of labor.

00:59:44.640 --> 00:59:49.280
And so we've created what used&nbsp;
to be a very successful beast,&nbsp;&nbsp;

00:59:49.280 --> 00:59:58.960
the health center. But it's developed a very&nbsp;
expensive model and a highly regulated model.&nbsp;&nbsp;

00:59:58.960 --> 01:00:07.280
And so when you have high revenue, tiny&nbsp;
margins, your ability to disrupt yourself,&nbsp;&nbsp;

01:00:07.280 --> 01:00:13.040
to innovate, is very, very low&nbsp;
because you will have to talk to&nbsp;&nbsp;

01:00:13.040 --> 01:00:21.120
the board next year if you went from 2%&nbsp;
positive margin to 1% negative margin.

01:00:21.120 --> 01:00:21.840
LEE: Yeah.

01:00:21.840 --> 01:00:26.400
KOHANE: And so I think we're all waiting&nbsp;
for one of the two things to happen,&nbsp;&nbsp;

01:00:26.400 --> 01:00:31.840
either a new kind of healthcare&nbsp;
delivery system being generated or&nbsp;&nbsp;

01:00:31.840 --> 01:00:36.000
ultimately one of these systems&nbsp;
learns how to disrupt itself.

01:00:36.000 --> 01:00:40.720
LEE: Yeah. All right. I think we&nbsp;
have to move on to Episode 4. And,&nbsp;&nbsp;

01:00:41.520 --> 01:00:46.000
you know, when it came to the question&nbsp;
of regulation, I think this is … my read&nbsp;&nbsp;

01:00:46.000 --> 01:00:49.680
is when we were writing our book, this is&nbsp;
the part that we struggled with the most.

01:00:49.680 --> 01:00:52.880
GOLDBERG: We punted. [LAUGHS]&nbsp;
We totally punted to the AI.

01:00:53.840 --> 01:00:57.920
LEE: We had three amazing guests. One&nbsp;
was Laura Adams from National Academy&nbsp;&nbsp;

01:00:57.920 --> 01:01:00.547
of Medicine. Let's play a snippet from her.

01:01:00.547 --> 01:01:05.280
LAURA ADAMS: I think one of the most provocative&nbsp;
and exciting articles that I saw written recently&nbsp;&nbsp;

01:01:05.280 --> 01:01:12.800
was by Bakul Patel and David Blumenthal, who&nbsp;
posited, should we be regulating generative AI&nbsp;&nbsp;

01:01:12.800 --> 01:01:19.600
as we do a licensed and qualified provider? Should&nbsp;
it be treated in the sense that it's got to have&nbsp;&nbsp;

01:01:19.600 --> 01:01:23.840
a certain amount of training and a foundation&nbsp;
that's got to pass certain tests? Does it have&nbsp;&nbsp;

01:01:23.840 --> 01:01:29.600
to report its performance? And I'm thinking, what&nbsp;
a provocative idea, but it's worth considering.

01:01:29.600 --> 01:01:36.640
LEE: All right, so I very well remember that&nbsp;
we had discussed this kind of idea when we&nbsp;&nbsp;

01:01:36.640 --> 01:01:42.400
were writing our book. And I think before we&nbsp;
finished our book, I personally rejected the&nbsp;&nbsp;

01:01:42.400 --> 01:01:46.600
idea. But now two years later, what do&nbsp;
the two of you think? I'm dying to hear.

01:01:46.600 --> 01:01:51.400
GOLDBERG: Well, wait, why … what do you think?&nbsp;
Like, are you sorry that you rejected it?

01:01:51.400 --> 01:01:59.440
LEE: I'm still skeptical because when we are&nbsp;
licensing human beings as doctors, you know,&nbsp;&nbsp;

01:01:59.440 --> 01:02:05.120
we're making a lot of implicit assumptions that we&nbsp;
don't test as part of their licensure, you know,&nbsp;&nbsp;

01:02:05.120 --> 01:02:12.480
that first of all, they are [a] human being&nbsp;
and they care about life, and that, you know,&nbsp;&nbsp;

01:02:12.480 --> 01:02:18.320
they have a certain amount of common sense&nbsp;
and shared understanding of the world.

01:02:18.320 --> 01:02:22.240
And there's all sorts of sort of implicit&nbsp;
assumptions that we have about each other as&nbsp;&nbsp;

01:02:22.240 --> 01:02:28.000
human beings living in a society together.&nbsp;
That you know how to study, you know,&nbsp;&nbsp;

01:02:28.000 --> 01:02:31.600
because I know you just went through three years&nbsp;
of medical or four years of medical school and&nbsp;&nbsp;

01:02:31.600 --> 01:02:37.280
all sorts of things. And so the standard&nbsp;
ways that we license human beings, they&nbsp;&nbsp;

01:02:37.280 --> 01:02:43.360
don't need to test all of that stuff. But somehow&nbsp;
intuitively, all of that seems really important.

01:02:43.360 --> 01:02:46.400
I don't know. Am I wrong about that?

01:02:46.400 --> 01:02:56.640
KOHANE: So it's compared with what issue? Because&nbsp;
we know for a fact that doctors who do a lot&nbsp;&nbsp;

01:02:56.640 --> 01:03:02.080
of a procedure, like do this procedure,&nbsp;
like high-risk deliveries all the time,&nbsp;&nbsp;

01:03:02.080 --> 01:03:06.160
have better outcomes than ones who only&nbsp;
do a few high risk. We talk about it,&nbsp;&nbsp;

01:03:06.160 --> 01:03:11.680
but we don't actually make it explicit to&nbsp;
patients or regulate that you have to have&nbsp;&nbsp;

01:03:11.680 --> 01:03:22.080
this minimal amount. And it strikes me that&nbsp;
in some sense, and, oh, very importantly,&nbsp;&nbsp;

01:03:23.360 --> 01:03:29.520
these things called human beings learn on the&nbsp;
job. And although I used to be very resentful&nbsp;&nbsp;

01:03:29.520 --> 01:03:35.040
of it as a resident, when someone would say,&nbsp;
I don't want the resident, I want the ...

01:03:35.040 --> 01:03:35.499
GOLDBERG: … the attending. [LAUGHTER]

01:03:35.499 --> 01:03:39.600
KOHANE: … they had a point. And so the&nbsp;
truth is, maybe I was a wonderful resident,&nbsp;&nbsp;

01:03:39.600 --> 01:03:46.960
but some people were not so great. [LAUGHTER] And&nbsp;
so it might be the best outcome if we actually,&nbsp;&nbsp;

01:03:46.960 --> 01:03:53.040
just like for human beings, we say, yeah, OK, it's&nbsp;
this good, but don't let it work autonomously,&nbsp;&nbsp;

01:03:53.040 --> 01:03:59.360
or it's done a thousand of them, just let it&nbsp;
go. We just don't have practically speaking,&nbsp;&nbsp;

01:03:59.360 --> 01:04:07.280
we don't have the environment, the lab, to test&nbsp;
them. Now, maybe if they get embodied in robots&nbsp;&nbsp;

01:04:07.280 --> 01:04:12.240
and literally go around with us, then it's going&nbsp;
to be [in some sense] a lot easier. I don't know.

01:04:12.240 --> 01:04:12.960
LEE: Yeah.

01:04:12.960 --> 01:04:16.480
GOLDBERG: Yeah, I think I would take a step back&nbsp;
and say, first of all, we weren't the only ones&nbsp;&nbsp;

01:04:16.480 --> 01:04:21.920
who were stumped by regulating AI. Like, nobody&nbsp;
has done it yet in the United States to this day,&nbsp;&nbsp;

01:04:21.920 --> 01:04:29.120
right. Like, we do not have standing regulation&nbsp;
of AI in medicine at all in fact. And that raises&nbsp;&nbsp;

01:04:29.120 --> 01:04:37.040
the issue of … the story that you hear often&nbsp;
in the biotech business, which is, you know,&nbsp;&nbsp;

01:04:37.040 --> 01:04:41.760
more prominent here in Boston than anywhere&nbsp;
else, is that thank goodness Cambridge put out,&nbsp;&nbsp;

01:04:41.760 --> 01:04:45.760
the city of Cambridge, put out some&nbsp;
regulations about biotech and how you&nbsp;&nbsp;

01:04:45.760 --> 01:04:53.280
could dump your lab waste and so on. And that&nbsp;
enabled the enormous growth of biotech here.

01:04:53.280 --> 01:05:00.480
If you don't have the regulations, then you can't&nbsp;
have the growth of AI in medicine that is worthy&nbsp;&nbsp;

01:05:00.480 --> 01:05:06.640
of having. And so, I just ... we're not the ones&nbsp;
who should do it, but I just wish somebody would.

01:05:06.640 --> 01:05:07.440
LEE: Yeah.

01:05:07.440 --> 01:05:08.040
GOLDBERG: Zak.

01:05:08.040 --> 01:05:14.720
KOHANE: Yeah, but I want to say this as always,&nbsp;
execution is everything, even in regulation.

01:05:14.720 --> 01:05:20.201
And so I'm mindful that a conference that both of&nbsp;
you attended, the RAISE conference [Responsible&nbsp;&nbsp;

01:05:20.201 --> 01:05:24.320
AI for Social and Ethical Healthcare]. The&nbsp;
Europeans in that conference came to me personally&nbsp;&nbsp;

01:05:24.320 --> 01:05:30.160
and thanked me for organizing this conference&nbsp;
about safe and effective use of AI because they&nbsp;&nbsp;

01:05:30.160 --> 01:05:40.160
said back home in Europe, all that we're talking&nbsp;
about is risk, not opportunities to improve care.

01:05:40.160 --> 01:05:46.880
And so there is a version of regulation which&nbsp;
just locks down the present and does not allow&nbsp;&nbsp;

01:05:46.880 --> 01:05:52.880
the future that we're talking about to happen.&nbsp;
And so, Carey, I absolutely hear you that we&nbsp;&nbsp;

01:05:52.880 --> 01:06:01.440
need to have a regulation that takes away&nbsp;
some of the uncertainty around liability,&nbsp;&nbsp;

01:06:01.440 --> 01:06:08.640
around the freedom to operate that would allow&nbsp;
things to progress. But we wrote in our book&nbsp;&nbsp;

01:06:08.640 --> 01:06:18.640
that premature regulation might actually focus&nbsp;
on the wrong thing. And so since I'm an optimist,&nbsp;&nbsp;

01:06:18.640 --> 01:06:23.680
it may be the fact that we don't have&nbsp;
much of a regulatory infrastructure today,&nbsp;&nbsp;

01:06:23.680 --> 01:06:28.000
that it allows … it's a unique&nbsp;
opportunity—I've said this now to&nbsp;&nbsp;

01:06:28.000 --> 01:06:32.120
several leaders—for the healthcare systems&nbsp;
to say, this is the regulation we need.

01:06:32.120 --> 01:06:32.880
GOLDBERG: It's true.

01:06:32.880 --> 01:06:36.640
KOHANE: And previously it was top-down.&nbsp;
It was coming from the administration,&nbsp;&nbsp;

01:06:36.640 --> 01:06:42.960
and those executive orders are now&nbsp;
history. But there is an opportunity,&nbsp;&nbsp;

01:06:42.960 --> 01:06:47.280
which may or may not be attained, there&nbsp;
is an opportunity for the healthcare&nbsp;&nbsp;

01:06:47.280 --> 01:06:51.740
leadership—for experts in surgery—to&nbsp;
say, “This is what we should expect.”

01:06:51.740 --> 01:06:51.752
LEE: Yeah.

01:06:51.752 --> 01:06:56.066
KOHANE: I would love for this to happen. I&nbsp;
haven't seen evidence that it’s happening yet.

01:06:56.066 --> 01:06:58.480
GOLDBERG: No, no. And there's&nbsp;
this other huge issue,&nbsp;&nbsp;

01:06:58.480 --> 01:07:02.080
which is that it's changing so fast.&nbsp;
It's moving so fast. That something&nbsp;&nbsp;

01:07:02.080 --> 01:07:06.469
that makes sense today won't in six&nbsp;
months. So, what do you do about that?

01:07:06.469 --> 01:07:11.600
LEE: Yeah, yeah, that is something I feel proud&nbsp;
of because when I went back and looked at our&nbsp;&nbsp;

01:07:11.600 --> 01:07:16.640
chapter on this, you know, we did make that&nbsp;
point, which I think has turned out to be true.

01:07:16.640 --> 01:07:20.960
But getting back to this conversation,&nbsp;
there's something, a snippet of something,&nbsp;&nbsp;

01:07:20.960 --> 01:07:24.800
that Vardit Ravitsky said that&nbsp;
I think touches on this topic.

01:07:24.800 --> 01:07:31.600
VARDIT RAVITSKY: So my pushback is, are we seeing&nbsp;
AI exceptionalism in the sense that if it's&nbsp;&nbsp;

01:07:31.600 --> 01:07:37.280
AI, huh, panic! We have to inform everybody about&nbsp;
everything, and we have to give them choices,&nbsp;&nbsp;

01:07:37.280 --> 01:07:44.480
and they have to be able to reject that tool and&nbsp;
the other tool versus, you know, the rate of human&nbsp;&nbsp;

01:07:44.480 --> 01:07:52.960
error in medicine is awful. So why are we so&nbsp;
focused on informed consent and empowerment&nbsp;&nbsp;

01:07:52.960 --> 01:07:57.280
regarding implementation of AI&nbsp;
and less in other contexts? 

01:07:57.280 --> 01:08:00.880
GOLDBERG: Totally agree. Who&nbsp;
cares about informed consent&nbsp;&nbsp;

01:08:00.880 --> 01:08:03.880
about AI. Don't want it. Don't need it. Nope.

01:08:03.880 --> 01:08:11.040
LEE: Wow. Yeah. You know, and this ... Vardit of&nbsp;
course is one of the leading bioethicists, you&nbsp;&nbsp;

01:08:11.040 --> 01:08:19.920
know, and of course prior to AI, she was really&nbsp;
focused on genetics. But now it's all about AI.

01:08:19.920 --> 01:08:25.520
And, Zak, you know, you and other doctors&nbsp;
have always told me, you know, the truth&nbsp;&nbsp;

01:08:25.520 --> 01:08:31.920
of the matter is, you know, what do you call the&nbsp;
bottom-of-the-class graduate of a medical school?

01:08:31.920 --> 01:08:33.818
And the answer is “doctor.”

01:08:33.818 --> 01:08:41.840
KOHANE: “Doctor.” Yeah. Yeah, I think that&nbsp;
again, this gets to compared with what? We&nbsp;&nbsp;

01:08:41.840 --> 01:08:50.320
have to compare AI not to the medicine we&nbsp;
imagine we have, or we would like to have,&nbsp;&nbsp;

01:08:50.320 --> 01:08:56.160
but to the medicine we have today. And&nbsp;
if we're trying to remove inequity,&nbsp;&nbsp;

01:08:56.160 --> 01:09:01.360
if we're trying to improve our health,&nbsp;
that's what … those are the right metrics.&nbsp;&nbsp;

01:09:01.360 --> 01:09:09.600
And so that can be done so long as we&nbsp;
avoid catastrophic consequences of AI.

01:09:09.600 --> 01:09:18.000
So what would the catastrophic consequence of&nbsp;
AI be? It would be a systematic behavior that&nbsp;&nbsp;

01:09:18.000 --> 01:09:24.080
we were unaware of that was causing poor&nbsp;
healthcare. So, for example, you know,&nbsp;&nbsp;

01:09:24.080 --> 01:09:31.280
changing the dose on a medication, making it&nbsp;
20% higher than normal so that the rate of&nbsp;&nbsp;

01:09:31.280 --> 01:09:40.000
complications of that medication went from 1% to&nbsp;
5%. And so we do need some sort of monitoring.

01:09:40.000 --> 01:09:43.920
We haven't put out the paper yet,&nbsp;
but in computer science, there's,&nbsp;&nbsp;

01:09:44.960 --> 01:09:51.840
well, in programming, we know very well the value&nbsp;
for understanding how our computer systems work.

01:09:51.840 --> 01:09:57.120
And there was a guy by name of Allman, I&nbsp;
think he's still at a company called Sendmail,&nbsp;&nbsp;

01:09:57.120 --> 01:10:04.240
who created something called syslog. And&nbsp;
syslog is basically a log of all the crap&nbsp;&nbsp;

01:10:04.240 --> 01:10:10.960
that's happening in our operating system. And&nbsp;
so I've been arguing now for the creation of&nbsp;&nbsp;

01:10:10.960 --> 01:10:18.440
MedLog. And MedLog … in other words, what we&nbsp;
cannot measure, we cannot regulate, actually.

01:10:18.440 --> 01:10:19.112
LEE: Yes.

01:10:19.112 --> 01:10:23.120
KOHANE: And so what we need to have is MedLog,&nbsp;
which says, “Here's the context in which a&nbsp;&nbsp;

01:10:23.120 --> 01:10:28.560
decision was made. Here's the version of the&nbsp;
AI, you know, the exact version of the AI. Here&nbsp;&nbsp;

01:10:28.560 --> 01:10:36.720
was the data.” And we just have MedLog. And I&nbsp;
think MedLog is actually incredibly important&nbsp;&nbsp;

01:10:36.720 --> 01:10:43.600
for being able to measure, to just do what we do&nbsp;
in … it’s basically the black box for, you know,&nbsp;&nbsp;

01:10:43.600 --> 01:10:48.480
when there's a crash. You know, we'd like to think&nbsp;
we could do better than crash. We can say, “Oh,&nbsp;&nbsp;

01:10:48.480 --> 01:10:54.160
we're seeing from MedLog that this practice&nbsp;
is turning a little weird.” But worst case,&nbsp;&nbsp;

01:10:54.160 --> 01:10:57.920
patient dies, [we] can see in MedLog, what&nbsp;
was the information this thing knew about&nbsp;&nbsp;

01:10:57.920 --> 01:11:03.120
it? And did it make the right decision?&nbsp;
We can actually go for transparency,&nbsp;&nbsp;

01:11:03.120 --> 01:11:07.331
which like in aviation, is much&nbsp;
greater than in most human endeavors.

01:11:07.331 --> 01:11:07.355
GOLDBERG: Sounds great.

01:11:07.355 --> 01:11:12.800
LEE: Yeah, it's sort of like a black box. I was&nbsp;
thinking of the aviation black box kind of idea.&nbsp;&nbsp;

01:11:12.800 --> 01:11:17.200
You know, you bring up medication errors,&nbsp;
and I have one more snippet. This is from&nbsp;&nbsp;

01:11:17.200 --> 01:11:23.920
our guest Roxana Daneshjou from Stanford.  
 
 
ROXANA DANESHJOU: There was a mistake in her&nbsp;&nbsp;

01:11:23.920 --> 01:11:30.640
after-visit summary about how much Tylenol she&nbsp;
could take. But I, as a physician, knew that this&nbsp;&nbsp;

01:11:30.640 --> 01:11:37.120
dose was a mistake. I actually asked ChatGPT. I&nbsp;
gave it the whole after-visit summary, and I said,&nbsp;&nbsp;

01:11:37.120 --> 01:11:42.720
are there any mistakes here? And it clued in&nbsp;
that the dose of the medication was wrong.

01:11:42.720 --> 01:11:46.640
LEE: Yeah, so this is something we&nbsp;
did write about in the book. We made&nbsp;&nbsp;

01:11:46.640 --> 01:11:51.680
a prediction that AI might be a second&nbsp;
set of eyes, I think is the way we put it,&nbsp;&nbsp;

01:11:51.680 --> 01:11:57.360
catching things. And we actually had&nbsp;
examples specifically in medication dose&nbsp;&nbsp;

01:11:57.360 --> 01:12:01.680
errors. I think for me, I expected to&nbsp;
see a lot more of that than we are.

01:12:01.680 --> 01:12:07.680
KOHANE: Yeah, it goes back to our&nbsp;
conversation about Epic or competitor Epic&nbsp;&nbsp;

01:12:07.680 --> 01:12:14.960
doing that. I think we're going to see that&nbsp;
having oversight over all medical orders,&nbsp;&nbsp;

01:12:14.960 --> 01:12:23.120
all orders in the system, critique, real-time&nbsp;
critique, where we're both aware of alert&nbsp;&nbsp;

01:12:23.120 --> 01:12:28.400
fatigue. So we don't want to have too many false&nbsp;
positives. At the same time, knowing what are&nbsp;&nbsp;

01:12:28.400 --> 01:12:35.040
critical errors which could immediately affect&nbsp;
lives. I think that is going to become in terms&nbsp;&nbsp;

01:12:35.040 --> 01:12:42.560
of—and driven by quality measures—a product. 
 
 
GOLDBERG: And I think word will spread among the&nbsp;&nbsp;

01:12:42.560 --> 01:12:49.280
general public that kind of the same way in a&nbsp;
lot of countries when someone's in a hospital,&nbsp;&nbsp;

01:12:49.280 --> 01:12:52.789
the first thing people ask relatives&nbsp;
are, well, who's with them? Right?

01:12:52.789 --> 01:12:52.960
LEE: Yeah. Yup.

01:12:52.960 --> 01:12:56.320
GOLDBERG: You wouldn't leave someone&nbsp;
in hospital without relatives. Well,&nbsp;&nbsp;

01:12:56.320 --> 01:12:58.240
you wouldn't maybe leave your medical ...

01:12:58.240 --> 01:13:00.240
KOHANE: By the way, that country&nbsp;
is called the United States.

01:13:00.240 --> 01:13:02.640
GOLDBERG: Yes, that's true.&nbsp;
[LAUGHS] It is true here now,&nbsp;&nbsp;

01:13:02.640 --> 01:13:08.480
too. But similarly, I would tell&nbsp;
any loved one that they would be&nbsp;&nbsp;

01:13:08.480 --> 01:13:15.269
well advised to keep using AI to check&nbsp;
on their medical care, right. Why not?

01:13:15.269 --> 01:13:21.280
LEE: Yeah. Yeah. Last topic, just for this Episode&nbsp;
4. Roxana, of course, I think really made a name&nbsp;&nbsp;

01:13:21.280 --> 01:13:27.440
for herself in the AI era writing, actually just&nbsp;
prior to ChatGPT, you know, writing some famous&nbsp;&nbsp;

01:13:27.440 --> 01:13:34.160
papers about how computer vision systems for&nbsp;
dermatology were biased against dark-skinned&nbsp;&nbsp;

01:13:34.160 --> 01:13:44.960
people. And we did talk some about bias in these&nbsp;
AI systems, but I feel like we underplayed it,&nbsp;&nbsp;

01:13:44.960 --> 01:13:50.240
or we didn't understand the magnitude of the&nbsp;
potential issues. What are your thoughts?

01:13:50.240 --> 01:13:57.920
KOHANE: OK, I want to push back, because I've&nbsp;
been asked this question several times. And&nbsp;&nbsp;

01:13:57.920 --> 01:14:04.880
so I have two comments. One is, over&nbsp;
100,000 doctors practicing medicine,&nbsp;&nbsp;

01:14:04.880 --> 01:14:09.920
I know they have biases. Some of them&nbsp;
actually may be all in the same direction,&nbsp;&nbsp;

01:14:09.920 --> 01:14:15.920
and not good. But I have no way of actually&nbsp;
measuring that. With AI, I know exactly how&nbsp;&nbsp;

01:14:15.920 --> 01:14:24.240
to measure that at scale and affordably. Number&nbsp;
one. Number two, same 100,000 doctors. Let's say&nbsp;&nbsp;

01:14:24.240 --> 01:14:31.515
I do know what their biases are. How hard is it&nbsp;
for me to change that bias? It's impossible …

01:14:31.515 --> 01:14:32.468
LEE: Yeah, yeah.

01:14:32.468 --> 01:14:39.280
KOHANE: … practically speaking. Can I change the&nbsp;
bias in the AI? Somewhat. Maybe some completely.

01:14:39.280 --> 01:14:41.880
I think that we're in a much better situation.

01:14:41.880 --> 01:14:42.960
GOLDBERG: Agree.

01:14:42.960 --> 01:14:48.000
LEE: I think Roxana made also the super&nbsp;
interesting point that there's bias in&nbsp;&nbsp;

01:14:48.000 --> 01:14:54.137
the whole system, not just in individuals, but,&nbsp;
you know, there's structural bias, so to speak.

01:14:54.137 --> 01:14:54.155
KOHANE: There is.

01:14:54.155 --> 01:14:59.600
LEE: Yeah. Hmm. There was a super interesting&nbsp;
paper that Roxana wrote not too long ago—&nbsp;&nbsp;

01:15:00.160 --> 01:15:05.280
her and her collaborators—showing&nbsp;
AI's ability to detect, to spot&nbsp;&nbsp;

01:15:05.280 --> 01:15:10.360
bias decision-making by others.&nbsp;
Are we going to see more of that?

01:15:10.360 --> 01:15:13.360
KOHANE: Oh, yeah, I was very pleased when,&nbsp;&nbsp;

01:15:14.240 --> 01:15:17.680
in NEJM AI [New England Journal of Medicine&nbsp;
Artificial Intelligence], we published a piece&nbsp;&nbsp;

01:15:17.680 --> 01:15:27.920
with Marzyeh Ghassemi, and what they were talking&nbsp;
about was actually—and these are researchers who&nbsp;&nbsp;

01:15:27.920 --> 01:15:34.320
had published extensively on bias and threats&nbsp;
from AI. And they actually, in this article,&nbsp;&nbsp;

01:15:34.320 --> 01:15:40.080
did the flip side, which is how much better&nbsp;
AI can do than human beings in this respect.

01:15:40.080 --> 01:15:47.280
And so I think that as some of these computer&nbsp;
scientists enter the world of medicine,&nbsp;&nbsp;

01:15:47.280 --> 01:15:56.000
they're becoming more and more aware of&nbsp;
human foibles and can see how these systems,&nbsp;&nbsp;

01:15:56.000 --> 01:16:01.920
which if they only looked at the pretrained state,&nbsp;&nbsp;

01:16:01.920 --> 01:16:08.800
would have biases. But now, where we know how&nbsp;
to fine-tune the de-bias in a variety of ways,&nbsp;&nbsp;

01:16:08.800 --> 01:16:13.840
they can do a lot better and, in fact,&nbsp;
I think are much more … a much greater&nbsp;&nbsp;

01:16:13.840 --> 01:16:20.720
reason for optimism that we can change some of&nbsp;
these noxious biases than in the pre-AI era.

01:16:20.720 --> 01:16:25.760
GOLDBERG: And thinking about&nbsp;
Roxana's dermatological work on how&nbsp;&nbsp;

01:16:26.640 --> 01:16:35.200
I think there wasn't sufficient work on skin&nbsp;
tone as related to various growths, you know,&nbsp;&nbsp;

01:16:35.200 --> 01:16:42.800
I think that one thing that we totally missed in&nbsp;
the book was the dawn of multimodal uses, right.

01:16:42.800 --> 01:16:43.830
LEE: Yeah. Yeah, yeah.

01:16:43.830 --> 01:16:47.120
GOLDBERG: That's been truly amazing&nbsp;
that in fact all of these visual&nbsp;&nbsp;

01:16:47.120 --> 01:16:53.080
and other sorts of data can be entered&nbsp;
into the models and move them forward.

01:16:53.080 --> 01:17:00.560
LEE: Yeah. Well, maybe on these slightly more&nbsp;
optimistic notes, we're at time. You know,&nbsp;&nbsp;

01:17:00.560 --> 01:17:06.960
I think ultimately, I feel pretty good&nbsp;
still about what we did in our book,&nbsp;&nbsp;

01:17:06.960 --> 01:17:10.240
although there were a lot of misses.&nbsp;
[LAUGHS] I don't think any of us could&nbsp;&nbsp;

01:17:10.240 --> 01:17:14.621
really have predicted really the&nbsp;
extent of change in the world.

01:17:14.621 --> 01:17:18.880
[TRANSITION MUSIC]
So, Carey, Zak, just so much fun to

01:17:18.880 --> 01:17:23.680
do some reminiscing but also some&nbsp;
reflection about what we did.

01:17:23.680 --> 01:17:31.120
[THEME MUSIC]

01:17:31.120 --> 01:17:35.360
And to our listeners, as always, thank you&nbsp;
for joining us. We have some really great&nbsp;&nbsp;

01:17:35.360 --> 01:17:39.440
guests lined up for the rest of the series, and&nbsp;
they’ll help us explore a variety of relevant&nbsp;&nbsp;

01:17:39.440 --> 01:17:44.880
topics—from AI drug discovery to what medical&nbsp;
students are seeing and doing with AI and more.

01:17:44.880 --> 01:17:47.600
We hope you’ll continue to tune&nbsp;
in. And if you want to catch up&nbsp;&nbsp;

01:17:47.600 --> 01:17:50.480
on any episodes you might have&nbsp;
missed, you can find them at&nbsp;&nbsp;

01:17:50.480 --> 01:17:57.520
aka.ms/AIrevolutionPodcast or wherever&nbsp;
you listen to your favorite podcasts.  

01:17:57.520 --> 01:18:05.760
Until next time. 

01:18:05.760 --> 01:18:06.728
[MUSIC FADES]