WEBVTT

00:00:00.000 --> 00:00:02.307
[MUSIC PLAYS UNDER DIALOGUE]

00:00:02.307 --> 00:00:06.480
KALIKA BALI: I do think, in some sense,&nbsp;
the pushback that I got for my idea makes&nbsp;&nbsp;

00:00:06.480 --> 00:00:11.120
me think it was outrageous. I didn't think it&nbsp;
was outrageous at all at that time! I thought&nbsp;&nbsp;

00:00:11.120 --> 00:00:17.280
it was a very reasonable idea! But there was&nbsp;
a very solid pushback and not just from your&nbsp;&nbsp;

00:00:17.280 --> 00:00:22.120
colleagues. You know, for researchers,&nbsp;
publishing papers is important! No one&nbsp;&nbsp;

00:00:22.120 --> 00:00:26.040
would publish a paper which focused only&nbsp;
on, say, Indian languages or low-resource&nbsp;&nbsp;

00:00:26.040 --> 00:00:30.680
languages. We've come a very long way&nbsp;
even in the research community on that,&nbsp;&nbsp;

00:00:30.680 --> 00:00:37.120
right. We kept pushing, pushing, pushing! And&nbsp;
now there are tracks, there are workshops,&nbsp;&nbsp;

00:00:37.120 --> 00:00:43.848
there are conferences which are devoted to&nbsp;
multilingual and low-resource languages.

00:00:43.848 --> 00:00:43.861
[TEASER ENDS]

00:00:43.861 --> 00:00:48.280
GRETCHEN HUIZINGA: You’re listening to Ideas,&nbsp;
a Microsoft Research Podcast that dives deep&nbsp;&nbsp;

00:00:48.280 --> 00:00:52.120
into the world of technology research&nbsp;
and the profound questions behind the&nbsp;&nbsp;

00:00:52.120 --> 00:01:04.680
code. I’m Dr. Gretchen Huizinga. In this&nbsp;
series, we’ll explore the technologies&nbsp;&nbsp;

00:01:04.680 --> 00:01:10.667
that are shaping our future and the&nbsp;
big ideas that propel them forward.

00:01:10.667 --> 00:01:10.680
[MUSIC FADES]

00:01:10.680 --> 00:01:15.520
I'm excited to be live in the booth today with&nbsp;
Kalika Bali, a principal researcher at Microsoft&nbsp;&nbsp;

00:01:15.520 --> 00:01:20.560
Research India. Kalika is working on language&nbsp;
technologies that she hopes will bring the&nbsp;&nbsp;

00:01:20.560 --> 00:01:25.880
benefits of generative AI to under-resourced&nbsp;
and underserved language communities around&nbsp;&nbsp;

00:01:25.880 --> 00:01:31.147
the world. Kalika, it's a pleasure to&nbsp;
speak with you today. Welcome to Ideas!

00:01:31.147 --> 00:01:33.640
KALIKA BALI: Thank you. Thank you,&nbsp;
Gretchen. Thank you for having me.

00:01:33.640 --> 00:01:38.640
HUIZINGA: So before we dive in on the&nbsp;
big ideas behind Kalika Bali's research,&nbsp;&nbsp;

00:01:38.640 --> 00:01:44.040
let's talk about you for a second. Tell&nbsp;
us about your “origin story,” as it were,&nbsp;&nbsp;

00:01:44.040 --> 00:01:49.360
and if there is one, what “big idea”&nbsp;
or animating “what if?” captured your&nbsp;&nbsp;

00:01:49.360 --> 00:01:52.340
imagination and inspired you&nbsp;
to do what you're doing today?

00:01:52.340 --> 00:01:59.240
BALI: So, you know, I’m a great reader. I started&nbsp;
reading well before I was taught in school how to&nbsp;&nbsp;

00:01:59.240 --> 00:02:06.120
read, and I loved science fiction. I come from&nbsp;
a family where reading was very much a part of&nbsp;&nbsp;

00:02:06.120 --> 00:02:14.680
our everyday lives. My dad was a journalist, and&nbsp;
I had read a lot of science fiction growing up,&nbsp;&nbsp;

00:02:14.680 --> 00:02:23.560
and I also saw a lot of science fiction, you know,&nbsp;
movies … Star Trek … everything that I could get&nbsp;&nbsp;

00:02:23.560 --> 00:02:34.240
hold of in India. And I remember watching 2001:&nbsp;
Space Odyssey. And there was this HAL that spoke.&nbsp;&nbsp;

00:02:34.240 --> 00:02:41.200
He actually communicated that he was a computer.&nbsp;
And I was just so struck by it. I was like,&nbsp;&nbsp;

00:02:41.200 --> 00:02:49.160
this is so cool! You know, here are computers that&nbsp;
can talk! Now, how cool would that be if it would&nbsp;&nbsp;

00:02:49.160 --> 00:02:54.880
happen in real life? I was not at all aware of&nbsp;
what was happening in speech technology, whether&nbsp;&nbsp;

00:02:54.880 --> 00:03:00.400
it was possible or not possible, but that's&nbsp;
something that really got me into it. I've always,&nbsp;&nbsp;

00:03:00.400 --> 00:03:05.720
like, kind of, been very curious about languages&nbsp;
and how they work and, you know, how people use&nbsp;&nbsp;

00:03:05.720 --> 00:03:12.440
different things in languages to express not just&nbsp;
meaning, not just communicating, but you know&nbsp;&nbsp;

00:03:12.440 --> 00:03:19.560
expressing themselves, really. And so I think&nbsp;
it's a combination of HAL and this curiosity I&nbsp;&nbsp;

00:03:19.560 --> 00:03:26.760
had about the various ways in which people use&nbsp;
languages that got me into what I'm doing now.

00:03:26.760 --> 00:03:30.440
HUIZINGA: OK. So that's an interesting path,&nbsp;
and I want to go into that just a little bit,&nbsp;&nbsp;

00:03:30.440 --> 00:03:34.630
but let me anchor this: how old were&nbsp;
you when you saw this talking computer?

00:03:34.630 --> 00:03:36.280
BALI: Oh, I was in my early teens.

00:03:36.280 --> 00:03:41.554
HUIZINGA: OK. And so at that time,&nbsp;
did you have any conception that … ?

00:03:41.554 --> 00:03:48.720
BALI: No. You know, there weren't computers&nbsp;
around me when I was growing up. We saw,&nbsp;&nbsp;

00:03:48.720 --> 00:03:54.230
you know, some at school, you&nbsp;
know, people coded in BASIC …

00:03:54.230 --> 00:03:54.960
HUIZINGA: Right?

00:03:54.960 --> 00:04:01.060
BALI: And we heard about them a lot, but I&nbsp;
hadn't seen one since I was in high school.

00:04:01.060 --> 00:04:05.240
HUIZINGA: OK. So there's this&nbsp;
inception moment, an aha moment,&nbsp;&nbsp;

00:04:05.240 --> 00:04:11.680
of that little spark and then you kind of&nbsp;
drifted away from the computer side of it,&nbsp;&nbsp;

00:04:11.680 --> 00:04:15.200
and what … tell us about how&nbsp;
you went from there to that!

00:04:15.200 --> 00:04:19.720
BALI: So that, that's actually a very funny story&nbsp;
because I actually wanted to study chemistry. I&nbsp;&nbsp;

00:04:19.720 --> 00:04:26.840
was really fascinated by how these, you know,&nbsp;
molecular parts rotate around each other and,&nbsp;&nbsp;

00:04:26.840 --> 00:04:32.040
you know, we can't even tell where an electron&nbsp;
is, etc. It sounded, like, really fun and cool.&nbsp;&nbsp;

00:04:32.040 --> 00:04:38.320
So I actually studied chemistry, but then I&nbsp;
was actually going to pick up the admission&nbsp;&nbsp;

00:04:38.320 --> 00:04:44.760
form for my sister, who wanted to study in this&nbsp;
university, and … or, no, she wanted to take an&nbsp;&nbsp;

00:04:44.760 --> 00:04:52.840
exam for her master's. And I went there. I picked&nbsp;
up the form, and I said, this is a cool place.&nbsp;&nbsp;

00:04:52.840 --> 00:04:57.720
I would love to study here! And then I started&nbsp;
looking at everything like, you know, what can I&nbsp;&nbsp;

00:04:57.720 --> 00:05:03.880
apply for here? And something called linguistics&nbsp;
came up, and I had no idea what linguistics was.&nbsp;&nbsp;

00:05:03.880 --> 00:05:09.800
So I went to the British Library, got like a thin&nbsp;
book on ntroduction to linguistics, and it sounded&nbsp;&nbsp;

00:05:09.800 --> 00:05:15.160
fun! And I took the exam. And then, as they&nbsp;
say, that was history. Then I just got into it.

00:05:15.160 --> 00:05:20.000
HUIZINGA: OK. I mean, so much has&nbsp;
happened in between then and now,&nbsp;&nbsp;

00:05:20.000 --> 00:05:25.360
and I think we'll kind of get there in … but&nbsp;
I do want you to connect the larger dot from&nbsp;&nbsp;

00:05:25.360 --> 00:05:31.640
how you got from linguistics to Microsoft&nbsp;
Research [LAUGHTER] as a computer scientist.

00:05:31.640 --> 00:05:38.640
BALI: So I actually started teaching at the&nbsp;
University of South Pacific as a linguistics&nbsp;&nbsp;

00:05:38.640 --> 00:05:46.000
faculty in Fiji. And I was very interested in&nbsp;
acoustics of speech sounds, etc., etc. That's&nbsp;&nbsp;

00:05:46.000 --> 00:05:52.080
what I was teaching. And then there was a speech&nbsp;
company in Belgium that was looking to start&nbsp;&nbsp;

00:05:52.080 --> 00:06:00.400
some work in Indian languages, and they contacted&nbsp;
me, and at that time, you needed people who knew&nbsp;&nbsp;

00:06:00.400 --> 00:06:06.640
about languages to build language technology,&nbsp;
especially people who knew about phonetics,&nbsp;&nbsp;

00:06:06.640 --> 00:06:13.320
acoustics, for speech technology. And that's how&nbsp;
I got into it. And then, you know, I just went&nbsp;&nbsp;

00:06:13.320 --> 00:06:20.440
from startups to companies and then Microsoft&nbsp;
Research, 18 years ago, almost 18 years ago.

00:06:20.440 --> 00:06:25.400
HUIZINGA: Wow. OK. I would love to actually talk&nbsp;
to you about all that time. But we don't have time&nbsp;&nbsp;

00:06:25.400 --> 00:06:29.960
because I have a lot more things to talk to you&nbsp;
about, technology-wise. But I do want to know,&nbsp;&nbsp;

00:06:29.960 --> 00:06:35.360
you know, how would you describe the ideas&nbsp;
behind your overarching research philosophy,&nbsp;&nbsp;

00:06:35.360 --> 00:06:40.000
and who are your influences, as they say&nbsp;
in the rock-and-roll world? [LAUGHTER] Who&nbsp;&nbsp;

00:06:40.000 --> 00:06:47.320
inspired you? Real-life person, scientist&nbsp;
or not, besides, HAL 9000, who’s fictional,&nbsp;&nbsp;

00:06:47.320 --> 00:06:51.480
and any seminal papers that, sort of,&nbsp;
got you interested in that along the way?

00:06:51.480 --> 00:06:58.840
BALI: So since I was really into speech, Ken&nbsp;
Stevens—who was a professor, who sadly is&nbsp;&nbsp;

00:06:58.840 --> 00:07:05.520
no longer with us anymore, at MIT—was a big&nbsp;
influence. He, kind of, had this whole idea&nbsp;&nbsp;

00:07:05.520 --> 00:07:13.520
of how speech is produced. And, you know, the&nbsp;
first time I was exposed to the whole idea of&nbsp;&nbsp;

00:07:13.520 --> 00:07:21.560
the mathematics behind the speech, and I think he&nbsp;
influenced me a lot on the speech side of things.&nbsp;&nbsp;

00:07:21.560 --> 00:07:29.280
For the language side of things, you know, my&nbsp;
professor in India Professor Anvita Abbi—you know,&nbsp;&nbsp;

00:07:29.280 --> 00:07:35.760
she's a Padma Shri, like, she's been awarded by&nbsp;
the Indian government for her work in, you know,&nbsp;&nbsp;

00:07:35.760 --> 00:07:43.840
very obscure, endangered languages—you know, she&nbsp;
kind of gave me a feel for what languages are,&nbsp;&nbsp;

00:07:43.840 --> 00:07:50.140
and why they are important, and why it's&nbsp;
important to save them and not let them die away.

00:07:50.140 --> 00:07:50.720
HUIZINGA: Right.

00:07:50.720 --> 00:07:56.960
BALI: So I think I would say both of them.&nbsp;
But what really got me into wanting to work&nbsp;&nbsp;

00:07:56.960 --> 00:08:02.480
with Indian language technology in a big way was&nbsp;
I was working in Belgium, I was working in London,&nbsp;&nbsp;

00:08:02.480 --> 00:08:09.680
and I saw the beginning of how technology&nbsp;
is, kind of, you know, making things easier,&nbsp;&nbsp;

00:08:09.680 --> 00:08:16.080
exciting; there’s cool technology available&nbsp;
for English, for French, for German … But&nbsp;&nbsp;

00:08:16.080 --> 00:08:23.120
in a country like India, it was more about&nbsp;
giving access to people who have no access,&nbsp;&nbsp;

00:08:23.120 --> 00:08:28.960
right? It actually mattered, because here&nbsp;
are people who may not be very literate&nbsp;&nbsp;

00:08:28.960 --> 00:08:33.960
and therefore not be able to use technology&nbsp;
in the way we know it, but they can talk.

00:08:33.960 --> 00:08:34.580
HUIZINGA: Right.

00:08:34.580 --> 00:08:39.880
BALI: And they can speak, and they should&nbsp;
be able to access technology by doing that.

00:08:39.880 --> 00:08:43.640
HUIZINGA: Right. OK. So just real quickly,&nbsp;&nbsp;

00:08:43.640 --> 00:08:52.540
that was then. What have you seen change in that&nbsp;
time, and how profoundly have the ideas evolved?

00:08:52.540 --> 00:08:59.640
BALI: So just from pure methodology and what's&nbsp;
possible, you know, I have seen it all. When I&nbsp;&nbsp;

00:08:59.640 --> 00:09:05.520
started working in language technology, mainly for&nbsp;
Indian languages, but even for other languages,&nbsp;&nbsp;

00:09:05.520 --> 00:09:12.400
it was all a rule-based system. So everybody&nbsp;
had to create all these rules that then were,&nbsp;&nbsp;

00:09:12.400 --> 00:09:18.000
you know, responsible for building or like&nbsp;
making that technology work. But then,&nbsp;&nbsp;

00:09:18.000 --> 00:09:22.400
just at that time, you know, all the&nbsp;
statistical systems and methodologies&nbsp;&nbsp;

00:09:22.400 --> 00:09:27.960
came into being. So we had hidden Markov&nbsp;
models, you know, doing their thing in speech,&nbsp;&nbsp;

00:09:27.960 --> 00:09:33.720
and it was all about a lot of data. But that&nbsp;
data still had to be procured in a certain way,&nbsp;&nbsp;

00:09:33.720 --> 00:09:39.440
labeled, annotated. It was still a very&nbsp;
long and resource-intensive process. Now,&nbsp;&nbsp;

00:09:39.440 --> 00:09:46.296
with generative AI, the thing that I am excited&nbsp;
about is, we have a very powerful tool, right?

00:09:46.296 --> 00:09:46.314
HUIZINGA: Mm-hmm.

00:09:46.314 --> 00:09:52.240
BALI: And, yes, it requires a lot&nbsp;
of data, but it can learn also;&nbsp;&nbsp;

00:09:52.240 --> 00:09:55.430
you know, we can fine-tune&nbsp;
stuff on smaller datasets …

00:09:55.430 --> 00:09:55.670
HUIZINGA: Yeah …

00:09:55.670 --> 00:10:01.040
BALI: … to work for, you know, relevant things.&nbsp;
So it's not going to take me years and years&nbsp;&nbsp;

00:10:01.040 --> 00:10:06.680
and years to first procure the data, then&nbsp;
have it tagged for part of speech … then,&nbsp;&nbsp;

00:10:06.680 --> 00:10:11.600
you know, have it tagged for sentiment, have&nbsp;
it tagged for this, have it tagged for that,&nbsp;&nbsp;

00:10:11.600 --> 00:10:14.700
and then, only can I think of building anything.

00:10:14.700 --> 00:10:15.180
HUIZINGA: Right.

00:10:15.180 --> 00:10:22.760
BALI: So it just shortens that timeline&nbsp;
so much, and it's very exciting.

00:10:22.760 --> 00:10:28.200
HUIZINGA: Right. As an ex-English teacher—which I&nbsp;
don't think there is such a thing as an ex-English&nbsp;&nbsp;

00:10:28.200 --> 00:10:32.080
teacher; you're always silently correcting&nbsp;
someone's grammar! [LAUGHTER]—just what you&nbsp;&nbsp;

00:10:32.080 --> 00:10:38.480
said about tagging parts of speech as what they&nbsp;
are, right? And that, I used to teach that. And&nbsp;&nbsp;

00:10:38.480 --> 00:10:43.920
then you start to think, how would you translate&nbsp;
that for a machine? So fascinating. So, Kalika,&nbsp;&nbsp;

00:10:43.920 --> 00:10:48.040
you have said that your choice of career was&nbsp;
accidental—and you’ve alluded to the, sort of,&nbsp;&nbsp;

00:10:48.040 --> 00:10:54.840
the fortuitous things that happened along&nbsp;
the way—but that linguistics is one subject&nbsp;&nbsp;

00:10:54.840 --> 00:10:59.720
that goes from absolute science to&nbsp;
absolute philosophy. Can you unpack&nbsp;&nbsp;

00:10:59.720 --> 00:11:04.320
that a little bit more and how this idea&nbsp;
impacted your work in language technology?

00:11:04.320 --> 00:11:10.320
BALI: Yeah. So, so if you think about it, you&nbsp;
know, language has a physical aspect, right. We&nbsp;&nbsp;

00:11:10.320 --> 00:11:16.600
move our various speech organs in a certain way.&nbsp;
Our ears are constructed in a certain way. There&nbsp;&nbsp;

00:11:16.600 --> 00:11:24.240
is a physics of it where, when I speak, there are&nbsp;
sound waves, right, which are going into your ear,&nbsp;&nbsp;

00:11:24.240 --> 00:11:31.720
and that's being interpreted. So, you know, if you&nbsp;
think about that, that's like an absolute science&nbsp;&nbsp;

00:11:32.680 --> 00:11:38.080
behind it, right? But then, when you come to&nbsp;
the structure of language, you know, the syntax,&nbsp;&nbsp;

00:11:38.080 --> 00:11:43.720
like you're an English teacher, so you know this&nbsp;
really well, that you know, there’s semantics;&nbsp;&nbsp;

00:11:43.720 --> 00:11:50.240
there’s, you know, morphology, how our words form,&nbsp;
how our sentences form. And that’s like a very&nbsp;&nbsp;

00:11:50.240 --> 00:11:58.454
abstract kind of method that allows us to put,&nbsp;
you know, meaningful sentences out there, right?

00:11:58.454 --> 00:11:58.474
HUIZINGA: Right …

00:11:58.474 --> 00:12:05.160
BALI: But then there's this other part of how&nbsp;
language works in society, right. The way I&nbsp;&nbsp;

00:12:05.160 --> 00:12:09.920
talk to my mother would be probably very&nbsp;
different to the way I'm talking to you,&nbsp;&nbsp;

00:12:09.920 --> 00:12:14.800
would be very different from the way I talk to my&nbsp;
friends, at a very basic level, right? The way,&nbsp;&nbsp;

00:12:14.800 --> 00:12:19.600
in India, I would greet someone older&nbsp;
to me would be very different from the&nbsp;&nbsp;

00:12:19.600 --> 00:12:24.600
way I would greet somebody here, because here&nbsp;
it's like much less formal and that, you know,&nbsp;&nbsp;

00:12:24.600 --> 00:12:30.160
age hierarchy is probably less? If I did the same&nbsp;
thing in India, I would be considered the rudest&nbsp;&nbsp;

00:12:30.160 --> 00:12:37.160
creature ever. [LAUGHS] So … and then, you know,&nbsp;
you go into the whole philosophy—psycholinguistics&nbsp;&nbsp;

00:12:37.160 --> 00:12:42.320
part. What happens in our brains, you know, when&nbsp;
we are speaking? Because language is controlled&nbsp;&nbsp;

00:12:42.320 --> 00:12:48.360
by various parts of our brain, right. And&nbsp;
then, you go to the pure philosophy part,&nbsp;&nbsp;

00:12:48.360 --> 00:12:57.200
like why? How does language even occur? Why do&nbsp;
we name things the way we name things? You know,&nbsp;&nbsp;

00:12:57.200 --> 00:13:03.220
why do we have a language of thought? You know,&nbsp;
what language are we thinking in? [LAUGHTER]

00:13:03.220 --> 00:13:03.880
HUIZINGA: Right.

00:13:03.880 --> 00:13:09.439
BALI: So, so it really does cover&nbsp;
the entire gamut of language …

00:13:09.439 --> 00:13:09.470
HUIZINGA: Yeah, yeah, yeah …

00:13:09.470 --> 00:13:11.480
BALI: … like from science to philosophy.

00:13:11.480 --> 00:13:15.640
HUIZINGA: Yeah, as I said before, when we&nbsp;
were talking out there, my mother-in-law&nbsp;&nbsp;

00:13:15.640 --> 00:13:22.800
was from Holland, and every time she did math or&nbsp;
adding, she would do it in Dutch, which—she'd be&nbsp;&nbsp;

00:13:22.800 --> 00:13:27.280
speaking in English and then she'd go over here&nbsp;
and count in Dutch out loud. And it's like, yeah,&nbsp;&nbsp;

00:13:27.280 --> 00:13:32.840
your brain switches back and forth. This is so&nbsp;
exciting to me. I had no idea how much I would&nbsp;&nbsp;

00:13:32.840 --> 00:13:38.600
love this podcast! So, much of your research&nbsp;
is centered on this big idea called “design&nbsp;&nbsp;

00:13:38.600 --> 00:13:44.000
thinking,” and it's got a whole discipline in&nbsp;
universities around the world. And you've talked&nbsp;&nbsp;

00:13:44.000 --> 00:13:49.760
about using something you call the 4D process&nbsp;
for your work. Could you explain that process,&nbsp;&nbsp;

00:13:49.760 --> 00:13:53.160
and how it plays out in the research&nbsp;
you do with the communities you serve?

00:13:53.160 --> 00:13:59.200
BALI: Yeah, so we've kind of adapted this.&nbsp;
My ex-colleague Monojit Choudhury and I,&nbsp;&nbsp;

00:13:59.200 --> 00:14:07.280
kind of, came up with this whole thing about&nbsp;
4D thinking, which is essentially discover,&nbsp;&nbsp;

00:14:07.280 --> 00:14:14.440
design, develop and deploy, right. And when we&nbsp;
are working with, especially with, marginalized&nbsp;&nbsp;

00:14:14.440 --> 00:14:20.280
or low-resource-language communities, the very&nbsp;
basic thing we have to do is discover, because&nbsp;&nbsp;

00:14:20.280 --> 00:14:26.200
we cannot go with, you know, our own ideas and&nbsp;
perceptions about what is required. And I can give&nbsp;&nbsp;

00:14:26.200 --> 00:14:31.320
you a very good example of this, right. You know,&nbsp;
most of us, as researchers and technologists,&nbsp;&nbsp;

00:14:31.320 --> 00:14:34.720
when we think of language technology, we are&nbsp;
thinking about machine translation; we're thinking&nbsp;&nbsp;

00:14:34.720 --> 00:14:39.240
about speech recognition; we are thinking about&nbsp;
state-of-the-art technology. And here we were&nbsp;&nbsp;

00:14:39.240 --> 00:14:45.720
talking to a community that spoke the language&nbsp;
Idu Mishmi, which is a very small community in&nbsp;&nbsp;

00:14:45.720 --> 00:14:51.760
northeast of India. And we were talking about,&nbsp;
you know, we can do this, we can do that. And&nbsp;&nbsp;

00:14:51.760 --> 00:14:58.630
they just turned to us and said, what we really&nbsp;
want is a mobile digital dictionary! [LAUGHS]

00:14:58.630 --> 00:14:59.554
HUIZINGA: Wow. Yeah …

00:14:59.554 --> 00:15:06.840
BALI: Right? And, you know, if you don't talk,&nbsp;
if you don't observe, if you are not open to what&nbsp;&nbsp;

00:15:06.840 --> 00:15:13.280
the community's needs might be, then you'll&nbsp;
miss that, right. You’ll miss the real thing&nbsp;&nbsp;

00:15:13.280 --> 00:15:18.080
that will make a difference to that community.&nbsp;
So that's the discover part. The design part,&nbsp;&nbsp;

00:15:18.080 --> 00:15:25.320
again, you have to design with the community. You&nbsp;
cannot go and design a system that they are unable&nbsp;&nbsp;

00:15:25.320 --> 00:15:32.720
to use properly, right. And again, another&nbsp;
very good example, one of the people I know,&nbsp;&nbsp;

00:15:32.720 --> 00:15:38.920
you know, he gave me this very good example of&nbsp;
why you have to think, even at the architecture&nbsp;&nbsp;

00:15:38.920 --> 00:15:44.760
level when you’re designing such things,&nbsp;
is like a lot of applications in India and&nbsp;&nbsp;

00:15:44.760 --> 00:15:51.440
around the world require your telephone&nbsp;
number for verification. Now, for women,&nbsp;&nbsp;

00:15:51.440 --> 00:15:56.760
it might be a safety issue. They might not want&nbsp;
to give their telephone number. Or in India,&nbsp;&nbsp;

00:15:56.760 --> 00:16:00.400
many women might not even have a&nbsp;
telephone, like a mobile number,&nbsp;&nbsp;

00:16:00.400 --> 00:16:05.720
right. So how do you think of other ways in&nbsp;
which they can verify, right? And so that's&nbsp;&nbsp;

00:16:05.720 --> 00:16:11.040
the design part. The develop and the deploy part,&nbsp;
kind of, go hand in hand, because I think it's a&nbsp;&nbsp;

00:16:11.040 --> 00:16:19.510
very iterative process. You develop quickly, you&nbsp;
put it out there, allow it to fail and, you know …

00:16:19.510 --> 00:16:20.634
HUIZINGA: Mm-hmm. Iterate …

00:16:20.634 --> 00:16:26.620
BALI: Iterate. So that's like the,&nbsp;
kind of, design thinking that we have.

00:16:26.620 --> 00:16:31.120
HUIZINGA: Yeah, I see that happening&nbsp;
in accessibility technology areas,&nbsp;&nbsp;

00:16:31.120 --> 00:16:33.514
too, as well as language …

00:16:33.514 --> 00:16:36.040
BALI: Yeah, and, you know,&nbsp;
working with the communities,&nbsp;&nbsp;

00:16:36.040 --> 00:16:39.360
very quickly, you become really humble.

00:16:39.360 --> 00:16:39.560
HUIZINGA: Sure.

00:16:39.560 --> 00:16:46.040
BALI: There's a lot of humility in me now.&nbsp;
Though I have progressed in my career and,&nbsp;&nbsp;

00:16:46.040 --> 00:16:51.280
you know, supposedly become wiser,&nbsp;
I am much more humble about what&nbsp;&nbsp;

00:16:51.280 --> 00:16:56.660
I know and what I can do than I&nbsp;
was when I started off, you know.

00:16:56.660 --> 00:17:02.240
HUIZINGA: I love that. Well, one thing I want&nbsp;
to talk to you about that has intrigued me,&nbsp;&nbsp;

00:17:02.240 --> 00:17:05.580
there's a thing that happens in&nbsp;
India where you mix languages …

00:17:05.580 --> 00:17:05.590
BALI: Yes!

00:17:05.590 --> 00:17:09.600
HUIZINGA: You speak both Hindi and English at the&nbsp;
same time, and you think, oh, you speak English,&nbsp;&nbsp;

00:17:09.600 --> 00:17:13.280
but it's like, no, there's words I don't&nbsp;
understand in that. What do you call that,&nbsp;&nbsp;

00:17:13.280 --> 00:17:16.480
and how did that drive your interest? I mean,&nbsp;&nbsp;

00:17:16.480 --> 00:17:19.820
that was kind of an early-on kind of thing&nbsp;
in your work, right? Talk about that.

00:17:19.820 --> 00:17:24.960
BALI: So that's called code-mixing or&nbsp;
code-switching. The only linguistic&nbsp;&nbsp;

00:17:24.960 --> 00:17:27.360
difference is code-mixing&nbsp;
happens within a sentence,&nbsp;&nbsp;

00:17:27.360 --> 00:17:30.200
and code-switching means one&nbsp;
sentence in one language and another.

00:17:30.200 --> 00:17:30.880
HUIZINGA: Oh, really?

00:17:30.880 --> 00:17:35.760
BALI: Yeah. So … but this is, like, not&nbsp;
just India. This is a very, very common&nbsp;&nbsp;

00:17:35.760 --> 00:17:41.640
feature of multilingual societies all over the&nbsp;
world. So it's not multilingual individuals,&nbsp;&nbsp;

00:17:41.640 --> 00:17:46.840
but at the societal level, when you&nbsp;
have multilingualism, then, you know,&nbsp;&nbsp;

00:17:46.840 --> 00:17:53.160
this is a marker of multilingualism. But&nbsp;
code-mixing particularly means that you&nbsp;&nbsp;

00:17:53.160 --> 00:17:58.520
have to be fluent in both languages to actually&nbsp;
code-mix, right. You have to have a certain amount&nbsp;&nbsp;

00:17:58.520 --> 00:18:04.160
of fluency in both languages. And there are&nbsp;
various reasons why people do this. You know,&nbsp;&nbsp;

00:18:04.160 --> 00:18:10.840
it's been studied by psychologists and linguists&nbsp;
for a long time. And for most people like me,&nbsp;&nbsp;

00:18:10.840 --> 00:18:16.720
multilingual people, that's the language we dream&nbsp;
in, we think about. [LAUGHTER] That's the language&nbsp;&nbsp;

00:18:16.720 --> 00:18:22.450
we talk to our siblings and friends in, right. And&nbsp;
for us, it's, like, just natural. We just keep …

00:18:22.450 --> 00:18:22.470
HUIZINGA: Mixing …

00:18:22.470 --> 00:18:28.240
BALI: … flipping between the two languages for a&nbsp;
variety of reasons. We might do it for emphasis;&nbsp;&nbsp;

00:18:28.240 --> 00:18:33.720
we might do it for humor. We might&nbsp;
just decide, OK, I'm going to pick&nbsp;&nbsp;

00:18:33.720 --> 00:18:38.095
this from this … the brain decides I'm&nbsp;
going to pick this from this language …

00:18:38.095 --> 00:18:38.110
HUIZINGA: Sure.

00:18:38.110 --> 00:18:42.160
BALI: … and this … So the reason we&nbsp;
got interested in, like, looking into&nbsp;&nbsp;

00:18:42.160 --> 00:18:48.280
code-mixing was that when we are saying that we&nbsp;
want humans to be able to interact with machines&nbsp;&nbsp;

00:18:48.280 --> 00:18:57.538
in their most natural language, then by some&nbsp;
estimates, half the world speaks like this!

00:18:57.538 --> 00:18:57.554
HUIZINGA: Right.

00:18:57.554 --> 00:19:03.400
BALI: So we have to be able to understand&nbsp;
exactly how they speak and, you know,&nbsp;&nbsp;

00:19:03.400 --> 00:19:08.504
be able to process and understand&nbsp;
their language, which is code-mixed …

00:19:08.504 --> 00:19:11.760
HUIZINGA: Sure. Well, it seems&nbsp;
like the human brain can pick&nbsp;&nbsp;

00:19:11.760 --> 00:19:15.360
this up and process it fairly quickly and easily,&nbsp;&nbsp;

00:19:15.360 --> 00:19:20.680
especially if it knows many languages. For&nbsp;
a machine, it would be much more difficult?

00:19:20.680 --> 00:19:25.760
BALI: It is. So initially, it was&nbsp;
really difficult because, you know,&nbsp;&nbsp;

00:19:25.760 --> 00:19:29.710
the way we created systems&nbsp;
was one language at a time …

00:19:29.710 --> 00:19:30.590
HUIZINGA: Right!

00:19:30.590 --> 00:19:34.560
BALI: … right. And it's not about&nbsp;
having an English engine and a&nbsp;&nbsp;

00:19:34.560 --> 00:19:38.781
Hindi engine available. It doesn't work that way.

00:19:38.781 --> 00:19:38.794
HUIZINGA: No!

00:19:38.794 --> 00:19:43.040
BALI: So you’d really need something that,&nbsp;
you know, is able to tackle the languages&nbsp;&nbsp;

00:19:43.040 --> 00:19:47.640
together. And in some theories, this is&nbsp;
almost considered a language of its own&nbsp;&nbsp;

00:19:47.640 --> 00:19:51.230
because it's not like you're randomly&nbsp;
mixing. There is a structure to …

00:19:51.230 --> 00:19:51.720
HUIZINGA: Oh, is there?

00:19:51.720 --> 00:19:54.560
BALI: Yeah. Where you can, where you can't …

00:19:54.560 --> 00:19:55.080
HUIZINGA: Gotcha.

00:19:55.080 --> 00:19:58.800
BALI: You know, so there is a&nbsp;
structure or grammar, you can say,&nbsp;&nbsp;

00:19:58.800 --> 00:20:05.120
of code-mixing. So we went after that.&nbsp;
We, kind of, created tools which could&nbsp;&nbsp;

00:20:05.120 --> 00:20:11.758
generate grammatically viable code-mixed&nbsp;
sentences given parallel data, etc.

00:20:11.758 --> 00:20:11.794
HUIZINGA: That’s awesome. Amazing.

00:20:11.794 --> 00:20:16.160
BALI: So, yeah, it takes effort&nbsp;
to do it. But again, right now,&nbsp;&nbsp;

00:20:16.160 --> 00:20:20.560
because the generative AI models&nbsp;
have at their disposal, you know,&nbsp;&nbsp;

00:20:20.560 --> 00:20:25.560
so many languages and at least, like,&nbsp;
theoretically can work in many, many,&nbsp;&nbsp;

00:20:25.560 --> 00:20:30.280
many languages, you know, code-mixing might&nbsp;
be an easier problem to solve right now.

00:20:30.280 --> 00:20:36.840
HUIZINGA: Right. OK. So we're talking&nbsp;
mostly about widely used languages,&nbsp;&nbsp;

00:20:36.840 --> 00:20:42.960
and you're very concerned right now on this&nbsp;
idea of low-resource languages. So unpack&nbsp;&nbsp;

00:20:42.960 --> 00:20:49.060
what you mean by low-resource, and what's missing&nbsp;
from the communities that speak those languages?

00:20:49.060 --> 00:20:55.440
BALI: Yeah. So when we say low-resource languages,&nbsp;
we typically mean that languages do not have, say,&nbsp;&nbsp;

00:20:55.440 --> 00:21:00.320
digital resources, linguistic resources,&nbsp;
language resources, that would enable&nbsp;&nbsp;

00:21:00.320 --> 00:21:06.160
technology building. It doesn't mean that the&nbsp;
communities themselves are impoverished in&nbsp;&nbsp;

00:21:06.160 --> 00:21:12.680
culture or linguistic richness, etc., right.&nbsp;
But the reason why these communities do not&nbsp;&nbsp;

00:21:12.680 --> 00:21:18.720
have a lot of language resources, linguistic&nbsp;
resources, digital resources, most of the time,&nbsp;&nbsp;

00:21:18.720 --> 00:21:25.140
it is because they are also marginalized in&nbsp;
other ways … social and economic marginalization.

00:21:25.140 --> 00:21:25.940
HUIZINGA: Right.

00:21:25.940 --> 00:21:30.560
BALI: And these are … if you look at them, they’re&nbsp;
not ti—I mean, of course, some of them are tiny,&nbsp;&nbsp;

00:21:30.560 --> 00:21:34.420
but when we say low-resource communities,&nbsp;
we are talking about really big numbers.

00:21:34.420 --> 00:21:35.160
HUIZINGA: Oh, really?

00:21:35.160 --> 00:21:39.000
BALI: Yeah. So one of the languages that I have&nbsp;
worked with—language communities that I've worked&nbsp;&nbsp;

00:21:39.000 --> 00:21:45.280
with—speak a language called Gondi, which is like&nbsp;
a Dravidian language that is spoken in … like a&nbsp;&nbsp;

00:21:45.280 --> 00:21:52.400
South Indian language that is spoken in north,&nbsp;
central-north area. It's a tribal language,&nbsp;&nbsp;

00:21:53.000 --> 00:21:58.520
and it's got around three million speakers.
 
HUIZINGA: Oh, wow!

00:21:58.520 --> 00:22:01.190
BALI: Yeah. That's like more than Welsh, …

00:22:01.190 --> 00:22:02.710
HUIZINGA: Yeah! [LAUGHS]

00:22:02.710 --> 00:22:07.200
BALI: … right? But because socio-politically,&nbsp;
they have been—or economically, they have&nbsp;&nbsp;

00:22:07.200 --> 00:22:12.400
been marginalized, they do not have the&nbsp;
resources to build technologies. And,&nbsp;&nbsp;

00:22:12.400 --> 00:22:21.000
you know, when we say empower everyone and&nbsp;
we only empower the top tier, I don't think&nbsp;&nbsp;

00:22:21.000 --> 00:22:29.040
we fulfill our ambition to empower everyone. And&nbsp;
like I said earlier, for these communities, all&nbsp;&nbsp;

00:22:29.040 --> 00:22:36.200
the technology that we have, digital tools that we&nbsp;
have access to, they really matter for them. So,&nbsp;&nbsp;

00:22:36.200 --> 00:22:43.880
for example, you know, a lot of government schemes&nbsp;
or the forest reserve laws are provided, say,&nbsp;&nbsp;

00:22:43.880 --> 00:22:50.940
in Hindi. If they are provided in Gondi, these&nbsp;
people have a real idea of what they can do.

00:22:50.940 --> 00:22:51.240
HUIZINGA: Yeah. … Sure.

00:22:51.240 --> 00:22:56.360
BALI: Similarly, for education, you know, there&nbsp;
are books and books and books in Hindi. There's&nbsp;&nbsp;

00:22:56.360 --> 00:23:03.300
no book available for Gondi. So how is the next&nbsp;
generation even going to learn the language?

00:23:03.300 --> 00:23:04.080
HUIZINGA: Right.

00:23:04.080 --> 00:23:09.040
BALI: And there are many, many languages&nbsp;
which are low resource. In fact, you know,&nbsp;&nbsp;

00:23:09.040 --> 00:23:14.920
we did a study sometime in 2020, I think, we&nbsp;
published this paper on linguistic diversity,&nbsp;&nbsp;

00:23:14.920 --> 00:23:22.600
and there we saw that, you know, we divided&nbsp;
languages in five categories, and the top most&nbsp;&nbsp;

00:23:22.600 --> 00:23:27.720
which have all the resources to build every&nbsp;
possible technology have only five languages,&nbsp;&nbsp;

00:23:27.720 --> 00:23:35.720
right. And more than half of the world's languages&nbsp;
are at the bottom. So it is a big problem.

00:23:35.720 --> 00:23:39.160
HUIZINGA: Yeah. Let's talk about some of&nbsp;
the specific technologies you're working&nbsp;&nbsp;

00:23:39.160 --> 00:23:44.600
on. And I want to go from platform to&nbsp;
project because you've got a big idea&nbsp;&nbsp;

00:23:44.600 --> 00:23:47.580
in a platform you call VeLLM. Talk about that.

00:23:47.580 --> 00:23:51.920
BALI: So VeLLM, which actually&nbsp;
means jaggery—the sweet,&nbsp;&nbsp;

00:23:51.920 --> 00:23:56.470
sugary jaggery—in Tamil, one&nbsp;
of the languages in India …

00:23:56.470 --> 00:24:00.880
HUIZINGA: Let me, let me interject that it's not&nbsp;
vellum like the paper, or what you're talking&nbsp;&nbsp;

00:24:00.880 --> 00:24:07.140
about. It's capital V, little e, and then&nbsp;
LLM, which stands for large language model?

00:24:07.140 --> 00:24:11.520
BALI: So universal, the “V”&nbsp;
comes from there. Empowerment,&nbsp;&nbsp;

00:24:11.520 --> 00:24:14.990
“e” comes from there. Through&nbsp;
large language models …

00:24:14.990 --> 00:24:17.820
HUIZINGA: Got it. OK. But&nbsp;
you shortened it to VeLLM.

00:24:17.820 --> 00:24:18.440
BALI: Yeah.

00:24:18.440 --> 00:24:19.220
HUIZINGA: OK.

00:24:19.220 --> 00:24:26.240
BALI: So, so the thing with VeLLM is that a bunch&nbsp;
of us got together just when this whole GPT was&nbsp;&nbsp;

00:24:26.240 --> 00:24:30.920
released, etc. We have a very strong group&nbsp;
that works on technologies for empowerment&nbsp;&nbsp;

00:24:30.920 --> 00:24:38.040
in the India lab, Microsoft Research India. And&nbsp;
we got together to see what it is that we can&nbsp;&nbsp;

00:24:38.040 --> 00:24:46.320
do now that we have access to such a strong&nbsp;
and powerful tool. And we started thinking&nbsp;&nbsp;

00:24:46.320 --> 00:24:52.960
of the work that we've been doing, which is to,&nbsp;
you know, build these technologies for specific&nbsp;&nbsp;

00:24:52.960 --> 00:24:58.680
areas and specific languages, specific&nbsp;
demographies. So we, kind of, put all&nbsp;&nbsp;

00:24:58.680 --> 00:25:06.920
that knowledge and all that experience we had and&nbsp;
thought of like, how can we scale that, really,&nbsp;&nbsp;

00:25:06.920 --> 00:25:15.320
across everything that we do? So VeLLM, at its&nbsp;
base, you know, takes a GPT-like LLM, you know,&nbsp;&nbsp;

00:25:15.320 --> 00:25:22.840
as a horizontal across everything. On top of it,&nbsp;
we have again, horizontals of machine learning,&nbsp;&nbsp;

00:25:22.840 --> 00:25:31.560
of multilingual tools and processes, which allow&nbsp;
us to take the outputs from, say, GPT-like things&nbsp;&nbsp;

00:25:31.560 --> 00:25:37.920
and adapt it to different languages or,&nbsp;
you know, some different kind of domain,&nbsp;&nbsp;

00:25:37.920 --> 00:25:45.640
etc. And then we have verticals on top of it,&nbsp;
which allow people to build specific applications.

00:25:45.640 --> 00:25:51.560
HUIZINGA: Let me just go back and say GPT&nbsp;
… I think most of our audience will know&nbsp;&nbsp;

00:25:51.560 --> 00:25:57.000
that that stands for generative pretrained&nbsp;
transformer models. But just so we have that&nbsp;&nbsp;

00:25:57.000 --> 00:26:03.994
for anyone who doesn't know, let's anchor that.&nbsp;
So VeLLM basically was an enabling platform …

00:26:03.994 --> 00:26:04.226
BALI: Yes.

00:26:04.226 --> 00:26:07.640
HUIZINGA: … on which to build&nbsp;
specific technologies that&nbsp;&nbsp;

00:26:07.640 --> 00:26:11.589
would solve problems in a vertical application.

00:26:11.589 --> 00:26:13.520
BALI: Yes. Yes. And because it's a platform,&nbsp;&nbsp;

00:26:13.520 --> 00:26:17.386
we're also working on tools&nbsp;
that are needed across domains …

00:26:17.386 --> 00:26:17.910
HUIZINGA: Oh, interesting.

00:26:17.910 --> 00:26:21.160
BALI: … as well as tools that&nbsp;
are needed for specific domains.

00:26:21.160 --> 00:26:23.840
HUIZINGA: OK, so let's talk&nbsp;
about some of the specifics&nbsp;&nbsp;

00:26:23.840 --> 00:26:27.080
because we could get into the weeds&nbsp;
on the tools that everybody needs,&nbsp;&nbsp;

00:26:27.080 --> 00:26:31.360
but I like the ideas that you're working on&nbsp;
and the specific needs that you're meeting,&nbsp;&nbsp;

00:26:31.360 --> 00:26:38.200
the felt-need thing that gets an idea going.&nbsp;
So talk about this project that you've worked&nbsp;&nbsp;

00:26:38.200 --> 00:26:43.520
on called Kahani. Could you explain what that is,&nbsp;
and how it works? It’s really interesting to me.

00:26:43.520 --> 00:26:47.520
BALI: So Kahani, actually, is about storytelling,&nbsp;&nbsp;

00:26:47.520 --> 00:26:53.200
culturally appropriate storytelling, with&nbsp;
spectacular images, as well as like textual story.

00:26:53.200 --> 00:26:54.320
HUIZINGA: So visual storytelling?

00:26:54.320 --> 00:27:01.760
BALI: Visual storytelling with the text. So this&nbsp;
actually started when my colleague Sameer Segal,&nbsp;&nbsp;

00:27:01.760 --> 00:27:09.640
he was trying to use generative AI to create&nbsp;
stories for his daughter, and he discovered that,&nbsp;&nbsp;

00:27:09.640 --> 00:27:13.720
you know, things are not very culturally&nbsp;
appropriate! So I'll give an example that,&nbsp;&nbsp;

00:27:13.720 --> 00:27:19.560
you know, if you want to take Frozen and take&nbsp;
it to, like, the south Indian state of Kerala,&nbsp;&nbsp;

00:27:19.560 --> 00:27:24.480
you'll have the beaches of Kerala,&nbsp;
you'll have even have the coconut trees,&nbsp;&nbsp;

00:27:24.480 --> 00:27:30.652
but then you will have this blond&nbsp;
princess in a princess gown …

00:27:30.652 --> 00:27:30.670
HUIZINGA: Sure …

00:27:30.670 --> 00:27:35.480
BALI: … who's there, right? So that's where&nbsp;
we started discussing this, and we, kind of,&nbsp;&nbsp;

00:27:35.480 --> 00:27:42.720
started talking about, how can we create visuals&nbsp;
that are anchored on text of a story that's&nbsp;&nbsp;

00:27:42.720 --> 00:27:49.080
culturally appropriate? So when we're talking&nbsp;
about, say, Little Red Riding Hood, if we ask the&nbsp;&nbsp;

00:27:49.080 --> 00:27:55.640
generative AI model, OK, that I want the story of&nbsp;
Little Red Riding Hood but in an Indian context,&nbsp;&nbsp;

00:27:55.640 --> 00:28:01.360
it does a fantastic job. It actually gives&nbsp;
you a very nice story, which, you know,&nbsp;&nbsp;

00:28:01.360 --> 00:28:10.240
just reflects the Red Riding Hood story into an&nbsp;
Indian context. But the images don't really …

00:28:10.240 --> 00:28:10.270
HUIZINGA: Match … [LAUGHTER]

00:28:10.270 --> 00:28:16.360
BALI: … Match at all. So that's where the whole&nbsp;
Kahani thing started. And we did a hackathon&nbsp;&nbsp;

00:28:16.360 --> 00:28:21.040
project on it. And then a lot of people&nbsp;
got interested. It's an ongoing project,&nbsp;&nbsp;

00:28:21.040 --> 00:28:25.360
so I won't say that it's out there&nbsp;
yet, but we are very excited about it,&nbsp;&nbsp;

00:28:25.360 --> 00:28:30.920
but because think of it, we can actually&nbsp;
create stories for children, you know,&nbsp;&nbsp;

00:28:30.920 --> 00:28:35.040
which is what we started with, but&nbsp;
we can create so much more media,&nbsp;&nbsp;

00:28:35.040 --> 00:28:42.040
so much more culturally appropriate storytelling,&nbsp;
which is not necessarily targeted at children.

00:28:42.040 --> 00:28:42.834
HUIZINGA: Yeah, yeah.

00:28:42.834 --> 00:28:44.500
BALI: So that's what Kahani is about.

00:28:44.500 --> 00:28:49.960
HUIZINGA: OK. And I saw a demo of it that&nbsp;
your colleague did for Research Forum here,&nbsp;&nbsp;

00:28:49.960 --> 00:28:52.960
and there was an image of a girl—it was&nbsp;&nbsp;

00:28:52.960 --> 00:28:57.340
beautiful—and then there was a mask&nbsp;
of some kind or a … what was that?

00:28:57.340 --> 00:29:03.280
BALI: So the mask is called Nazar Battu,&nbsp;
which is actually, you have these masks&nbsp;&nbsp;

00:29:03.280 --> 00:29:10.560
which are supposed to drive away the evil eye.&nbsp;
So that's what the mask was about. It's a very&nbsp;&nbsp;

00:29:10.560 --> 00:29:14.400
Indian thing. You know, when you build a&nbsp;
nice house, you put one on top of it so&nbsp;&nbsp;

00:29:14.400 --> 00:29:20.520
that the envious glances are, like, kept&nbsp;
at bay. So, yeah, so that's what it was.

00:29:20.520 --> 00:29:22.960
HUIZINGA: And was there some issue of&nbsp;&nbsp;

00:29:22.960 --> 00:29:25.640
the generative AI not really&nbsp;
understanding what that was?

00:29:25.640 --> 00:29:28.040
BALI: No, it didn't understand what it was.

00:29:28.040 --> 00:29:30.820
HUIZINGA: So then can you fix that&nbsp;
and make it more culturally aware?

00:29:30.820 --> 00:29:36.600
BALI: So that's what we are trying to do for the&nbsp;
image thing. So we have another project on culture&nbsp;&nbsp;

00:29:36.600 --> 00:29:44.251
awareness where we are looking at understanding&nbsp;
how much generative AI knows about other cultures.

00:29:44.251 --> 00:29:44.274
HUIZINGA: Interesting.

00:29:44.274 --> 00:29:48.480
BALI: So that's a simultaneous project&nbsp;
that's happening. But in Kahani,&nbsp;&nbsp;

00:29:48.480 --> 00:29:52.678
a lot of it is, like, trying to&nbsp;
get reference images, you know …

00:29:52.678 --> 00:29:52.714
HUIZINGA: Yeah. … Into the system?

00:29:52.714 --> 00:29:53.880
BALI: Into the system …
HUIZINGA: Gotcha …

00:29:53.880 --> 00:29:58.040
BALI: … and trying to anchor on that.

00:29:58.040 --> 00:30:02.320
HUIZINGA: Mmmm. So—and we're not going to talk&nbsp;
about that project, I don't think—but … how do&nbsp;&nbsp;

00:30:02.320 --> 00:30:07.988
you assess whether an AI knows? By just asking&nbsp;
it? By prompting and seeing what happens?

00:30:07.988 --> 00:30:14.040
BALI: Yeah, yeah, yeah. So in another project,&nbsp;
what we did was, we asked humans to play a game&nbsp;&nbsp;

00:30:14.040 --> 00:30:20.320
to get cultural artifacts from them. The problem&nbsp;
with asking humans what cultural artifacts are&nbsp;&nbsp;

00:30:20.320 --> 00:30:25.658
important to them is we don't think of like&nbsp;
things as culture, right. [LAUGHS] This is food!

00:30:25.658 --> 00:30:26.420
HUIZINGA: It’s just who we are!

00:30:26.420 --> 00:30:30.840
BALI: This is my food. Like,&nbsp;
you know, it’s not a culturally&nbsp;&nbsp;

00:30:30.840 --> 00:30:36.310
important artifact. This is how I greet&nbsp;
my parents. It’s not like culturally …

00:30:36.310 --> 00:30:39.000
HUIZINGA: So it's just like fish swimming&nbsp;
in water. You don't see the water.

00:30:39.000 --> 00:30:44.160
BALI: Exactly. So we gamified this thing, and&nbsp;
we were able to get certain cultural artifacts,&nbsp;&nbsp;

00:30:44.160 --> 00:30:49.120
and we tried to get generative AI models to tell&nbsp;&nbsp;

00:30:49.120 --> 00:30:54.550
us about the same artifacts. And&nbsp;
it didn't do too well … [LAUGHS]

00:30:54.550 --> 00:30:55.620
HUIZINGA: But that's why it's research!

00:30:55.620 --> 00:30:56.280
BALI: Yes!

00:30:56.280 --> 00:31:02.760
HUIZINGA: You try, you iterate, you try again …&nbsp;
cool. As I mentioned earlier, I was a high school&nbsp;&nbsp;

00:31:02.760 --> 00:31:07.960
English teacher and an English major. I'm not&nbsp;
correcting your grammar because it's fantastic.

00:31:07.960 --> 00:31:09.200
BALI: Thank you.

00:31:09.200 --> 00:31:14.000
HUIZINGA: But as a former educator, one of the&nbsp;
projects I felt was really compelling that you're&nbsp;&nbsp;

00:31:14.000 --> 00:31:19.440
working on is called Shiksha. It's a copilot&nbsp;
in education. Tell our audience about this.

00:31:19.440 --> 00:31:25.720
BALI: So this is actually our proof of concept&nbsp;
for the VeLLM platform. Since almost all of us&nbsp;&nbsp;

00:31:25.720 --> 00:31:31.720
were interested in education, we decided to&nbsp;
go for education as the first use case that&nbsp;&nbsp;

00:31:31.720 --> 00:31:39.280
we're going to work on. And actually, it was&nbsp;
a considered decision to go target teachers&nbsp;&nbsp;

00:31:39.280 --> 00:31:44.880
instead of students. I mean, you must have seen&nbsp;
a lot of work being done on taking generative AI&nbsp;&nbsp;

00:31:44.880 --> 00:31:53.040
to students, right. But we feel that, you know,&nbsp;
teachers are necessary to teach because they're&nbsp;&nbsp;

00:31:53.040 --> 00:31:58.920
not just giving you information about the&nbsp;
subject. They're giving you skills to learn,&nbsp;&nbsp;

00:31:58.920 --> 00:32:05.000
which hopefully will stay with you for a&nbsp;
lifetime, right. And if we enable teachers,&nbsp;&nbsp;

00:32:05.000 --> 00:32:13.400
they will enable so many hundreds of students.&nbsp;
One teacher can enable thousands of students,&nbsp;&nbsp;

00:32:13.400 --> 00:32:18.680
right, over her career. So instead of,&nbsp;
like, going and targeting students,&nbsp;&nbsp;

00:32:18.680 --> 00:32:24.160
if we make it possible for teachers to&nbsp;
do their jobs more effectively or, like,&nbsp;&nbsp;

00:32:24.160 --> 00:32:29.920
you know, help them get over the problems&nbsp;
they have, then we are actually creating&nbsp;&nbsp;

00:32:29.920 --> 00:32:36.720
an ecosystem where things will scale really&nbsp;
fast, really quickly. And in India, you know,&nbsp;&nbsp;

00:32:36.720 --> 00:32:43.280
this is especially true because the government has&nbsp;
actually come up with some digital resources for&nbsp;&nbsp;

00:32:43.280 --> 00:32:50.840
teachers to use, but there's a lot more that can&nbsp;
be done. So we interviewed about a hundred-plus&nbsp;&nbsp;

00:32:50.840 --> 00:32:56.219
teachers across different parts of the country.&nbsp;
And this is the, you know, discover part.

00:32:56.219 --> 00:32:56.234
HUIZINGA: Yeah!

00:32:56.234 --> 00:33:01.310
BALI: And we found out that lesson&nbsp;
plans are a big headache! [LAUGHS]

00:33:01.310 --> 00:33:03.700
HUIZINGA: Yes, they are! Can confirm!

00:33:03.700 --> 00:33:07.360
BALI: Yeah. And they spend a lot&nbsp;
of time doing lesson plans because&nbsp;&nbsp;

00:33:07.360 --> 00:33:11.790
they're required to create a lesson&nbsp;
plan for every class they teach …

00:33:11.790 --> 00:33:13.360
HUIZINGA: Sure. With learning outcomes …

00:33:13.360 --> 00:33:14.386
BALI: Exactly.

00:33:14.386 --> 00:33:14.680
HUIZINGA: All of it.

00:33:14.680 --> 00:33:20.360
BALI: All of it. So that's where we, you&nbsp;
know, zeroed in on—how to make it easier&nbsp;&nbsp;

00:33:20.360 --> 00:33:27.240
for teachers to create lesson plans. And that's&nbsp;
what the Shiksha project is about. You know,&nbsp;&nbsp;

00:33:27.240 --> 00:33:30.680
there is an enrollment process where the&nbsp;
teachers say what subject they’re teaching,&nbsp;&nbsp;

00:33:30.680 --> 00:33:34.731
what classes they’re teaching, what boards,&nbsp;
because there are different boards of education …

00:33:34.731 --> 00:33:34.750
HUIZINGA: Right …

00:33:34.750 --> 00:33:41.160
BALI: … which have different syllabus. So all&nbsp;
that. But after that, it takes less than seven&nbsp;&nbsp;

00:33:41.160 --> 00:33:48.560
minutes for a teacher to create an entire lesson&nbsp;
plan for a particular topic. You know, class&nbsp;&nbsp;

00:33:48.560 --> 00:33:56.520
assignments, class activities, home assignments,&nbsp;
homework—everything! Like the whole thing in seven&nbsp;&nbsp;

00:33:56.520 --> 00:34:01.760
minutes! And these teachers have the ability to go&nbsp;
and correct it. Like, it's an interactive thing.&nbsp;&nbsp;

00:34:01.760 --> 00:34:08.340
So, you know, they might say, I think this&nbsp;
activity is too difficult for my students.

00:34:08.340 --> 00:34:09.160
HUIZINGA: Yeah …

00:34:09.160 --> 00:34:15.000
BALI: Can I have, like, an easier one? Or, can&nbsp;
I change this to this? So it allows them to&nbsp;&nbsp;

00:34:15.000 --> 00:34:22.600
interactively personalize, modify the plan that's&nbsp;
put out. And I find that really exciting. And&nbsp;&nbsp;

00:34:22.600 --> 00:34:28.320
we've tested this with the Sikshana Foundation,&nbsp;
which works with teachers in India. We've tested&nbsp;&nbsp;

00:34:28.320 --> 00:34:33.120
this with them. The teachers are very excited and&nbsp;
now Sikshana wants to scale it to other schools.

00:34:33.120 --> 00:34:38.400
HUIZINGA: Right … well, my first question is,&nbsp;
where were you when I was teaching, Kalika?

00:34:38.960 --> 00:34:40.960
BALI: There was no generative AI!

00:34:40.960 --> 00:34:47.040
HUIZINGA: No. In fact, we just discovered the fax&nbsp;
machine when I started teaching. Oh, that dates&nbsp;&nbsp;

00:34:47.040 --> 00:34:55.640
me! You know, back to what you said about teachers&nbsp;
being instrumental in the lives of their students.&nbsp;&nbsp;

00:34:55.640 --> 00:35:01.160
You know, we can remember our favorite teachers,&nbsp;
our best teachers. We don't remember a machine.

00:35:01.160 --> 00:35:02.000
BALI: No.

00:35:02.000 --> 00:35:08.760
HUIZINGA: And what you've done with this is to&nbsp;
embody the absolute sort of pinnacle of what&nbsp;&nbsp;

00:35:08.760 --> 00:35:15.960
AI can do, which is to be the collaborator,&nbsp;
the assistant, the augmenter, and the helper&nbsp;&nbsp;

00:35:15.960 --> 00:35:22.960
so that the teacher can do that inspirational,&nbsp;
connective-tissue job with the students without&nbsp;&nbsp;

00:35:22.960 --> 00:35:27.800
having to, like, sacrifice the rest of their&nbsp;
life making lesson plans and grading papers. Oh,&nbsp;&nbsp;

00:35:27.800 --> 00:35:35.200
my gosh. OK. On the positive side, we've just&nbsp;
talked about what this work proposes and how&nbsp;&nbsp;

00:35:35.200 --> 00:35:41.040
it's good, but I always like to dig a little bit&nbsp;
into the potential unintended consequences and&nbsp;&nbsp;

00:35:41.040 --> 00:35:45.600
what could possibly go wrong if, in fact, you&nbsp;
got everything right. So I'll anchor this in&nbsp;&nbsp;

00:35:45.600 --> 00:35:52.880
another example. When GPT models first came out,&nbsp;
the first reaction came from educators. It feels&nbsp;&nbsp;

00:35:52.880 --> 00:35:57.440
like we're in a bit of a paradigm shift like we&nbsp;
were when the calculator and the internet even&nbsp;&nbsp;

00:35:57.440 --> 00:36:03.360
came out. [It’s] like, how do we process this? So&nbsp;
I want to go philosophically here and talk about&nbsp;&nbsp;

00:36:03.360 --> 00:36:10.400
how you foresee us adopting and moving forward&nbsp;
with generative AI in education, writ large.

00:36:10.400 --> 00:36:16.640
BALI: Yeah, I think this is a question that&nbsp;
troubles a lot of us and not just in education,&nbsp;&nbsp;

00:36:16.640 --> 00:36:20.630
but in all spheres that generative AI is …

00:36:20.630 --> 00:36:21.849
HUIZINGA: Art …

00:36:21.849 --> 00:36:21.866
BALI: … art …

00:36:21.866 --> 00:36:22.670
HUIZINGA: … writing …

00:36:22.670 --> 00:36:23.066
BALI: … writing …

00:36:23.066 --> 00:36:24.154
HUIZINGA: … journalism …

00:36:24.154 --> 00:36:31.720
BALI: Absolutely. And I think the way I, kind&nbsp;
of, think about it in my head is it's a tool.&nbsp;&nbsp;

00:36:31.720 --> 00:36:38.400
At the end of it, it is a tool. It's a&nbsp;
very powerful tool, but it is a tool,&nbsp;&nbsp;

00:36:38.400 --> 00:36:47.080
and humans must always have the agency over it.&nbsp;
And we need to come up, as a society, you know,&nbsp;&nbsp;

00:36:47.080 --> 00:36:52.920
we need to come up with the norms of using&nbsp;
the tool. And if you think about it, you know,&nbsp;&nbsp;

00:36:52.920 --> 00:37:01.640
internet, taking internet as an example, there is&nbsp;
a lot of harm that internet has propagated, right.&nbsp;&nbsp;

00:37:01.640 --> 00:37:08.400
The darknet and all the other stuff that happens,&nbsp;
right. But on the whole, there are regulations,&nbsp;&nbsp;

00:37:08.400 --> 00:37:17.492
but there are also an actual consensus around what&nbsp;
constitutes the positive use of internet, right.

00:37:17.492 --> 00:37:17.514
HUIZINGA: Sure, yeah.

00:37:17.514 --> 00:37:20.270
BALI: Nobody says that, for&nbsp;
example, deepfakes are …

00:37:20.270 --> 00:37:21.470
HUIZINGA: Mm-hmm. Good …

00:37:21.470 --> 00:37:29.240
BALI: … good, right. So we have to come from there&nbsp;
and think about what kind of regulations we need&nbsp;&nbsp;

00:37:29.240 --> 00:37:34.280
to have in place, what kind of consensus&nbsp;
we need to have in place, what's missing.

00:37:34.280 --> 00:37:38.360
HUIZINGA: Right. Another&nbsp;
project that has been around,&nbsp;&nbsp;

00:37:38.360 --> 00:37:43.720
and it isn't necessarily on top of VeLLM, but&nbsp;
it's called Karya, and you call it a social&nbsp;&nbsp;

00:37:43.720 --> 00:37:48.480
impact organization that serves not just&nbsp;
one purpose, but three. Talk about that.

00:37:48.480 --> 00:37:54.760
BALI: Oh, Karya is my favorite! [LAUGHS] So Karya&nbsp;
started as a research project within Microsoft&nbsp;&nbsp;

00:37:54.760 --> 00:38:00.080
Research India, and this was the brainchild again&nbsp;
of my colleague—I have like some of the most&nbsp;&nbsp;

00:38:00.080 --> 00:38:07.440
amazing colleagues, too, that I work with!—called&nbsp;
Vivek Seshadri. And Vivek wanted to create,&nbsp;&nbsp;

00:38:07.440 --> 00:38:13.840
you know, digital work for people who do not&nbsp;
have access to such work. So he wanted to go&nbsp;&nbsp;

00:38:13.840 --> 00:38:20.640
to the rural communities, to people who belong&nbsp;
to slightly lower socioeconomic demographies,&nbsp;&nbsp;

00:38:20.640 --> 00:38:30.760
and provide work, like microtasks kind of work,&nbsp;
gig work, to them. And he was doing this, and then&nbsp;&nbsp;

00:38:30.760 --> 00:38:37.760
we started talking, and I said, you know, we need&nbsp;
so much data for all these languages and all these&nbsp;&nbsp;

00:38:37.760 --> 00:38:44.440
different tasks, and that could be, like, a really&nbsp;
cool thing to try on Karya, and that's where it&nbsp;&nbsp;

00:38:44.440 --> 00:38:51.560
all started, my involvement with Karya, which is&nbsp;
still pretty strong. And Karya then became such a&nbsp;&nbsp;

00:38:51.560 --> 00:38:58.320
stable project that Microsoft Research India&nbsp;
spun it out. So it's now its own standalone&nbsp;&nbsp;

00:38:58.320 --> 00:39:05.040
startup right now like a social enterprise,&nbsp;
and they work on providing digital work. They&nbsp;&nbsp;

00:39:05.040 --> 00:39:11.440
work on providing skills, like upskilling.&nbsp;
They work on awareness, like, you know,&nbsp;&nbsp;

00:39:12.160 --> 00:39:19.760
making people aware of certain social, financial,&nbsp;
other such trainings. So what's been most amazing&nbsp;&nbsp;

00:39:19.760 --> 00:39:28.080
is that Karya has been able to essentially&nbsp;
collect data for AI in the most ethical way&nbsp;&nbsp;

00:39:28.080 --> 00:39:37.520
possible. They pay their workers a little over&nbsp;
the minimal wage. They also have something called&nbsp;&nbsp;

00:39:37.520 --> 00:39:44.920
data ownership practice, where the data that&nbsp;
is created by, say, me, I have some sort of&nbsp;&nbsp;

00:39:44.920 --> 00:39:53.336
ownership on it. So what that means is that every&nbsp;
time Karya sells a dataset, a royalty comes back …

00:39:53.336 --> 00:39:53.354
HUIZINGA: No … !

00:39:53.354 --> 00:39:55.360
BALI: Yeah! To the workers.

00:39:55.360 --> 00:40:00.440
HUIZINGA: OK, we need to scale this out!&nbsp;
[LAUGHS] OK. So to give a concrete example,&nbsp;&nbsp;

00:40:00.440 --> 00:40:06.760
the three purposes would be educational,&nbsp;
financial—on their end—and data collection,&nbsp;&nbsp;

00:40:06.760 --> 00:40:11.120
which would ultimately support a low-resource&nbsp;
language by having digital assets.

00:40:11.120 --> 00:40:11.630
BALI: Absolutely!

00:40:11.630 --> 00:40:15.535
HUIZINGA: So you could give somebody&nbsp;
something to read in their language …

00:40:15.535 --> 00:40:15.546
BALI: Yeah.

00:40:15.546 --> 00:40:18.880
HUIZINGA: … that would educate them in&nbsp;
the process. They would get paid to do it,&nbsp;&nbsp;

00:40:18.880 --> 00:40:20.320
and then you would have this data.

00:40:20.320 --> 00:40:21.250
BALI: Yes!

00:40:21.250 --> 00:40:25.400
HUIZINGA: OK. So cool. So simple.

00:40:25.400 --> 00:40:28.400
BALI: Like I said, it's my favorite project.

00:40:28.400 --> 00:40:30.760
HUIZINGA: I get that. I totally get that.

00:40:30.760 --> 00:40:37.480
BALI: And they … they’ve been, you know, they&nbsp;
have been winning awards and things all over&nbsp;&nbsp;

00:40:37.480 --> 00:40:44.120
for the work that they're doing right now. And&nbsp;
I am very involved in one project with them,&nbsp;&nbsp;

00:40:44.120 --> 00:40:51.400
which is to do with gender-intentional AI, or&nbsp;
gender-intentional datasets for AI, for Indian&nbsp;&nbsp;

00:40:51.400 --> 00:40:57.240
languages. And that's really crucial because,&nbsp;
you know, we talk about gender bias in datasets,&nbsp;&nbsp;

00:40:57.800 --> 00:41:03.480
etc., but all that understanding comes&nbsp;
from a very Western perspective and for&nbsp;&nbsp;

00:41:03.480 --> 00:41:07.840
languages like English, etc. They do not&nbsp;
translate very well to Indian languages.

00:41:07.840 --> 00:41:08.720
HUIZINGA: Right.

00:41:08.720 --> 00:41:12.920
BALI: And in this particular&nbsp;
project, we're looking at first,&nbsp;&nbsp;

00:41:12.920 --> 00:41:20.840
how to define gender bias. How do&nbsp;
we even get data around gender bias?&nbsp;&nbsp;

00:41:20.840 --> 00:41:24.880
What does it even mean to say that&nbsp;
technology is gender intentional?

00:41:24.880 --> 00:41:29.800
HUIZINGA: Right. All right, well,&nbsp;
let's talk a little bit about what&nbsp;&nbsp;

00:41:29.800 --> 00:41:34.600
I like to call outrageous ideas. And&nbsp;
these are the ones that, you know,&nbsp;&nbsp;

00:41:34.600 --> 00:41:40.080
on the research spectrum from sort of really&nbsp;
practical applied research to blue sky get&nbsp;&nbsp;

00:41:40.080 --> 00:41:45.200
dismissed or viewed as unrealistic or&nbsp;
unattainable. So years ago—here's a&nbsp;&nbsp;

00:41:45.200 --> 00:41:49.280
little story about you—when you told your&nbsp;
tech colleagues that you wanted to work&nbsp;&nbsp;

00:41:49.280 --> 00:41:54.500
with the world's most marginalized languages,&nbsp;
they told you you'd only marginalize yourself.

00:41:54.500 --> 00:41:55.040
BALI: Yes!

00:41:55.040 --> 00:41:58.560
HUIZINGA: But you didn't say&nbsp;
no. You didn't say no. Um,&nbsp;&nbsp;

00:41:58.560 --> 00:42:04.720
two questions. Did you feel like your own&nbsp;
idea was outrageous back then? And do you&nbsp;&nbsp;

00:42:04.720 --> 00:42:09.664
still have anything outrageous&nbsp;
yet to accomplish in this plan?

00:42:09.664 --> 00:42:16.120
BALI: Oh, yeah! I hope so! Yeah. No, I do think,&nbsp;
in some sense, the pushback that I got for my idea&nbsp;&nbsp;

00:42:16.120 --> 00:42:20.800
makes me think it was outrageous. I didn't think&nbsp;
it was outrageous at all at that time! [LAUGHS] I&nbsp;&nbsp;

00:42:20.800 --> 00:42:28.560
thought it was a very reasonable idea! But there&nbsp;
was a very solid pushback and not just from your&nbsp;&nbsp;

00:42:28.560 --> 00:42:34.520
colleagues. You know, for researchers, publishing&nbsp;
papers is important! No one would publish a paper&nbsp;&nbsp;

00:42:34.520 --> 00:42:39.160
which focused only on, say, Indian languages&nbsp;
or low-resource languages. We've come a very&nbsp;&nbsp;

00:42:39.160 --> 00:42:45.920
long way even in the research community on that,&nbsp;
right. We kept pushing, pushing, pushing! And now,&nbsp;&nbsp;

00:42:45.920 --> 00:42:53.000
there are tracks, there are workshops, there are&nbsp;
conferences which are devoted to multilingual and&nbsp;&nbsp;

00:42:53.000 --> 00:42:59.880
low-resource languages. When I said I wanted to&nbsp;
work on Hindi, and Hindi is the biggest language&nbsp;&nbsp;

00:42:59.880 --> 00:43:07.040
in India, right. And even for that, I was told,&nbsp;
why don't you work on German instead? And I’m&nbsp;&nbsp;

00:43:07.040 --> 00:43:12.177
like, there are lots of people working on German&nbsp;
who will solve the problems with German! Nobody&nbsp;&nbsp;

00:43:12.177 --> 00:43:16.920
is looking at Hindi! I mean, people should work on&nbsp;
all the languages. People should work on German,&nbsp;&nbsp;

00:43:16.920 --> 00:43:22.640
but I don't want to work on German! So&nbsp;
there was a lot of pushback back then,&nbsp;&nbsp;

00:43:22.640 --> 00:43:28.600
and I see a little bit of that with the very&nbsp;
low-resource languages even now. And I think&nbsp;&nbsp;

00:43:28.600 --> 00:43:34.840
some people think it's a “feel-good” thing,&nbsp;
whereas I think it's not. I think it's a very&nbsp;&nbsp;

00:43:34.840 --> 00:43:43.320
economically viable, necessary thing to build&nbsp;
technology for these communities, for these&nbsp;&nbsp;

00:43:43.320 --> 00:43:50.590
languages. No one thought Hindi was economically&nbsp;
viable 15 years ago, for whatever reason …

00:43:50.590 --> 00:43:52.154
HUIZINGA: That … that floors me …

00:43:52.154 --> 00:43:55.600
BALI: Yeah, but, you know,&nbsp;
we're not talking about tens&nbsp;&nbsp;

00:43:55.600 --> 00:43:59.720
of thousands of people in some of these&nbsp;
languages; we're talking about millions.

00:43:59.720 --> 00:44:00.480
HUIZINGA: Yeah.

00:44:00.480 --> 00:44:06.000
BALI: I still think that is a job that I&nbsp;
need to continue, you know, pushing back on.

00:44:06.000 --> 00:44:08.400
HUIZINGA: Do you think that any of that sort of&nbsp;&nbsp;

00:44:08.400 --> 00:44:13.280
outrageous reaction was due to the fact&nbsp;
that the technology wasn't as advanced&nbsp;&nbsp;

00:44:13.280 --> 00:44:16.720
as it is now and that it might have&nbsp;
changed in terms of what we can do?

00:44:16.720 --> 00:44:22.320
BALI: There was definitely the aspect of&nbsp;
technology there that it was just quite&nbsp;&nbsp;

00:44:22.320 --> 00:44:27.800
difficult and very, very resource-intensive&nbsp;
to build it for languages which did not have&nbsp;&nbsp;

00:44:27.800 --> 00:44:32.440
resources. You know, there was a time when we&nbsp;
were talking about how to go about doing this,&nbsp;&nbsp;

00:44:32.440 --> 00:44:39.080
and because people in various big tech companies,&nbsp;
people did not really remember a time when,&nbsp;&nbsp;

00:44:39.080 --> 00:44:45.640
for English, they had to start data collection&nbsp;
from scratch because everyone who was working on,&nbsp;&nbsp;

00:44:45.640 --> 00:44:52.000
say, English at that time was building on what&nbsp;
people had done years and years ago. So they&nbsp;&nbsp;

00:44:52.000 --> 00:44:58.600
could not even conceptualize that you had to start&nbsp;
from scratch for anything, right. But now with the&nbsp;&nbsp;

00:44:58.600 --> 00:45:06.280
technology as well, I'm quite optimistic and&nbsp;
trying to think of how cool it would be to do,&nbsp;&nbsp;

00:45:06.280 --> 00:45:11.920
you know, smaller data collections and fine-tuned&nbsp;
models specifically and things like that,&nbsp;&nbsp;

00:45:11.920 --> 00:45:19.501
so I think that the technology is definitely one&nbsp;
big thing, but economics is a big factor, too.

00:45:19.501 --> 00:45:23.560
HUIZINGA: Mmm-hmm. Well, I'm glad that&nbsp;
you said it isn't just the feel good,&nbsp;&nbsp;

00:45:23.560 --> 00:45:27.560
but it actually would make economic sense&nbsp;
because that's some of the driver behind&nbsp;&nbsp;

00:45:27.560 --> 00:45:33.800
what technologies get “greenlit,” as it&nbsp;
were. Is there anything outrageous now&nbsp;&nbsp;

00:45:33.800 --> 00:45:38.874
that you could think of that, even to you,&nbsp;
sounds like, oh, we could never do that …

00:45:38.874 --> 00:45:43.990
BALI: Well … I didn't think HAL was&nbsp;
outrageous, so I’m not … [LAUGHS]

00:45:43.990 --> 00:45:45.914
HUIZINGA: Back to HAL 9000! [LAUGHS]

00:45:45.914 --> 00:45:50.760
BALI: Yeah, so I don't think of things&nbsp;
as outrageous or not. I just think of&nbsp;&nbsp;

00:45:50.760 --> 00:45:55.320
things as things that need to get&nbsp;
done, if that makes any sense?

00:45:55.320 --> 00:46:00.680
HUIZINGA: Totally. Maybe it's, how do&nbsp;
we override “Open the pod bay door,&nbsp;&nbsp;

00:46:00.680 --> 00:46:03.754
HAL”—“No, I'm sorry, Dave.&nbsp;
I can't do that”? [LAUGHS]

00:46:03.754 --> 00:46:05.820
BALI: Yes. [LAUGHS] Yeah…

00:46:05.820 --> 00:46:10.280
HUIZINGA: Well, as we close—and I'm sad to&nbsp;
close because you are so much fun—I want&nbsp;&nbsp;

00:46:10.280 --> 00:46:15.040
to do a little vision casting, but in&nbsp;
reverse. So let's fast-forward 20 years&nbsp;&nbsp;

00:46:15.040 --> 00:46:22.400
and look back. How have the big ideas behind&nbsp;
your life's work impacted the world, and how&nbsp;&nbsp;

00:46:22.400 --> 00:46:28.240
are people better off or different now because&nbsp;
of you and the teams that you've worked with?

00:46:28.240 --> 00:46:36.040
BALI: So the way I see it is that people&nbsp;
across the board, irrespective of the&nbsp;&nbsp;

00:46:36.040 --> 00:46:43.240
language they speak, the communities they&nbsp;
belong to, the demographies they represent,&nbsp;&nbsp;

00:46:43.240 --> 00:46:49.920
can use technology to make their lives,&nbsp;
their work, better. I know it sounds like&nbsp;&nbsp;

00:46:49.920 --> 00:46:56.840
really a very big and almost too good to&nbsp;
be true, but that's what I'm aiming for.

00:46:56.840 --> 00:47:01.760
HUIZINGA: Well, Kalika Bali, I'm&nbsp;
so grateful I got to talk to you&nbsp;&nbsp;

00:47:01.760 --> 00:47:06.600
in person. And thanks for taking time&nbsp;
out from your busy trip from India to&nbsp;&nbsp;

00:47:06.600 --> 00:47:11.061
sit down with me and our audience&nbsp;
and share your amazing ideas.

00:47:11.061 --> 00:47:11.074
[MUSIC PLAYS]

00:47:11.074 --> 00:47:19.320
BALI: Thank you so much, Gretchen.

00:47:19.320 --> 00:47:26.840
[MUSIC FADES]

