WEBVTT

00:00:00.791 --> 00:00:01.347
[MUSIC]

00:00:01.347 --> 00:00:08.800
AMBER TINGLE: Welcome to Abstracts, a Microsoft&nbsp;
Research Podcast that puts the spotlight on&nbsp;&nbsp;

00:00:08.800 --> 00:00:18.040
world-class research—in brief. I'm Amber Tingle.&nbsp;
In this series, members of the research community&nbsp;&nbsp;

00:00:18.040 --> 00:00:28.186
at Microsoft give us a quick snapshot—or a podcast&nbsp;
abstract—of their new and noteworthy papers.

00:00:28.186 --> 00:00:28.690
[MUSIC FADES]

00:00:28.690 --> 00:00:35.400
Our guests today are Shrey Jain and Zoë Hitzig.&nbsp;
Shrey is a product manager at Microsoft, and&nbsp;&nbsp;

00:00:35.400 --> 00:00:43.400
Zoë is a research scientist at OpenAI. They are&nbsp;
two of the corresponding authors on a new paper,&nbsp;&nbsp;

00:00:43.400 --> 00:00:50.080
“Personhood credentials: Artificial intelligence&nbsp;
and the value of privacy-preserving tools to&nbsp;&nbsp;

00:00:50.080 --> 00:00:57.600
distinguish who is real online.” This exploratory&nbsp;
research comprises multidisciplinary collaborators&nbsp;&nbsp;

00:00:57.600 --> 00:01:07.080
from across industry, academia, and the civil&nbsp;
sector. The paper is available now on arXiv. Shrey&nbsp;&nbsp;

00:01:07.080 --> 00:01:13.428
and Zoë, thank you so much for joining us, and&nbsp;
welcome back to the Microsoft Research Podcast.

00:01:13.428 --> 00:01:15.323
SHREY JAIN: Thank you. We're happy to be back.

00:01:15.323 --> 00:01:15.827
ZOË HITZIG: Thanks so much.

00:01:15.827 --> 00:01:20.280
TINGLE: Shrey, let's start with a brief overview&nbsp;
of your paper. Why is this research important,&nbsp;&nbsp;

00:01:20.280 --> 00:01:22.860
and why do you think this is&nbsp;
something we should all know about?

00:01:22.860 --> 00:01:28.120
JAIN: Malicious actors have been exploiting&nbsp;
anonymity as a way to deceive others online.&nbsp;&nbsp;

00:01:28.120 --> 00:01:35.360
And historically, deception has been viewed as&nbsp;
this unfortunate but necessary cost as a way&nbsp;&nbsp;

00:01:35.360 --> 00:01:43.120
to preserve the internet's commitment to privacy&nbsp;
and unrestricted access to information. And today,&nbsp;&nbsp;

00:01:43.120 --> 00:01:48.000
AI is changing the way we should think about&nbsp;
malicious actors' ability to be successful&nbsp;&nbsp;

00:01:48.000 --> 00:01:52.480
in those attacks. It makes it easier to&nbsp;
create content that is indistinguishable&nbsp;&nbsp;

00:01:52.480 --> 00:01:58.000
from human-created content, and it is possible&nbsp;
to do so in a way that is only getting cheaper&nbsp;&nbsp;

00:01:58.000 --> 00:02:03.720
and more accessible. And so this paper aims&nbsp;
to address a countermeasure to protect against&nbsp;&nbsp;

00:02:03.720 --> 00:02:10.720
AI-powered deception at scale while also&nbsp;
protecting privacy. And I think the reason&nbsp;&nbsp;

00:02:10.720 --> 00:02:17.720
why people should care about this problem is for&nbsp;
two reasons. One is it can very soon become very&nbsp;&nbsp;

00:02:17.720 --> 00:02:22.480
logistically annoying to deal with these various&nbsp;
different types of scams that can occur. I think&nbsp;&nbsp;

00:02:22.480 --> 00:02:25.760
we've all been susceptible to different&nbsp;
types of attacks or scams that, you know,&nbsp;&nbsp;

00:02:25.760 --> 00:02:32.640
people have had. But now these scams are going to&nbsp;
become much more persuasive and effective. And so&nbsp;&nbsp;

00:02:32.640 --> 00:02:37.520
for various different recovery purposes, it can&nbsp;
become very challenging to get access back to your&nbsp;&nbsp;

00:02:37.520 --> 00:02:42.800
accounts or rebuild your reputation that someone&nbsp;
may damage online. But more importantly, there's&nbsp;&nbsp;

00:02:42.800 --> 00:02:47.200
also very dangerous things that can happen. Kids&nbsp;
might not be safe online anymore. Or our ability&nbsp;&nbsp;

00:02:47.200 --> 00:02:53.360
to communicate online for democratic processes. A&nbsp;
lot of the way in which we shape political views&nbsp;&nbsp;

00:02:53.360 --> 00:02:59.960
today happens online. And that's also at risk.&nbsp;
And in response to that, we propose in this&nbsp;&nbsp;

00:02:59.960 --> 00:03:07.200
paper a solution titled personhood credentials.&nbsp;
Personhood credentials enable people to prove&nbsp;&nbsp;

00:03:07.200 --> 00:03:12.000
that they are in fact a real person without&nbsp;
revealing anything more about themselves online.

00:03:12.000 --> 00:03:15.080
TINGLE: Zoë, walk us through&nbsp;
what's already been done in&nbsp;&nbsp;

00:03:15.080 --> 00:03:19.560
this field, and what's your unique&nbsp;
contribution to the literature here?

00:03:19.560 --> 00:03:28.000
HITZIG: I see us as intervening on two separate&nbsp;
bodies of work. And part of what we're doing in&nbsp;&nbsp;

00:03:28.000 --> 00:03:35.600
this paper is bringing together those two bodies&nbsp;
of work. There's been absolutely amazing work for&nbsp;&nbsp;

00:03:35.600 --> 00:03:42.840
decades in cryptography and in security. And what&nbsp;
cryptographers have been able to do is to figure&nbsp;&nbsp;

00:03:42.840 --> 00:03:51.000
out protocols that allow people to prove very&nbsp;
specific claims about themselves without revealing&nbsp;&nbsp;

00:03:51.000 --> 00:03:58.360
their full identity. So when you think about&nbsp;
walking into a bar and the bartender asks you to&nbsp;&nbsp;

00:03:58.360 --> 00:04:06.400
prove that you're over 21—or over 18, depending on&nbsp;
where you are—you typically have to show your full&nbsp;&nbsp;

00:04:06.400 --> 00:04:12.840
driver's license. And now that's revealing a lot&nbsp;
of information. It says, you know, where you live,&nbsp;&nbsp;

00:04:12.840 --> 00:04:19.600
whether you're an organ donor. It's revealing a&nbsp;
lot of information to that bartender. And online,&nbsp;&nbsp;

00:04:19.600 --> 00:04:26.480
we don't know what different service providers&nbsp;
are storing about us. So, you know, the bartender&nbsp;&nbsp;

00:04:26.480 --> 00:04:32.560
might not really care where we live or whether&nbsp;
we're an organ donor. But when we're signing up&nbsp;&nbsp;

00:04:32.560 --> 00:04:40.920
for digital services and we have to show a highly&nbsp;
revealing credential like a driver's license just&nbsp;&nbsp;

00:04:40.920 --> 00:04:48.040
to get access to something, we're giving over&nbsp;
too much information in some sense. And so this&nbsp;&nbsp;

00:04:48.040 --> 00:04:55.520
one body of literature that we're really drawing&nbsp;
on is a literature in cryptography. The idea that&nbsp;&nbsp;

00:04:55.520 --> 00:05:01.480
I was talking about there, where you can prove&nbsp;
privately just isolated claims about yourself,&nbsp;&nbsp;

00:05:01.480 --> 00:05:07.680
that's an idea called an anonymous credential.&nbsp;
It allows you to be anonymous with respect to&nbsp;&nbsp;

00:05:07.680 --> 00:05:12.800
some kind of service provider while still&nbsp;
proving a limited claim about yourself,&nbsp;&nbsp;

00:05:12.800 --> 00:05:20.680
like “I am over 18,” or in the case of personhood&nbsp;
credentials, you prove, “I am a person.” So that's&nbsp;&nbsp;

00:05:20.680 --> 00:05:26.720
all one body of literature. Then there's this huge&nbsp;
other body of literature and set of conversations&nbsp;&nbsp;

00:05:26.720 --> 00:05:36.320
happening in policy circles right now around&nbsp;
what to do about AI. Huge questions abounding.&nbsp;&nbsp;

00:05:36.320 --> 00:05:43.400
Shrey and I have written a prior paper called&nbsp;
“Contextual Confidence and Generative AI,” which&nbsp;&nbsp;

00:05:43.400 --> 00:05:51.920
we talked about on this podcast, as well, and in&nbsp;
that paper, we offered a framework for thinking&nbsp;&nbsp;

00:05:51.920 --> 00:05:59.760
about the specific ways that generative&nbsp;
AI, sort of, threatens the foundations of&nbsp;&nbsp;

00:05:59.760 --> 00:06:09.120
our modes of communication online. And we outlined&nbsp;
about 16 different solutions that could help us&nbsp;&nbsp;

00:06:09.120 --> 00:06:18.480
to solve the coming problems that generative AI&nbsp;
might bring to our online ecosystems. And what we&nbsp;&nbsp;

00:06:18.480 --> 00:06:25.040
decided to do in this paper was focus on a set of&nbsp;
solutions that we thought are not getting enough&nbsp;&nbsp;

00:06:25.040 --> 00:06:33.320
attention in those AI and AI policy circles.&nbsp;
And so part of what this paper is doing is&nbsp;&nbsp;

00:06:33.320 --> 00:06:41.280
bringing together these ideas from this long body&nbsp;
of work in cryptography into those conversations.

00:06:41.280 --> 00:06:44.800
TINGLE: I'd like to know&nbsp;
more about your methodology,&nbsp;&nbsp;

00:06:44.800 --> 00:06:48.460
Shrey. How did your team go&nbsp;
about conducting this research?

00:06:48.460 --> 00:06:55.480
JAIN: So we had a wide range of collaborators from&nbsp;
industry, academia, the civil sector who work on&nbsp;&nbsp;

00:06:55.480 --> 00:07:01.120
topics of digital identity, privacy, advocacy,&nbsp;
security, and AI policy which came together to&nbsp;&nbsp;

00:07:01.120 --> 00:07:06.000
think about, what is the clearest way in which we&nbsp;
can explain what we believe is a countermeasure&nbsp;&nbsp;

00:07:06.000 --> 00:07:10.240
that can protect against AI-powered deception&nbsp;
that, from a technological point of view,&nbsp;&nbsp;

00:07:10.240 --> 00:07:14.080
there's already a large body of work that&nbsp;
we can reference but from a “how this can&nbsp;&nbsp;

00:07:14.080 --> 00:07:19.600
be implemented.” Discussing the tradeoffs that&nbsp;
various different types of academics and industry&nbsp;&nbsp;

00:07:19.600 --> 00:07:24.360
leaders are thinking about. Can we communicate&nbsp;
that very clearly? And so the methodology here&nbsp;&nbsp;

00:07:24.360 --> 00:07:31.320
was really about bringing together a wide range of&nbsp;
collaborators to really bridge these two bodies of&nbsp;&nbsp;

00:07:31.320 --> 00:07:36.560
work together and communicate it clearly—not just&nbsp;
the technical solutions but also the tradeoffs.

00:07:36.560 --> 00:07:42.340
TINGLE: So, Zoë, what are the major findings&nbsp;
here, and how are they presented in the paper?

00:07:42.340 --> 00:07:49.240
HITZIG: I am an economist by training. Economists&nbsp;
love to talk about tradeoffs. You know,&nbsp;&nbsp;

00:07:49.240 --> 00:07:53.840
when you have some of this, it means you have&nbsp;
a little bit less of that. It's kind of like&nbsp;&nbsp;

00:07:53.840 --> 00:08:00.920
the whole business of economics. And a key&nbsp;
finding of the paper, as I see it, is that&nbsp;&nbsp;

00:08:00.920 --> 00:08:09.200
we begin with what feels like a tradeoff, which&nbsp;
is on the one hand, as Shrey was saying, we want&nbsp;&nbsp;

00:08:09.200 --> 00:08:17.920
to be able to be anonymous online because that&nbsp;
has great benefits. It means we can speak truth&nbsp;&nbsp;

00:08:17.920 --> 00:08:26.640
to power. It means we can protect civil liberties&nbsp;
and invite everyone into online spaces. You know,&nbsp;&nbsp;

00:08:26.640 --> 00:08:33.480
privacy is a core feature of the internet. And&nbsp;
at the same time, the, kind of, other side of the&nbsp;&nbsp;

00:08:33.480 --> 00:08:40.760
tradeoff that we're often presented is, well, if&nbsp;
you want all that privacy and anonymity, it means&nbsp;&nbsp;

00:08:40.760 --> 00:08:48.200
that you can't have accountability. There's no way&nbsp;
of tracking down the bad actors and making sure&nbsp;&nbsp;

00:08:48.200 --> 00:08:55.400
that they don't do something bad again. And we're&nbsp;
presented with this tradeoff between anonymity on&nbsp;&nbsp;

00:08:55.400 --> 00:09:03.320
the one hand and accountability on the other hand.&nbsp;
All that is to say, a key finding of this paper,&nbsp;&nbsp;

00:09:03.320 --> 00:09:10.360
as I see it, is that personhood credentials and&nbsp;
more generally this class of anonymous credentials&nbsp;&nbsp;

00:09:10.360 --> 00:09:18.280
that allow you to prove different pieces of&nbsp;
your identity online without revealing your&nbsp;&nbsp;

00:09:18.280 --> 00:09:26.360
entire identity actually allow you to evade&nbsp;
the tradeoff and allow you to, in some sense,&nbsp;&nbsp;

00:09:26.360 --> 00:09:32.320
have your cake and eat it, too. What it allows&nbsp;
us to do is to create some accountability,&nbsp;&nbsp;

00:09:32.320 --> 00:09:42.960
to put back some way of tracing people's digital&nbsp;
activities to an accountable entity. What we also&nbsp;&nbsp;

00:09:42.960 --> 00:09:48.360
present in the paper are a number of different,&nbsp;
sort of, key challenges that will have to be&nbsp;&nbsp;

00:09:48.360 --> 00:09:55.640
taken into account in building any kind of&nbsp;
system like this. But we present all of that,&nbsp;&nbsp;

00:09:55.640 --> 00:10:02.800
all of those challenges going forward, as&nbsp;
potentially very worth grappling with because of&nbsp;&nbsp;

00:10:02.800 --> 00:10:11.760
the potential for this, sort of, idea to allow us&nbsp;
to preserve the internet's commitment to privacy,&nbsp;&nbsp;

00:10:11.760 --> 00:10:18.420
free speech, and anonymity while also&nbsp;
creating accountability for harm.

00:10:18.420 --> 00:10:23.760
TINGLE: So Zoë mentioned some of these&nbsp;
tradeoffs. Let's talk a little bit more&nbsp;&nbsp;

00:10:23.760 --> 00:10:29.680
about real-world impact, Shrey.&nbsp;
Who benefits most from this work?

00:10:29.680 --> 00:10:35.520
JAIN: I think there's many different people that&nbsp;
benefit. One is anyone who's communicating or&nbsp;&nbsp;

00:10:35.520 --> 00:10:40.280
doing anything online in that they can have more&nbsp;
confidence in their interactions. And it, kind of,&nbsp;&nbsp;

00:10:40.280 --> 00:10:44.840
builds back on the paper that Zoë and I wrote last&nbsp;
year on contextual confidence and generative AI,&nbsp;&nbsp;

00:10:44.840 --> 00:10:49.720
which is that we want to have confidence in&nbsp;
our interactions, and in order to do that,&nbsp;&nbsp;

00:10:49.720 --> 00:10:53.640
one component is being able to identify who&nbsp;
you're speaking with and also doing it in a&nbsp;&nbsp;

00:10:53.640 --> 00:10:59.640
privacy-preserving way. And I think another person&nbsp;
who benefits is policymakers. I think today,&nbsp;&nbsp;

00:10:59.640 --> 00:11:03.760
when we think about the language and&nbsp;
technologies that are being promoted,&nbsp;&nbsp;

00:11:03.760 --> 00:11:08.720
this complements a lot of the existing work that's&nbsp;
being done on provenance and watermarking. And I&nbsp;&nbsp;

00:11:08.720 --> 00:11:13.080
think the ability for those individuals to be&nbsp;
successful in their mission, which is creating&nbsp;&nbsp;

00:11:13.080 --> 00:11:19.880
a safer online space, this work can help guide&nbsp;
these individuals to be more effective in their&nbsp;&nbsp;

00:11:19.880 --> 00:11:25.400
mission in that it highlights a technology that&nbsp;
is not currently as discussed comparatively to&nbsp;&nbsp;

00:11:25.400 --> 00:11:30.520
these other solutions and complements them&nbsp;
in order to protect online communication.

00:11:30.520 --> 00:11:37.880
HITZIG: You know, social media is flooded&nbsp;
with bots, and sometimes the problem with&nbsp;&nbsp;

00:11:37.880 --> 00:11:44.400
bots is that they're posting fake content,&nbsp;
but other times, the problem with bots is&nbsp;&nbsp;

00:11:44.400 --> 00:11:49.680
that there are just so many of them and&nbsp;
they're all retweeting each other and it's&nbsp;&nbsp;

00:11:49.680 --> 00:11:55.600
very hard to tell what's real. And so what a&nbsp;
personhood credential can do is say, you know,&nbsp;&nbsp;

00:11:55.600 --> 00:12:02.940
maybe each person is only allowed to have five&nbsp;
accounts on a particular social media platform.

00:12:02.940 --> 00:12:09.080
TINGLE: So, Shrey, what's next on your research&nbsp;
agenda? Are there lingering questions—I know&nbsp;&nbsp;

00:12:09.080 --> 00:12:14.180
there are—and key challenges here, and&nbsp;
if so, how do you hope to answer them?

00:12:14.180 --> 00:12:19.240
JAIN: We believe we've aggregated a strong&nbsp;
set of industry, academic, and, you know,&nbsp;&nbsp;

00:12:19.240 --> 00:12:22.880
civil sector collaborators, but we're only a&nbsp;
small subset of the people who are going to&nbsp;&nbsp;

00:12:22.880 --> 00:12:28.560
be interacting with these systems. And so&nbsp;
the first area of next steps is to gather&nbsp;&nbsp;

00:12:28.560 --> 00:12:32.240
feedback about the proposal of a solution&nbsp;
that we've had and how can we improve that:&nbsp;&nbsp;

00:12:32.240 --> 00:12:35.840
are there tradeoffs that we're missing? Are&nbsp;
there technical components that we weren't&nbsp;&nbsp;

00:12:35.840 --> 00:12:41.240
thinking as deeply through? And I think there's&nbsp;
a lot of narrow open questions that come out of&nbsp;&nbsp;

00:12:41.240 --> 00:12:45.960
this. For instance, how do personhood credentials&nbsp;
relate to existing laws regarding identity theft&nbsp;&nbsp;

00:12:45.960 --> 00:12:53.000
or protection laws? In areas where service&nbsp;
providers can't require government IDs,&nbsp;&nbsp;

00:12:53.000 --> 00:12:59.240
how does that apply to personhood credentials that&nbsp;
rely on government IDs? I think that there's a&nbsp;&nbsp;

00:12:59.240 --> 00:13:04.360
lot of these open questions that we address in&nbsp;
the paper that I think need more experimentation&nbsp;&nbsp;

00:13:04.360 --> 00:13:08.680
and thinking through but also a lot of&nbsp;
empirical work to be done. How do people&nbsp;&nbsp;

00:13:08.680 --> 00:13:12.480
react to personhood credentials, and does&nbsp;
it actually enhance confidence in their&nbsp;&nbsp;

00:13:12.480 --> 00:13:17.920
interactions online? I think that there's a lot&nbsp;
of open questions on the actual effectiveness of&nbsp;&nbsp;

00:13:17.920 --> 00:13:21.340
these tools. And so I think there's a large&nbsp;
area of work to be done there, as well.

00:13:21.340 --> 00:13:26.680
HITZIG: I've been thinking a lot about the early&nbsp;
days of the internet. I wasn't around for that,&nbsp;&nbsp;

00:13:26.680 --> 00:13:35.200
but I know that every little decision that&nbsp;
was made in a very short period of time had&nbsp;&nbsp;

00:13:35.200 --> 00:13:41.040
incredibly lasting consequences that we're&nbsp;
still dealing with now. There's an enormous&nbsp;&nbsp;

00:13:41.040 --> 00:13:48.520
path dependence in every kind of technology. And I&nbsp;
feel that right now, we're in that period of time,&nbsp;&nbsp;

00:13:48.520 --> 00:13:55.080
the small window where generative AI is this new&nbsp;
thing to contend with, and it's uprooting many of&nbsp;&nbsp;

00:13:55.080 --> 00:14:02.920
our assumptions about how our systems can work&nbsp;
or should work. And I'm trying to think about&nbsp;&nbsp;

00:14:02.920 --> 00:14:10.880
how to set up those institutions, make these&nbsp;
tiny decisions right so that in the future&nbsp;&nbsp;

00:14:10.880 --> 00:14:18.704
we have a digital architecture that's really&nbsp;
serving the goals that we want it to serve.

00:14:18.704 --> 00:14:22.600
[MUSIC]
TINGLE: Very thoughtful. With that, Shrey Jain,

00:14:22.600 --> 00:14:25.560
Zoë Hitzig, thank you so&nbsp;
much for joining us today.

00:14:25.560 --> 00:14:27.300
HITZIG: Thank you so much, Amber.

00:14:27.300 --> 00:14:32.040
TINGLE: And thanks to our listeners,&nbsp;
as well. If you'd like to learn more&nbsp;&nbsp;

00:14:32.040 --> 00:14:37.680
about Shrey and Zoë's work on personhood&nbsp;
credentials and advanced AI, you'll find&nbsp;&nbsp;

00:14:37.680 --> 00:14:45.720
a link to this paper at aka.ms/abstracts, or&nbsp;
you can read it on arXiv. Thanks again for&nbsp;&nbsp;

00:14:45.720 --> 00:14:59.040
tuning in. I'm Amber Tingle, and we hope&nbsp;
you'll join us next time on Abstracts.

00:14:59.040 --> 00:15:00.013
[MUSIC FADES]