Interactive Transcript
0:02
Hello, and welcome to today's Noon Conference co-presented
0:05
by MRI online and A A WR.
0:08
The A A WR was founded in 1981 to provide a form
0:12
for issues unique to women in radiology, radiation oncology,
0:15
and related professions.
0:17
The association sponsors programs that promote opportunities
0:20
for women and facilitates networking among members
0:24
and other professionals.
0:25
They have membership opportunities for those
0:27
who have completed their training.
0:29
Members in training
0:30
and international radiologist learn more about their mission
0:33
and membership@a.org.
0:36
We're thrilled to partner with a WR on these lectures
0:39
as part of our shared commitment to advancing
0:41
and supporting women in radiology
0:43
and transforming the way radiologists learn and thrive.
0:46
Today, we are honored to welcome Dr.
0:48
Bach Dewan for a lectured entitled AI at the Heart
0:51
of Breast Imaging Innovations and Insights.
0:54
Dr. Dewan is a clinical professor of radiology
0:57
and Eugene BP Frankel Endowed Scholar in Clinical Medicine
1:01
at UT Southwestern Medical Center, where she serves
1:05
as a member of its breast imaging division
1:07
and the Director of Breast Imaging Research.
1:10
She earned her me medical degree in Ankara, Turkey
1:13
and completed her residency training in diagnostic radiology
1:16
at Inca University Medical School, followed
1:19
by Breast Imaging Fellowship at University of Texas,
1:22
MD Anderson Cancer Center in Houston.
1:25
At the end of the lecture, please join her in a q
1:27
and a session where she will address questions you may
1:29
have on today's topic.
1:31
Please remember to use the q
1:32
and a feature to submit your questions so we can get to
1:35
as many as we can before our time is up.
1:37
With that, we are ready to begin today's lecture. Dr.
1:40
Dwan, please take it from here.
1:43
Thank you very much for hosting me
1:45
to share my perspective on artificial
1:48
intelligence and breast imaging.
1:49
Where are we now and where are we going?
1:51
I hope we will have a robust discussion
1:54
after, after the talk.
1:57
All right, so these are my disclosures for this talk.
2:02
I'm gonna start with, uh, a brief timeline of how we came
2:07
to be where we were at in, uh, breast imaging.
2:10
Um, so AI was actually born, uh,
2:13
back in 1956 in Dartmouth College,
2:17
and the initial attempts at computer, uh,
2:19
based image analysis dates back to 1960s.
2:23
And, um, very quickly, uh, development of CAD
2:28
for mammography started in 1980s and nineties,
2:31
and in 1998, FDA approved CAD for mammography, uh,
2:36
which was R two IME checker at the time.
2:40
And then, um, early on, the study suggested
2:43
that this CAD improved cancer detection rates by two to 10%.
2:47
So there were very promising,
2:49
but later down the line, uh,
2:51
subsequent data emerged suggesting that CAD is maybe not
2:55
that helpful in clinical practice.
2:58
Parallel to that, we had, um, development of, uh,
3:05
first applications of convoluted neural networks
3:08
to mammography, which first, uh,
3:10
focused on mass identification.
3:13
And later on in, uh, late two thousands and early 2000
3:17
and tens, advanced methodology
3:20
and, uh, availability of especially digital mammography
3:24
and hardware led to renewed interest in, um,
3:28
CNN architectures and deep learning.
3:31
And over the past decade, really the application
3:34
of deep learning to detection, diagnosis, tasks,
3:37
and mammography and other breast imaging
3:39
modalities absolutely exploded.
3:42
Uh, this includes segmentation, le lesion identification
3:46
and classification of, uh, masses
3:49
and microcalcifications in addition to, uh,
3:52
breast cancer risk prediction of patients
3:54
and density determination.
3:58
And, uh, one of the key, uh,
4:00
events in this timeline is the dream challenge,
4:03
which occurred, uh, in 2016 2017,
4:08
and the front runners really led the artificial intelligence
4:12
applications in mammography.
4:14
And, uh, since, uh, late 2000 tens,
4:19
multiple studies now suggest
4:20
that deep learning can enhance the diagnostic accuracy.
4:25
So when we say, uh, ai, there's so many uses.
4:28
I mean, it's infiltrated our lives in so many ways.
4:31
Uh, when you're scrolling in internet, um,
4:34
if you click on different, um, areas of interest,
4:37
you will see that, uh, the ads actually start showing you
4:41
what, uh, you're interested in.
4:43
And, you know, that's really AI at work, just trying
4:46
to determine your areas of interest.
4:48
When you're in Netflix, it will show you, um, you know,
4:52
your, the, the movies
4:53
that you were interested in in the past.
4:55
It'll show you more of that.
4:56
So, um, this actually goes to, uh, different aspects
5:01
of breast imaging in terms of appointment scheduling
5:04
and fight coordination, dose regulation, image positioning,
5:07
and quality assurance.
5:08
But today, uh, I just want to focus on narrow ai,
5:13
which is detection, diagnosis, triage, segmentation,
5:17
treatment response prediction,
5:19
and risk prediction, which are really tasks related to, uh,
5:24
what we radiologists do in our daily lives.
5:30
Um, a little bit on tech technical terms of, uh,
5:34
artificial intelligence, uh, terminology, machine learning.
5:39
Uh, you may have heard this before, um, machine learning
5:42
or models that, that derive parameters through being trained
5:46
by existing data, meaning we are showing
5:49
the computer what to find.
5:51
And this was essentially what conventional CAD was based on.
5:55
Um, you provide an input
5:58
and then it basically starts to try to find that input in,
6:02
uh, subsequent, uh, images.
6:05
Artificial neural networks
6:07
or is, is a model framework, which consists
6:09
of interconnected units
6:10
or neurons organized into different layers.
6:13
So there's, um, prioritization
6:16
of certain information over others.
6:19
For example, um, patients over the age of 40 are more likely
6:23
to develop breast cancer.
6:25
So, um, and a NN may actually prioritize patient age over
6:30
other findings that it may find on the image.
6:33
Deep learning refers to, uh,
6:36
these artificial neural networks that features
6:39
multiple intermediate or hidden layers
6:41
that we don't necessarily teach the, um, the,
6:45
uh, the computer.
6:46
So, uh, the most widely used, uh, deep learning architecture
6:50
for image classification, operating,
6:53
bypassing filters over on input
6:55
to extract these higher order features
6:58
that are highly predictive of.
6:59
What we're looking for is called
7:01
convolutional neural network.
7:03
So, um, so these are the different types of AI that, uh,
7:08
we use or that are tested in breast imaging.
7:14
Um, so I've been practicing breast imaging since 2001.
7:18
And, you know, since then, we're not foreigners to CAD
7:23
or computer aided detection.
7:25
So when I, I pressed on the CAD function,
7:28
I would see something like this, the triangles, um,
7:32
and the, the stars here would indicate mass
7:36
or asymmetries
7:37
or focal asymmetries on the triangles would indicate,
7:40
you know, calcifications.
7:42
So, uh, but there would be
7:44
multiple such annotations on a mammogram,
7:46
which could get sometimes very confusing.
7:50
So this is an example of, uh, a patient that we've seen at
7:54
that time, see down there, actually she has a mass,
7:57
I circled it, but this was not even, uh, found by the CAT
8:01
or indicated to me to look at by the cat.
8:05
So CAT has been FDA approved since 1998.
8:09
And by 2018, 92%
8:11
of screening mammograms were being interpreted using cad.
8:15
'cause that was, uh, mainly
8:16
because there was an additional code for it,
8:18
and we could bill separately for it.
8:20
But despite initial optimism, the largest, uh, studies, uh,
8:26
down the line failed
8:27
to demonstrate a very significant improvement in diagnostic
8:31
performance with cad.
8:33
Um, this is, uh, one of those landmark studies by, um,
8:38
Lehman and colleagues, which was published in JAMA
8:42
that showed really, um, no improvement in sensitivity,
8:46
specificity or cancer detection rate with cad.
8:48
Um, if you look up there, the cancers detected
8:51
for a thousand exams with CAD was 4.1 in a thousand.
8:55
And without cad, it was exactly the same.
8:58
Um, and same thing with invasive cancers.
9:01
There was some marginal improvements
9:03
with di carcinoma situ detected with cad.
9:07
Um, but if you look at the overall sensitivity, uh,
9:10
it was 85% with cad, 87% without cad.
9:14
And this was based on large national data
9:18
provided by breast cancer civilian consortium.
9:22
Um, so it was, it was pretty, uh, symptomatic of
9:26
what was going on nationwide.
9:29
So why was that happening?
9:31
The traditional CAD systems rely on detection of
9:35
hand engineered features such,
9:38
or, for example, masses, uh, defined by expert knowledge.
9:42
So they were delineated by radiologists at the outset.
9:46
So it primarily functions as a spell checker.
9:50
Uh, what does that mean? It, it,
9:52
it means really we have no hope of, uh, thinking that
9:56
Cath would actually find a cancer that we haven't defined
10:00
to it at the outset when it was being trained,
10:03
and there was no ongoing training in, in other words,
10:07
it didn't self-train as it was seeing more and more cases.
10:11
Um, well, how can deep learning, uh, be beneficial here?
10:16
Deep learning models learn directly from the data without
10:19
the need for explicit delineation
10:22
or feature engineering by the radiologists.
10:24
So they have the potential to learn from features
10:27
that are unseen
10:29
and unknown by radiologists,
10:31
so potentially they can improve the
10:33
radiologist's performance.
10:37
Um, and breast imaging was the per poster child for that,
10:40
because first of all,
10:42
we produced large scale categorical outcomes.
10:46
We have been using birads for many decades
10:49
before it was, it's implemented for other, um,
10:54
organ systems, really.
10:56
And, uh, we generated a large amount of data that is, um,
11:01
audited every year.
11:02
So we were actually recording our outcomes.
11:06
So we, uh, naturally became the most suitable
11:10
first applicators to train, validate,
11:12
and implement robust AI algorithms.
11:16
Um, this is a summary of early studies of deep learning,
11:21
so don't worry, I'm not gonna go over all of them.
11:22
But, um, just in summary, the, uh,
11:27
performance of even the very earliest applications
11:31
of deep learning were quite successful,
11:34
and they were pretty close to radiologist performance.
11:38
Um, I, I mentioned the dream, um, trial
11:42
or the dream challenge, which was, uh, also, uh,
11:47
funded by IBM
11:49
and lots of, um, other technology, uh, pioneers at the time.
11:54
And, uh, those were the, that's really what opened, uh,
11:58
the door to really wide scale application of ai.
12:01
And these are the, uh, the outcomes of the green challenge.
12:04
There were, um, three, um, top contenders.
12:09
Um, there were data sets submitted from both North America
12:13
and Sweden, and it seemed that the, um,
12:17
the performance a UC was, um, pretty good overall.
12:21
And when compared to single radiologists, um, this is,
12:26
um, which is, uh, Kaiser Permanente
12:29
and, uh, the consensus evaluation, uh, the performances
12:33
of those were, uh, pretty close, not quite at the level
12:37
of the radiologist performance,
12:38
but they were pretty close to the radiologist.
12:41
So, uh, no single algorithm reached the published, uh,
12:45
US Community Radiologist average, which at the time
12:49
was benchmarks were, uh, determined, um,
12:53
in, in a recent study.
12:55
Uh, and, and,
12:56
but an ensemble of AI algorithms combined
12:59
with a single radiologist assessment was associated
13:02
with an improved overall mammography performance over
13:05
a single radiologist alone.
13:08
What does an ensemble of AI algorithms means?
13:10
It's a combination of, um, the top performing algorithms.
13:15
So it wasn't a single one,
13:17
but multiple ones that was combined that, uh, was able
13:21
to improve a single radiologist assessments performance.
13:28
And, um, in a reader study of 14 radiologists, actually,
13:32
they just recently showed that, uh, reading
13:35
with AI support increased the main a UC by 0.2% as well.
13:40
So there's a marginal improvement,
13:42
but a, not a huge amount of improvement for, uh,
13:46
for single AI use.
13:49
So, um, can we actually, uh, do superhuman performance?
13:54
Meaning can we, can AI alone do better than the radiologist?
13:59
So, um, there are different reading algorithms, uh,
14:03
implemented in different countries.
14:06
So the settings are different in that in Europe, there's,
14:10
uh, they implement more, uh, double reading,
14:12
meaning a radiologist reads the, uh, mammogram first,
14:16
and the recalls are over read by a second, usually a more,
14:22
uh, experience radiologists.
14:24
And, um, most of the time they're on called
14:28
and in, in the USA, uh,
14:31
we have a single radiologist setting.
14:33
So we have to really test
14:36
how this fits into these two different scenarios.
14:39
So when they looked at that, uh,
14:42
a deep learning based ensemble algorithm, uh, they,
14:47
it did demonstrate superior sensitivity
14:49
and specificity compared to
14:51
the original interpreting radiologists.
14:54
Uh, there was about a 10% increase
14:57
and a, uh, a 5% increase in specificity
15:01
and a 10% increase in sensitivity in the US data.
15:05
And, uh, also, AI outperformed radiologists
15:10
with an overall increase in both
15:12
sensitivity and specificity.
15:14
But if you, if you look at the improvements,
15:16
these were mainly in the single radiologist setting.
15:21
When you look at the double radiologist setting, uh,
15:24
the red.here represents, uh, the consensus reading.
15:28
Um, and the, the, um,
15:33
green dot represents the single radiologist.
15:36
So there was really, uh, 1% improvement in specificity
15:40
and 2% in sensitivity here,
15:42
whereas the consensus was much better than, um,
15:46
both the single reader and the AI here.
15:49
So it depends on how we are reading it.
15:52
So it seems like, uh,
15:54
if there's a single radiologist reading
15:57
AI will be a huge help, uh, which is relevant to us, uh,
16:01
practicing in the, the us.
16:05
So we talked just about mammography.
16:07
What about AI performance and ultrasound?
16:11
Breast ultrasound received, uh,
16:13
less attention than mammography in the literature, uh,
16:16
because it's, um, it's more user dependent
16:20
and, um, equipment dependent.
16:22
And, um, you know, how you scan a patient can make a huge
16:25
difference in the imaging findings.
16:28
But the available data suggests
16:30
that DL can achieve satisfactory performance in
16:33
classification of masses seen on breast ultrasound.
16:37
There are fewer, uh, published comparison studies here,
16:40
but, um, deep learning for breast ultrasound,
16:44
it looks like it may outperform the traditional CAD
16:47
approach, and, uh, performs comparably
16:50
to human radiologists.
16:53
And again, these are, um,
16:55
the published literature on the subject as well.
16:57
But I just want to focus on one really large, uh, study
17:02
which showed, which was a reader study of deep learning
17:05
for benign versus cancer classification versus three
17:08
human readers.
17:10
And the deep learning, uh, achieved a uc of 0.84,
17:15
uh, versus zero point 89, 89,
17:17
and 0.79 for human readers,
17:20
which was pretty much equivalent.
17:22
So this could be really relevant in centers that, um,
17:27
do screening ultrasound for, uh,
17:30
for breast cancer as a supplement.
17:33
What about breast MRI?
17:35
The diagnostic AI development is similarly complicated
17:39
by a 3D problem space
17:41
because we have multiple sequences there to think about
17:45
and the, uh, the dynamic setting.
17:48
So which, um,
17:51
phase in the dynamic setting is the most valuable here,
17:54
and do we discard the rest?
17:55
So, um, for that reason, most classification, uh,
17:59
studies have focused on the subtraction series
18:03
or the early subtraction series
18:05
and targeted classification of 2D or 3D ROIs.
18:10
Um, so comparison studies are rare,
18:12
but again, um, the more recent studies show
18:15
that deep learning may outperform other machine learning
18:19
approaches, while the results
18:20
for the human comparisons are more mixed here.
18:25
Um, so for example, I chose this
18:27
because it's, it's, uh, more representative of, um,
18:32
the what's published in the literature.
18:35
There is a comparison of both radios as well as the humans.
18:40
And, uh, the comparison between the deep learning radios
18:44
and radiologists for classification of lesions.
18:47
On dynamic MRI, it showed that, uh,
18:50
deep learning outperforms the radis models,
18:53
but not quite the radiologist.
18:55
Uh, the red line here shows the radiologist performance,
18:59
whereas the deep learning is the green line.
19:02
So those are two are most close to each other,
19:05
and, uh, the radios approach didn't do quite as well.
19:12
So we have the data out there,
19:15
but how do we actually use it in our clinical practice?
19:19
So one idea is to use AI as second reader,
19:22
and this one is the most popular one, uh,
19:25
AI enterprise the case.
19:27
And then radiologists enterprise the abnormal cases,
19:31
and then radiologists determines the final interpretation.
19:34
BI scan has been shown to improve diagnostic accuracy,
19:37
as I've shown before, and, uh, it would
19:42
translate into double reading the mammogram in
19:45
resource limited settings.
19:47
The other way we can use it is as a triage method.
19:51
AI flags just the abnormal cases.
19:54
Um, you can generate a work list sorted based on
19:58
AI assigned priorities.
20:00
I'm gonna show you an example of that.
20:02
And the radiologist reads cases in order of priority,
20:05
and you can prioritize the negative cases if you like,
20:09
or the, um, highest, uh, probability
20:13
of malignancy cases.
20:15
And, um, theory, this can improve the workflow
20:19
and it can lead to a faster turnaround for protocol results,
20:23
especially if this is done at a mobile setting
20:27
or, uh, in a, in a resource, uh,
20:32
scarce setting where patients have difficulty, uh,
20:35
coming back for their additional workups.
20:38
Another way we can use AI triaging is, uh,
20:42
you can triage negative cases,
20:44
and if you can, um, trust AI sufficiently,
20:48
you can actually just give them a negative report.
20:51
And radiologists can only read the cases flagged
20:55
as abnormal, or you can let AI read all of the cases
21:00
and radiologists reviews only the abnormal
21:04
or uncertain cases, which are not highly, uh,
21:10
ranked as abnormal.
21:11
Um, there's full somewhere in between, um, 40 to 40,
21:15
50% chance of malignancy.
21:18
So in both of these scenarios, um, there's a potential
21:21
to decrease the radiologist workload
21:23
and allow more time for reading positive cases
21:27
and performing other duties for the radiologist.
21:32
Um, so they actually tested the simulation
21:35
of a deep learning triaging scenario,
21:38
and they, uh, this was by a homegrown model
21:42
by MGH
21:43
and MIT, uh, which are really, they, they produce a lot of,
21:48
uh, research on this, on this topic, on large data.
21:53
And, uh, in a retrospective simulation study
21:56
of a triage scenario in which you
21:59
basically give the patient a negative results based on
22:03
just AI prediction.
22:05
Um, then they said, well, 19%
22:08
of the mammograms would've been excluded from review without
22:12
compromising any diagnostic performance
22:15
or missing cancers that would be
22:17
otherwise detected by the radiologist.
22:20
So you have, uh, 90.6%, uh,
22:25
sensitivity with the radiologist and 90.1%.
22:29
So there's a bit of a drop
22:30
with sensitivity there if you are tolerant to that.
22:34
Um, and, uh, about the same specificity, maybe a bit, uh,
22:39
higher specificity.
22:41
And, you know, you could basically OMI reading 19%
22:44
of the mammograms, which doesn't seem like a,
22:48
um, a huge savings.
22:49
But, um, there it is.
22:51
And more recently, uh, AI
22:54
as a standalone mammography reader was tested
22:57
in another homegrown model by, uh, NYU.
23:01
And then they said, well, at least 40%
23:04
of true negative screens could have been read by AI
23:08
with zero basis.
23:09
So this is really, really promising,
23:11
and it seems that the, the higher the number of exams
23:15
that you actually give ai, the better the prediction,
23:20
uh, performance.
23:25
And, uh, so the question here was also I was interested in
23:29
was, um, what about the callbacks?
23:31
Do we have an increase in callbacks?
23:34
And there was actually, um, not an increase in the callbacks
23:38
by AI use, but a decrease of, uh,
23:41
20% if you set the AI triaging to 30%.
23:46
And in the 40% that they were presenting,
23:48
there was an actual savings of the callbacks of about 30%.
23:52
So this was a really, really good forming model.
23:55
Um, so how do we use it in daily practice?
23:59
We do have an integrated, um, ai, um,
24:04
model in mammography model in our, uh, practice.
24:07
And when I first log into my workstation, this is
24:10
what it looks like, and it, on the right,
24:13
you have a case score here.
24:14
These are the screening mammograms of the day.
24:17
And, um, there are the higher case scores of,
24:20
I actually start looking at the higher case scores first,
24:23
so you can, uh, rank them the way you like it, uh,
24:28
order them the way it's ranked,
24:30
and you have the zeros down here.
24:33
So, um, for example, if, um, this is from another day,
24:37
you have these really high ranked, uh,
24:41
screening mammograms.
24:43
Um, I clicked on one of those just to check, uh, the case,
24:48
and it seemed that it actually highlighted these segmental
24:51
calcifications in the right breast,
24:53
which really I would've called back.
24:55
And, uh, since then, the patients had, um, uh,
24:58
a stereotactic biopsy confirming di gluc
25:01
carcinoma CY as well.
25:03
So this is really helpful in getting the more abnormal
25:07
cases read early on
25:09
and working out the patient
25:11
potentially while she's still there.
25:14
So what about, uh, mammograms as predictors of risk?
25:18
This is actually for me a more, uh, exciting area
25:23
of, uh, developments
25:25
because I mean, yes, there's improvements
25:28
and there's triaging available for, um,
25:32
for image interpretation,
25:34
but we could do things, AI can actually do things
25:37
that we can't do with our naked eye.
25:41
So density has long been a big issue
25:45
with us reading screening mammograms, and, um, not just
25:50
because it masks cancer,
25:51
but mammography breast density is associated with
25:55
over two times increased relative risk for cancer in women
25:59
with extremely dense breast.
26:01
Um, and we do have automated quantitative tissue density
26:05
tools that provide objective measurements
26:08
that are commercially and publicly available.
26:10
It's, I think we widely use it,
26:12
but it's not just about quantification.
26:15
And there is, um, density signatures in there
26:19
that actually predict the patient's future breast, uh,
26:22
breast cancer risk,
26:24
the parenchymal complexity features which are independently
26:28
associated with the breast cancer risk.
26:30
This is not about the percent of breast density,
26:33
but the quality
26:35
or the texture of the, um, the breast density,
26:38
which we can't always quantify with our naked eye.
26:42
So, um, this is really, really interesting
26:46
because, uh, it, again, the,
26:48
the very productive group from MIT
26:50
and MGH have come up with, uh, uh,
26:54
predictive, uh, model that used mammograms,
26:59
and they took, uh, patients traditional, uh, risk,
27:04
uh, features, and then they fed that to the model.
27:07
And then they also used the, uh, the normal mammograms.
27:12
And they also used, um, compare that
27:15
with Tyro CIC version eight,
27:17
which is the most common model we use
27:19
to predict the patient's future breast cancer risk.
27:22
And we base our, uh, our decisions as
27:25
to whether patient needs supplemental imaging
27:27
or supplemental intense screening based on
27:30
this clinical model.
27:32
Um, they found that, uh, traditional risk factors
27:35
didn't do very well.
27:38
Um, they were better than tyro kic, though Kuzi did
27:43
not very well at all, uh,
27:44
to predict the patient's five year breast cancer risk.
27:47
But if you looked at mammograms alone, um,
27:51
it actually outperformed both tyro CIC
27:54
and the traditional risk factors.
27:56
Um, and if you actually combine those traditional risk
28:00
factors with imaging that actually perform the best.
28:04
So I think this is very, uh, tra groundbreaking, uh,
28:08
developments in
28:09
that we could do better actually at predicting which
28:12
patients would benefit from, uh, supplemental screening
28:16
or more in intense screening, um, in this patient group.
28:21
Um, and again, uh, you could use AI for,
28:25
for different reasons as well
28:27
besides that of predicting risk.
28:30
So could we predict, uh, the presence
28:34
of external lymph node metastasis?
28:36
So this was, uh, a study that used ultrasound ROI images,
28:40
and, you know, they, they found much, much,
28:44
much better prediction than what you would normally do, uh,
28:48
by naked eye or radiologist evaluation of traditional, um,
28:53
uh, values or parameters that we would use.
28:57
Um, another, um, clinical decision making use for AI is,
29:03
uh, preoperative prediction of pathologic complete response.
29:07
So her two cancers, her two positive cancers, as you know,
29:11
they have a very high response rate.
29:14
Um, but having HER two positive cancer alone could be a huge
29:17
predictor of response,
29:19
but that did only, uh, perform with an a UC of 0.7.
29:24
Uh, but when you added that to imaging based, um,
29:28
or AI based, uh, prediction
29:30
and the HER two status, it certainly improved
29:34
that prediction because this is an area we're not
29:36
very, very good at.
29:38
In, uh, in, in breast imaging, unfortunately,
29:43
we also used it for a prediction of axillary metastasis, um,
29:48
predicting that on breast MRI here in UT Southwestern, uh,
29:52
we developed both a clinical model
29:54
and an imaging based model.
29:56
And, um, so you're seeing our results based on test data,
30:01
um, for clinical node prediction
30:04
and, uh, pathologic node prediction in patients
30:08
who actually had surgery.
30:10
And we had a pretty, uh, respectable outcome with this.
30:14
And more importantly, we didn't really have
30:17
to look at the patient's axilla at all.
30:20
These were just made based on, uh,
30:23
just looking at the tumor characteristics
30:26
or derived from, uh, from MRI features of the tumor alone.
30:32
Um, so when we looked at the, uh, on the,
30:36
on the test set performance, we had a zero point 87.
30:41
Um, again, this means that it's actually applicable, uh,
30:45
clinically and, uh, overall, um, median sensitivity
30:50
of 89%
30:51
and the pretty high specificity of 76%,
30:57
um, key points from that study was
31:01
MRI based hybrid model showed an improvement over
31:06
the 77.6% radiologist sensitivity
31:09
that we had in our institution,
31:12
and we had a 55% specificity in that half
31:16
of our patients actually underwent, um, uh,
31:21
uh, a biopsy.
31:23
But, you know, half
31:25
of them actually had a positive result after that.
31:28
So we were sending patients to biopsy for, um,
31:32
finding not a lot of cancers there.
31:35
Uh, and one of the benefits of the model was scalability.
31:38
So when with all AI models, really, um, if you
31:43
set the specificity rate to 71%,
31:46
our model yielded 91% sense,
31:50
which means we only missed at 9% of the metastasis.
31:56
And if we had implemented our model,
31:59
it would've helped avoid more than health of
32:02
benign operative sentinel lymph node biopsies,
32:04
which unfortunately most of the, uh,
32:07
invasive cancer patients have, while we were able
32:10
to detect 95% of patients with OID metastasis.
32:15
Um, so one of the concerns about using these models is,
32:20
well, I mean, it performs good in your institution,
32:23
but what about if I implemented in my institution,
32:27
how would it do then?
32:28
So we have an advantage here, uh, UT Southwestern
32:32
where we are affiliated in both a tertiary care center,
32:36
the university hospital, and a safety net hospital.
32:41
So we have collected data from both about 400 patients.
32:46
And what we did was we did an exercise in which we developed
32:50
the model in one, uh, hospital's, uh, data,
32:54
which are completely, uh, different, uh,
32:58
patient characteristics, even cancer age, even the types
33:01
of cancers are slightly different in the safety net hospital
33:05
versus the university hospital.
33:08
And we were able to, the performance, see
33:11
that the performance of the date of the models were the same
33:14
when we developed it in one and applied it to the other.
33:17
And that's really what we ought to look at in all, um,
33:21
AI vendors that are, uh, marketing their, um,
33:24
their product out there, how would it perform, um, not in a,
33:28
in a test or a lab environment,
33:30
but in a real life environment.
33:32
So, um, this is the promise,
33:36
and now I'm going to shift gears
33:38
and, uh, explore, you know, what are some of the problems
33:42
that we're dealing with daily, um, clinical implementation.
33:47
So, uh, the legal re responsibility is a big one.
33:50
Uh, who takes the responsibility of the decisions
33:54
of the AI systems hospital vendor
33:57
or the radiologist, um, who would handle the disagreements
34:01
between the AI and the radiologist?
34:04
Um, so if I overruled a lesion flagged by ai,
34:08
what would be the liability of that?
34:10
And if a radiologist cannot identify a specific lesion,
34:14
how will that case will be managed?
34:17
So if ai, you know, flags something,
34:20
and then you work it up,
34:21
and then there's, there's no correlate for that.
34:24
So how would we go about that?
34:26
Um, and the AI detected cases, um,
34:31
so would they lead to more recalls for the radiologist?
34:34
And that's a, that's a a real question
34:37
because there's some data out there that suggests that if,
34:41
uh, AI does the tech cases
34:44
and, you know, in a triaging environment where, uh,
34:47
radiologists are only looking at the positive cases
34:51
and no negative cases, that would actually
34:55
increase our callback rates,
34:56
which are already under fire right now.
35:00
Um, transparency and reliability are also major ones.
35:05
Uh, interpretability
35:06
of an AI algorithm is key in under understanding
35:08
how prediction of are made,
35:11
and some black box models are generated
35:14
directly from the data by an algorithm.
35:15
So we don't know how, uh, the, uh,
35:19
AI model finds these cancers
35:22
because we haven't really shown it how to find it.
35:26
So it is very imperative to make model development code
35:29
and testing data sets accessible to the public.
35:32
And this is also one of the things we ought to look for
35:36
when, uh, we are looking at marketed products.
35:40
There is a true concern about, uh, model drift,
35:43
which is a concept that is
35:48
degradation of a model's prediction power due
35:51
to changes in the environment over time.
35:53
So it's very important to do real time surveillance
35:56
to monitor for unexpected model performances,
36:00
to make sure the algorithm, uh, meets the standards
36:03
and doing exactly what it's supposed to do
36:06
at the outset when you, we are first, uh, installing them.
36:10
And, uh, there's now, uh, uh, a recommendation by a CR to do
36:15
actually, um, a periodic checking on this.
36:20
And this is an example comparing three commercial artificial
36:24
intelligence algorithms for, uh, independent assessment
36:27
of screening mammograms.
36:29
And, you know, these were done at the patient level on non
36:33
annotated mammograms, and the counselors were not localized
36:37
and without insight into why AI decisions were made.
36:41
So we don't actually know that the cancers reported here.
36:46
Were actually the same ones that were, um,
36:50
detected down the line at diagnostic workup.
36:53
There was a, there was a cancer prediction done,
36:56
but it wasn't, we don't know if it's the same one
36:59
as the actual diagnosed cancer.
37:02
It could be elsewhere in the breast,
37:04
or it could be the opposite breast.
37:06
So, um, when vendors are touting their, uh, performances,
37:10
it's very important to be careful
37:12
and look, um, between the lines there.
37:16
So approval and regulation of AI systems, um, this is,
37:21
um, this is a, a developing area.
37:24
Um, there's increased FDA evidentially, uh,
37:28
regulatory standards recently,
37:31
but, um, development
37:33
of more improved course marketing surveillance
37:36
and trials are needed.
37:38
Um, currently, uh, they're regulated as software
37:41
as a medical device, um,
37:44
and which means software intended to be used for one
37:47
or more medical purposes that performs
37:50
these purposes without being part
37:52
of a hardware medical device.
37:53
So that's, that's just the code that they have.
37:57
And, uh, the concern with that is AI use adaptive algorithms
38:01
that are dynamic can change in, respond to new information.
38:06
So adaptive algorithms are very difficult
38:08
to regulate in the current framework
38:11
because the product initially reviewed may be different from
38:14
the one the consumer ultimately encounters.
38:18
So most products are actually locked
38:21
after FDA, um, approval that's obtained.
38:25
So this is good and bad.
38:26
So you can actually look at the data presented to A FDA
38:31
and you know, you can assume
38:33
that's the same performance you're gonna get,
38:35
but, uh, it could be bad
38:37
because there's new data that
38:40
could have developed over time.
38:42
And, um, for example, you know, tomosynthesis
38:45
or synthetic, uh, mammograms.
38:49
Um, so you don't know that they're doing, um,
38:53
as well on that new data
38:55
because there hasn't really been an interval training on
38:58
that, that algorithm.
39:01
So there has been discussions about this where, you know,
39:04
clinical prospective trials, uh, may be needed
39:09
for AI products before clearance or approval,
39:12
but it, it's not in implementation at this point.
39:17
Um, I'm a research director in my institution,
39:20
so I always look at what are the barriers and research.
39:24
Um, we need large, high quality data sets.
39:27
I showed you the, the higher, the more the data,
39:30
the better the algorithm, uh,
39:33
with accurate ground truth is needed, which, um, you know,
39:37
we, I think we have a lot with mammography,
39:39
but not so much with the other, um, uh, modalities.
39:44
So we gotta be very careful with those.
39:47
And the training data sets
39:48
that are often collected from the same institution.
39:52
Uh, and cross validation is now mandatory in
39:55
publication standards.
39:56
I mean, if you go and look at, uh, majority of the, uh,
40:01
respectable, um, journals, they, they request this now.
40:07
So, um, and the cyber attacks, of course, um,
40:12
you need to invest in a strong security infrastructure.
40:15
When we are dealing with these third party vendors.
40:19
Um, there are even more biases that I wanna touch on,
40:22
which is, um, bias is a big one,
40:25
and there may be several biases in these algorithms
40:29
that are developed.
40:31
Representation is a big one.
40:33
Um, FDA clearance of AI products for mammography screening
40:38
did not necessarily report the race
40:40
and ethnicity of the patients included.
40:42
And you know, when you look at the most available training
40:45
data sets used in AI development of vendors,
40:49
they actually contain mostly mammograms of, uh, white women.
40:54
And, uh, this was a very interesting study in JAMA published
40:58
a few years back, and it showed that patient cohorts used
41:01
to train AI algorithms in the us, uh,
41:05
originated mainly from three states, California,
41:09
Massachusetts, and Europe.
41:11
So this is really important when you're looking
41:14
to implement it in your own institution.
41:16
If it's not representative,
41:18
if your population is not representative, then um,
41:22
you may not be, you may, it may not perform as well as, uh,
41:27
what's published out there.
41:30
There are socioeconomic disadvantages.
41:32
There's missing data on disparate
41:34
or socioeconomically disadvantaged populations,
41:37
which may lead to underperformance
41:39
of the AI in the underrepresented individuals.
41:42
And this is primarily why we did that
41:45
exercise in our own data, uh, in our own, uh, publications
41:48
with, uh, lymph node metastasis prediction, uh,
41:52
comparing the safety net versus, um, the CUH,
41:57
because we know we rotate at both sites,
42:00
and we knew that the patient profiles were very different.
42:03
And also our resource poor communities may not be able
42:06
to take advantage of AI's potential patient care benefits
42:10
because they actually lack the resources
42:13
to purchase these commercially available AI products.
42:16
I mean, the promise is always that these, uh,
42:20
will actually help with the disparities
42:22
and, um, you know, where radiologists are not available,
42:25
you can deploy them, but really they're being marketed
42:29
and you, you know, it's, it may not be possible
42:32
for resource poor communities to access these tools.
42:37
Um, this is a very, very interesting, uh, study
42:39
that I wanted to share with you.
42:41
Um, I had the opportunity to actually work with one of the,
42:45
um, PI BIS of this study, um, the,
42:50
and it is on AI recognition
42:53
of patient race in medical imaging.
42:56
So they actually looked at the capability of AI
43:00
to predict patient race without actually giving
43:03
the AI this information.
43:05
And if you look at it, I mean,
43:08
the AI can accurately predict race, um, quite well,
43:12
especially in chest x-ray.
43:14
I was actually gratified to see mammography is kind of
43:17
down there in the rankings, but still pretty high.
43:21
Um, so that actually tells you there is a bias right there
43:25
because AI can actually predict what the patient's, um,
43:30
race is without you realizing that it did.
43:34
Um, again, uh, more implementation challenges,
43:38
translation from proof of concept
43:40
to a commercially available products is a problem.
43:42
I showed you a lot of homegrown, uh,
43:47
AI models, but, um, a lot of them are not actually available
43:51
for patient use at this time.
43:54
And again, recognition of biases that may arise
43:58
with human AI interaction.
43:59
What does that mean? If AI says,
44:02
and a mammogram is negative, I may be more inclined
44:05
to call it negative and ignore, uh, cancer that's,
44:09
I may have otherwise called back,
44:11
or if AI says a mammogram is, uh, has cancer in it,
44:16
I may actually look for something that I haven't,
44:19
otherwise I wouldn't otherwise call back.
44:22
And this is called automation bias.
44:25
And then there is the reimbursement considerations.
44:28
There's, uh, currently no, uh, existing CPT code for,
44:32
uh, AI in mammo.
44:34
There are two category three codes for quantitative analysis
44:37
with ultrasound applicable to ai.
44:40
So we don't know if there will be reimbursement
44:44
and, uh, if that reimbursement happens,
44:48
if it's gonna be a positive reimbursement
44:50
or a negative reimbursement.
44:51
And we can actually maybe chat more about that
44:54
after, um, after the lecture.
44:58
And, uh, incorporation into existing PACS
45:01
and EMR systems requires a substantial, um,
45:05
infrastructure sometimes
45:08
and the real time surveillance to monitor
45:11
for unexpected model performance that may be secondary
45:14
to underlying biases.
45:16
Um, I don't know that a lot
45:18
of institutions are prepared for this.
45:20
I know that we are starting to, um,
45:23
do these routine audits in my institution.
45:25
We're getting ready, but it's really difficult
45:28
to find expertise and allocate time
45:32
and, um, resources to, to this, to make sure
45:36
that the AI is performing as it should, uh,
45:39
not at the outset, but after, uh, multiple years.
45:45
Um, these are the training data sets available
45:47
for breast imaging and AI algorithms.
45:50
Again, this is really relevant, uh,
45:53
to the resources out there, but
45:56
whenever, uh, someone tries to develop an algorithm,
45:58
usually this is the first go-to, uh, resource, they go
46:02
to one of these, um, you know, the high number of image
46:07
data sets, free data sets available,
46:10
and, you know, this is where they start out.
46:15
And lastly, I wanted to touch on what about the patients?
46:19
What do they think? We don't have much data on, um,
46:24
the US population,
46:25
but there was a recent Dutch population which was published
46:29
in JACR.
46:31
And it interestingly said that most patients did not fully,
46:36
did not support fully independent use
46:38
of AI-based screening interpretation,
46:40
meaning they didn't want to just get their results from, uh,
46:44
the computer right after they have their mammograms.
46:48
About, uh, 42% were opposed to the idea of using AI
46:53
to select cases that require a second reading.
46:56
And there was really no agreement on
46:58
who would be responsible for a diagnostic error.
47:01
And, uh, I mean, I think if we polled the US population,
47:06
I don't think we would get very, very different results,
47:09
but this was what the patients thought.
47:13
In closing, I want to say
47:15
that AI could improve radiology accuracy
47:17
and workload by serving as a second reader
47:20
or by excluding the normal findings from human review,
47:24
which could, um, improve our workload.
47:28
Um, it holds tremendous potential to improve the risk
47:30
and prognostic prediction,
47:32
which is really impressive in my opinion.
47:35
However, there are various ethical medicolegal
47:38
and logistical issues pertaining to its full implementation,
47:42
which remain unsolved.
47:44
And we need more high quality research,
47:47
especially prospective data, to see
47:51
how it performs in real life setting
47:54
and not just in the lab setting.
47:58
Uh, radiologists are poised
47:59
to take a prominent role in the ongoing discourse about
48:02
incorporation of AI into clinical practices.
48:05
What I believe, and there are, um,
48:10
in the future, there are, um, opinions that, uh,
48:14
if AI lives up to its potential breast imaging,
48:17
radiologists could have more time
48:18
to pursue the more personally
48:20
and professionally fulfilling aspects of their jobs,
48:23
including a patient interaction collaboration
48:26
with colleagues and teaching of trainees.
48:29
This is a quote from, uh, Dr.
48:31
Smitherman who, uh, brought on the economic impact
48:34
of AI on breast imaging and JBI.
48:37
Um, I, I think that could very much happen,
48:40
but, um, it's all about how we, uh, implemented.
48:46
I wanna thank all of my research team here
48:49
and, um, I,
48:52
we wouldn't have generated our data without their invaluable
48:55
help and also point that, uh, a new really good summary
49:00
that we published on this topic in JBI.
49:04
Thank you very much.
49:06
Thank you so much for that awesome lecture, Dr. Dewan.
49:09
And we will open up the floor for any questions folks have,
49:14
and if you could submit those through the q
49:16
and a feature, that would be great.
49:20
Um, we've got one in there now.
49:22
Um, did AI decrease the workload for prediction
49:26
or does it increase the workload?
49:31
So, um, with prediction, uh, I understand that.
49:37
So when you say prediction, I guess my,
49:40
that question would be, would that be prediction
49:43
of versus malignant,
49:45
or would this be prediction of, um,
49:49
future risk of the patient?
49:53
So for future risk assessment, um, of breast cancer risk,
49:58
um, I think we might consider that it,
50:01
it may increase the workload of the, in that, uh,
50:05
if the pa if we the patient is, uh, average risk,
50:09
and you know, we're thinking that the patient's
50:12
risk prediction increases, then we can expect to see more,
50:16
um, additional imaging or supplemental screening
50:19
or MRI, ultrasound et cetera, for those patients.
50:23
But if we are, um,
50:25
talking about benign versus malignant classification, um,
50:29
it's predicted to decrease it
50:31
because the idea is to really not look at those ones
50:36
that are predicted as zero
50:38
and chance of malignancy by AI at all.
50:40
Just like the workflow I showed
50:42
that we have here in UT Southwestern.
50:46
I actually have a question. The, um,
50:48
that patient slide you had about folks not
50:53
feeling super comfortable getting a report, uh, using ai,
50:58
is there something to be done
51:00
with helping folks be more comfortable with that?
51:03
Is it patient education?
51:04
Is it attending more lectures like these?
51:07
What, what is, what is something
51:09
that could make patients feel a little more comfortable
51:11
with this new landscape?
51:15
Um, that's a good question.
51:16
So we, in that study,
51:18
and we also did a study here, uh, in our center as well,
51:22
it seems that they hire the higher the patient's education,
51:27
uh, there there is an increase in, uh, trusting
51:31
or, uh, relying more on ai.
51:35
So whether that's, um, whether that's relevant to,
51:40
uh, a higher interaction with AI
51:42
or not in their daily lives, um, you know, that's,
51:46
that's a question we didn't ask
51:48
and they didn't ask in that study either.
51:49
But, uh, patient education seems to be one factor.
51:54
And, um, here in our, uh, data set, we found
52:00
that, uh, patient race did play a role, um, with, um,
52:06
non-Hispanic African American patients
52:09
and, um, Hispanic patients showing, um, more, um,
52:15
of distrust towards AI than, um,
52:19
non-Hispanic white patients.
52:20
So, so there are a lot of factors that play into this,
52:24
but, um, maybe it is patient education
52:27
and it could be that, uh, sometimes patients' interaction
52:32
with AI are not positive at all in daily life,
52:35
and they think that's gonna, what's gonna happen here.
52:38
Hmm.
52:39
Yeah. Interesting. I'm, I'm curious to see
52:41
how this evolves as, as this becomes implemented. Other
52:45
Component of that, sorry to interrupt you,
52:47
is they don't want
52:48
to lose their interaction with the doctor.
52:52
That's a big, um, factor in the patients not wanting
52:56
to just interact with the AI
52:57
because that interaction is very valuable to them.
53:04
Yeah. Interesting. Thank you.
53:09
Do you have any experience in using AI
53:11
and mammography with genetical tests as a predictor of risk?
53:15
Or what is your opinion about that?
53:18
That's a good question. We haven't done that.
53:20
We don't have that, uh, software I showed in my talk,
53:25
but I believe it has now been made available into a
53:30
commercially available product, um, by the group
53:34
that developed it.
53:35
But we don't have that implemented here.
53:40
Does your practice currently allow AI negative readings
53:42
to not be reviewed by a rad radiologist?
53:46
We don't because, uh,
53:47
the regulations don't allow for that at this time.
53:50
So how is this gonna play out?
53:52
Is really, it's gonna be very,
53:54
very interesting in the future
53:56
because, um, of the liability situation,
54:00
we don't know who is to take the liability for an,
54:04
for wrong AI interpretation or a wrong AI recall.
54:09
So, uh, at this time we are
54:12
basically using it as a second reader.
54:15
Um, but uh, there, I think there's implementation in Europe
54:20
actually for, to that effect
54:21
because it, they're doing double reading
54:25
and, uh, they're really, really behind on screening, reading
54:29
and, uh, their payment model is
54:32
based on that double reading.
54:33
So, uh, because it, on calls, the,
54:37
their call right back rates are really, really low,
54:40
much lower than average American, right?
54:42
So, uh, they have, uh, they can maintain their, um,
54:48
national mammography, uh, programs
54:51
where they offer it to everybody.
54:53
So in, in the US it's opportunistic
54:55
mammography screening as you know.
54:57
So, uh, with them, actually, I think a lot
55:01
of them are thinking about going live with, uh, AI reading
55:05
and only having, uh, the, the positive
55:09
or abnormal ones being looked at by the radiologist
55:12
because they're really behind and backlogged
55:14
and there's really not a whole lot
55:15
of press radiologists in the pipeline being trained.
55:19
So, um, I think that's the situation there.
55:22
So we may see it roll out in Europe
55:24
before it does in the UQS.
55:27
Gotcha. Let's say the AI gets something wrong.
55:31
How do you rectify it
55:34
or retrain it to correct itself in the future
55:38
for any other case it might see?
55:42
So at the time we received, um,
55:46
the AI model, it's actually locked.
55:50
So because, you know, I told you about the model drift
55:52
and model drift can occur even after being locked,
55:55
but, um, the, the, the changes
55:59
and, you know, the additional developments in the,
56:03
in the model is not allowed.
56:06
So that's both a good thing and a bad thing.
56:09
It's a, it is a, it's a good thing
56:10
because you know what to expect, but it's a bad thing
56:13
because, um, I mean, it would not be bad for you
56:17
to actually train it on your own dataset
56:19
and make sure it performs better in your own dataset,
56:23
but that's not currently allowed, uh,
56:26
because FDA requests that from the vendors.
56:29
So in answer to your question, I don't,
56:32
we can't really improve
56:34
or change, uh, commercial models, um, performance.
56:38
We can't, we might be able
56:39
to do it without commercialization if we have a homegrown
56:43
model that we're using, um, it might be possible,
56:47
but, um, it, it's not possible
56:50
for a commercial model at this time.
56:53
Gotcha. Alright, I think we're going
56:55
to wrap up today's noon conference.
56:56
Thank you all for participating in our noon conference
56:59
and asking such great questions.
57:00
You can access the recording of today's conference
57:03
and all our previous noom conferences
57:04
by creating a free MRI online account.
57:07
We'll also email out a link to the replay later today.
57:10
Be sure to join us next week on August 22nd at 12:00 PM
57:14
Eastern, where Dr. Reed Omari,
57:16
we'll deliver a lecture in entitled Radiology's Opportunity
57:20
to Benefit Patients and the Planet.
57:22
You can register for that@mrionline.com
57:24
and follow us on social media
57:26
for updates on future NOOM conferences.
57:28
Thanks again for learning with us and have a great day.