There is an AI web that will do this. I think I used this in the past to strip voice from a music track: https://www.lalal.ai/stem-splitter/
The splitter might let you isolate the voice and remove the music.
I tried that one first. It works, but my clip is too long.
This looks kind of shady but maybe it'll work: https://vocalremover.org/
How long is your clip? Maybe break it into smaller samples then stich it back together in audacity.
Current clip is 1:21:00
EDIT: Replied on wrong comment, sorry.
