vane

2y

Part 3

https://devrant.com/rants/9881158/...

I dropped subtitles and started extracting audio from movie, after that I use whisper to convert speech to text.

I parse srt from whisper, adjust timestamps to get >= arbitrary amount of voice seconds. I put text to vector database with timestamps and movie file name.

I query database by ex. “I don’t know” and extract first n results, after that I walk trough movies and extract parts with found text.

I normalize and merge parts into one movie.

Results are satisfying so now I decided to try to find a common dialogue that I can watch by combining multiple persons speaking from multiple movies.

Might also try to extract person from one movie and put it to other movie.

rant

artificial intelligence

movies

Ranter

Comments

2

vane

10560

2y

sample video

'r2'

https://odysee.com/9d50f132-8f87-44...
1

Wisecrack

9330

2y

Oh, this is fucking cool. Been waiting to see some proper applications of whisper.

Related Rants

devRant © 2021 Hexical Labs LLC
Privacy Policy | Terms of Service