I want to write the code that was proposed in the "VSUMM" article.
In one step there is k-means algorithm. It's written in the article but didn't describe how it'll work. According to the article there is a video at first. The video will be split into frames, then according to some formula the k will be computed and k_mean algorithm get started.
There is a data set of video frames which are images. How could I apply k-means on them?
What I've done till now:
- Put all images in k groups. Each group in a row of a cell array which the first data of it, is the mean value. this values are the names of images indeed.
- Calculate the Euclidean distance between each image and the other mean value and I put the images that has minimum distance to previous keys in that group.
But now I'm stuck at the 3rd step, and I don't know what to do.
I've got k group of images in cell array which values are just the name of images but according to k-means the new k is the mean of elements in a group. But these elements in my theory are just the name of images. So what should I do? What should this mean be? Is it correct if i get the mean of images' name?