I am trying to create an algorithm that, given a set of elements that are in a site, would return the one that is more likely to be the "one" describing a product.
This is not a 100% accurate algorithm of course, and will need of people-based selection afterwards, but I am trying to get at least a list of the 3 most probable images. Here is what I do so far:
1 - Get rid of which square size is smaller than 50x50px
2 - Calculate average square size of all in the page
3 - Give a score to each , depending on the difference with the average square size (the bigger square size, the more score)
4 - If width of is > than (height * 5), score = score * 0.5 (I do this because this is likely to be a banner in the page).
Here is what I would like to change, but I fail to see what would be a good way to do so. In step 3, what I am doing right now is giving the following score:
score_of_image = average_square_size - square_size_image
I would like to have a particular amount of points that would be shared out to all the depending on their size. The share out should reflect the size of one picture related to the others. What would be a good algorithm in order to share out these points?