I want to develop a sentence matching engine. The engine will generate a result which is the most matching sentence with the input. Even if it is a least match, the engine must generate an output from the data set( a text file with a lot of sentences).
eg: Input :
Hello I am Nidhin Joseph
Data set:
1). Hello, How are you?
2). And I am Nidhin.
3). I am Nidhin Joseph Hello.
Among these three, according to my requirements, the most matching one is
the third sentence.
I am ranking on the basis of both word hit and word order.
My input : {"Hello","I","am","Nidhin","Joseph"}
My output : {"I","am","Nidhin","Joseph","Hello"}
Here no of word hits= 4
No of relatively ordered words = 4
I don't know whether i was able to convey you my idea. If i made it, then please
tell me, if a similar library is already available in Java.
If not, please lead me in some right direction so that i can develop it in an easier way.