Consultation only - no coding is needed this phase
We need expert from this field, ideally who already have experience on related jobs.
Objective: to find similar entry across database
Scenario: in a database of around 1000 articles, find out those which might be similar with an indicator of similarity strength
Definition of similar: (important)
- fuzzy string searching / approximate string matching is one of the approach but not the only
- we need to identify words in different spelling but in similar meaning, ie. mall vs market vs mart, happy vs excited vs pleasure, 18/6/2014 vs mid Jun 14 etc
- in other words, a database of similar/related words will be needed
If you are interested please let me know your approach in brief as well as any reference of related job you involved.
We can provide sample data on request.