Data sets in Hive that are appropriate for keyword search
I want to do keyword search over Hive. What kind of data sets have you used in Hive for this?
More specifically, I am looking for use cases for keyword search. For example, suppose there are three tables A, B, and C stored in HDFS. When the user issues a keyword query Q, the system should return the results that contain Q, obtained by joining tables A, B, and C.
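To make the scenario concrete, here is a minimal sketch of what I mean. It uses SQLite from Python purely as a stand-in for Hive so that it runs anywhere. The tables `a`, `b`, `c`, their columns, the foreign keys `a_id` and `b_id`, and the helper names `build_demo` and `keyword_search` are all made up for illustration; a real data set would have its own schema and join paths.

```python
import sqlite3


def build_demo():
    """Create three small, hypothetical tables linked by foreign keys."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE a (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE b (id INTEGER PRIMARY KEY, a_id INTEGER, title TEXT);
        CREATE TABLE c (id INTEGER PRIMARY KEY, b_id INTEGER, comment TEXT);
        INSERT INTO a VALUES (1, 'alice'), (2, 'bob');
        INSERT INTO b VALUES (10, 1, 'hadoop tuning'), (11, 2, 'cooking');
        INSERT INTO c VALUES (100, 10, 'great read'), (101, 11, 'uses hadoop oddly');
    """)
    return conn


def keyword_search(conn, keyword):
    """Join a, b, c and keep rows where any text column contains the keyword."""
    pattern = f"%{keyword}%"
    sql = """
        SELECT a.name, b.title, c.comment
        FROM a
        JOIN b ON b.a_id = a.id
        JOIN c ON c.b_id = b.id
        WHERE a.name LIKE ? OR b.title LIKE ? OR c.comment LIKE ?
        ORDER BY a.id, b.id, c.id
    """
    return conn.execute(sql, (pattern, pattern, pattern)).fetchall()


if __name__ == "__main__":
    conn = build_demo()
    for row in keyword_search(conn, "hadoop"):
        print(row)
```

Here the keyword can match in any of the three tables, and the join pulls in the related rows from the other two. That is the kind of result I would like to produce over Hive, only with a realistic multi-table data set.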
So my question is: apart from TPC-H, are there any data sets (containing three or more tables) that are appropriate for this keyword search scenario?
Thanks a lot!!!