Real time hive import from MySQL -
i have requirement data getting inserted mysql database @ high rate (30000 - 40000 rows every couple of mins). need import data mysql hive after doing processing, e.g. group by, summation. , use hive datastore run analytics processes.
i using sqoop handle import part. afaik, oozie can used configure sqoop imports scheduled jobs (run after every 5 mins). possible configure sqoop imports on basis of update/insert in mysql, every insert in table1 in mysql, sqoop job should run import newly inserted data (in append mode sqoop), data in hive should real time.
Comments
Post a Comment