python - How to connect from pyspark to hbase -
i have 2 clusters in azure, 1 spark 2.0 , hbase. can fill , use hbase perfectly, want connect spark hbase , calculations. use python don't know scala.
my first questions is: what's best way it? spark -> hive -> hbase or spark directly hbase?
could have examples inserting , querying?
thanks?
if you're cool spark hbase directly via spark-to-hbase connector, zhan zhang has pretty blog post @ http://hortonworks.com/blog/spark-hbase-dataframe-based-hbase-connector/ providing examples.
hth!
Comments
Post a Comment