spark streaming - Does graphloader can be distributed ? I have cluster machines on production mode -
i use dse graph loader reading input files hadoop distributed file systems.
i insert data dse graph cluster(on multiple machines) in distributed way.how can done?
the dse graph loader command line utility supports loading data many sources including csv, text, json, gryo, hdfs , aws s3 sources. cannot run hadoop/spark job.
to parallelize injest multiple threads, configure parameter load_threads (default 1). documentation can found here: configuring dse graph loader
Comments
Post a Comment