kafka-connect-nebula is a Kafka sink connector that captures rows from Kafka topics and streams them into NebulaGraph.
Kafka topics can be mapped to existing NebulaGraph node/edge types in the Kafka configuration.
The connector converts row data in JSON string, Struct, or Map format into a node record or an edge record according to the sink config file, and then writes the node/edge record into NebulaGraph.
You need to specify configuration settings for your connector.
These can be found in the `config/connect-nebula-sink.properties` file.
| config name | required | description | default value | available values |
|---|---|---|---|---|
| name | true | A unique name for the connector. | _ | kafka-nebula-sink |
| connector.class | true | The entry class for this connector. | _ | com.vesoft.nebula.connector.sink.NebulaSinkConnector |
| tasks.max | true | The maximum number of tasks that should be created for this connector. The connector may create fewer tasks if it cannot achieve this level of parallelism. | - | Int number |
| topics | true | The Kafka topics to sink data from into NebulaGraph. | _ | _ |
| nebula.graph.servers | true | NebulaGraph graphd server addresses; separate multiple addresses with commas. | _ | _ |
| nebula.user | true | NebulaGraph user | _ | _ |
| nebula.passwd | false | NebulaGraph password for user | _ | _ |
| nebula.authOptions | false | NebulaGraph other auth options | _ | _ |
| nebula.schema | false | NebulaGraph home schema path | - | _ |
| nebula.date.format | false | NebulaGraph date format | - | _ |
| nebula.localDatetime.format | false | NebulaGraph local datetime format | - | _ |
| nebula.zonedDatetime.format | false | NebulaGraph zoned datetime format | - | _ |
| nebula.localTime.format | false | NebulaGraph local time format | - | _ |
| nebula.zonedTime.format | false | NebulaGraph zoned time format | - | _ |
| nebula.enable.tls | false | Enable the tls for NebulaGraph | false | _ |
| nebula.disable.verify.cert | false | Disable the verification for server certification | false | _ |
| nebula.tls.ca.path | false | The path of ca certification | _ | _ |
| nebula.tls.cert.path | false | The path of client certification | _ | _ |
| nebula.tls.key.path | false | The path of client private key | _ | _ |
| nebula.connect.timeout | false | The connect timeout for NebulaGraph Client | 3000(ms) | - |
| nebula.request.timeout | false | The request timeout for NebulaGraph Client | 3000(ms) | - |
| nebula.sink.partitions | false | The partitions for sink connector | 10 | - |
| nebula.graph.name | true | NebulaGraph graph name to sink data into | _ | _ |
| nebula.data.type | false | NebulaGraph data type to sink data into. NODE: sink Kafka data into a node type; EDGE: sink Kafka data into an edge type; BOTH: sink Kafka data into both one node and one edge. | NODE | NODE,EDGE,BOTH |
| nebula.node.typeName | false | The node type name in NebulaGraph; required when nebula.data.type is NODE or BOTH. | _ | _ |
| nebula.edge.typeName | false | The edge type name; required when nebula.data.type is EDGE or BOTH. | - | - |
| nebula.node.primaryKeys | false | The NebulaGraph node type primary keys; required when nebula.data.type is NODE or BOTH and the node's primary key is composite. | _ | _ |
| kafka.node.primaryKeys | false | The Kafka property names to sink as the node primary key; required when nebula.data.type is NODE or BOTH in UPDATE or DELETE sink mode. | _ | _ |
| nebula.node.property.names | false | The NebulaGraph node properties to sink data into; required when nebula.data.type is NODE or BOTH. | - | - |
| kafka.node.property.names | false | The Kafka properties to write to NebulaGraph as node properties; required when nebula.data.type is NODE or BOTH. | - | - |
| nebula.edge.srcPks | false | The NebulaGraph primary keys of the edge's source node type; required when nebula.data.type is EDGE or BOTH and the source node's primary key is composite. | - | - |
| kafka.edge.srcPks | false | The Kafka property names to sink as the edge's source node primary key; required when nebula.data.type is EDGE or BOTH. | - | - |
| nebula.edge.dstPks | false | The NebulaGraph primary keys of the edge's destination node type; required when nebula.data.type is EDGE or BOTH and the destination node's primary key is composite. | - | - |
| kafka.edge.dstPks | false | The Kafka property names to sink as the edge's destination node primary key; required when nebula.data.type is EDGE or BOTH. | - | - |
| nebula.edge.property.names | false | The NebulaGraph edge properties to sink data into; required when nebula.data.type is EDGE or BOTH. | - | - |
| kafka.edge.property.names | false | The Kafka properties to write to NebulaGraph as edge properties; required when nebula.data.type is EDGE or BOTH. | - | - |
| nebula.batchSize | false | The batch size when sink data into NebulaGraph | 2000 | - |
| nebula.sink.mode | false | The sink mode for NebulaGraph. | INSERT | INSERT,UPDATE,DELETE |
| kafka.null.value | false | The value in Kafka that will be treated as null. | \N | - |
| nebula.sink.interval.time.mills | false | The interval time between retries (ms). | 0 | - |
| nebula.sink.retry.times | false | The number of retries when sinking into NebulaGraph fails. | 0 | - |
| key.converter | false | Use this parameter to override the default key converter class set by the worker. | - | org.apache.kafka.connect.storage.StringConverter |
| value.converter | false | Use this parameter to override the default value converter class set by the worker. | - | org.apache.kafka.connect.storage.StringConverter |
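As an illustrative sketch, a minimal NODE-mode sink config could look like the following; the topic, graph name, node type, and property names (`movie`, `Movie`, `id`, `name`, `year`) are hypothetical examples for a movie schema, not values shipped with the connector:

```properties
name=kafka-nebula-sink
connector.class=com.vesoft.nebula.connector.sink.NebulaSinkConnector
tasks.max=1
# hypothetical topic and graph names
topics=movie
nebula.graph.servers=127.0.0.1:9669
nebula.user=root
nebula.passwd=nebula
nebula.graph.name=movie
nebula.data.type=NODE
# hypothetical node type, primary key, and properties
nebula.node.typeName=Movie
kafka.node.primaryKeys=id
nebula.node.property.names=name,year
kafka.node.property.names=name,year
nebula.sink.mode=INSERT
value.converter=org.apache.kafka.connect.storage.StringConverter
```

EDGE or BOTH modes additionally need the nebula.edge.* and kafka.edge.* settings from the table above.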
In this quickstart, we use a movie dataset: we produce the movie JSON data with kafka-console-producer, then sink the JSON data into NebulaGraph.
NOTE: You must have the Confluent Platform installed in order to run the example.
Execute the GQL statement to create a graph with the movie schema.
```shell
git clone https://github.com/vesoft-inc/nebula-ng-kafka-connector.git
mvn clean package -Dmaven.test.skip=true -Dmaven.javadoc.skip=true
```
After the command finishes, kafka-connect-nebula-5.0-SNAPSHOT.jar will be generated in
kafka-connector/connector/target.
Put kafka-connect-nebula-5.0-SNAPSHOT.jar into your Kafka environment: KAFKA_HOME/libs.
You can find the demo configs in quickstart/connect-nebula-sink_*.properties.
Please update the config values according to your Kafka environment and NebulaGraph environment.
You can produce the JSON data with quickstart/producer_data.sh.
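For illustration, assuming a hypothetical Movie node type keyed by id with properties name and year, a JSON record could also be piped directly into kafka-console-producer (the topic name movie is an assumption, not part of the connector):

```shell
# hypothetical movie record; topic and broker address are assumptions
echo '{"id": 1, "name": "The Matrix", "year": 1999}' | \
  ${KAFKA_HOME}/bin/kafka-console-producer.sh \
    --bootstrap-server localhost:9092 \
    --topic movie
```

The record's field names must match the kafka.node.primaryKeys and kafka.node.property.names settings in your sink config.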
```shell
${KAFKA_HOME}/bin/connect-standalone.sh ${KAFKA_HOME}/config/standalone.properties connect-nebula-sink_*.properties
```
Please make sure you sink the node data first, since edge records reference node primary keys.