In this section, we are going to show you how to set up your development environment for SeaTunnel, and then run a simple example in your JetBrains IntelliJ IDEA.
You can develop or test SeaTunnel code in any development environment that you like, but here we use JetBrains IDEA as an example to teach you to step by step.
Before we start talking about how to set up the environment, we need to do some preparation work. Make sure you already have installed the following software:
- Git installed.
- Java ( JDK8/JDK11 are supported by now) installed and
JAVA_HOME
set. - Scala (only scala 2.11.12 supported by now) installed.
- JetBrains IDEA installed.
First of all, you need to clone the SeaTunnel source code from GitHub.
git clone git@github.com:apache/seatunnel.git
After cloning the source code, you should run the ./mvnw
command to install the subproject to the maven local repository.
Otherwise, your code could not start in JetBrains IntelliJ IDEA correctly.
./mvnw install -Dmaven.test.skip
After you install the maven, you can use the following command to compile and package.
mvn clean package -pl seatunnel-dist -am -Dmaven.test.skip=true
If you want to build submodules separately, you can use the following command to compile and package.
# This is an example of building the redis connector separately
mvn clean package -pl seatunnel-connectors-v2/connector-redis -am -DskipTests -T 1C
Now, you can open your JetBrains IntelliJ IDEA and explore the source code. But before building Scala code in IDEA, you should also install JetBrains IntelliJ IDEA's Scala Plugin. See Install Plugins For IDEA if you want to.
Before running the following example, you should also install JetBrains IntelliJ IDEA's Lombok plugin. See install plugins for IDEA if you want to.
Apache SeaTunnel uses Spotless
for code style and format checks. You can run the following command and Spotless
will automatically fix the code style and formatting errors for you:
./mvnw spotless:apply
You could copy the pre-commit hook
file /tools/spotless_check/pre-commit.sh
to your .git/hooks/
directory so that every time you commit your code with git commit
, Spotless
will automatically fix things for you.
After all the above things are done, you just finish the environment setup and can run an example we provide to you out
of box. All examples are in module seatunnel-examples
, you could pick one you are interested in, Running Or Debugging
It In IDEA as you wish.
Here we use seatunnel-examples/seatunnel-engine-examples/src/main/java/org/apache/seatunnel/example/engine/SeaTunnelEngineExample.java
as an example, when you run it successfully you can see the output as below:
2024-08-10 11:45:32,839 INFO org.apache.seatunnel.core.starter.seatunnel.command.ClientExecuteCommand -
***********************************************
Job Statistic Information
***********************************************
Start Time : 2024-08-10 11:45:30
End Time : 2024-08-10 11:45:32
Total Time(s) : 2
Total Read Count : 5
Total Write Count : 5
Total Failed Count : 0
***********************************************
All our examples use simple source and sink to make it less dependent and easy to run. You can change the example configuration
in resources/examples
. You can change your configuration as below, if you want to use PostgreSQL as the source and
sink to console.
Please note that when using connectors other than FakeSource and Console, you need to modify the dependencies in the pom.xml
file of the corresponding submodule of seatunnel-example.
env {
parallelism = 1
job.mode = "BATCH"
}
source {
Jdbc {
driver = org.postgresql.Driver
url = "jdbc:postgresql://host:port/database"
username = postgres
password = "123456"
query = "select * from test"
table_path = "database.test"
}
}
sink {
Console {}
}