Compile And Package
1. Preparation
Environment requirements: Version of JDK must be higher then JDK8, Oracle/Sun and OpenJDK are both supported.
After obtaining the project code from github repository https://github.com/apache/linkis, use maven to compile the project installation package.
1.1 Source code acquisition
- Method 1: Obtain the source code of the project from github repository https://github.com/apache/linkis.
Notice : The official recommended versions for compiling Linkis are hadoop-2.7.2, hive-1.2.1, spark-2.4.3, and Scala-2.11.12.
If you want to compile Linkis with another version of Hadoop, Hive, Spark, please refer to: How to Modify Linkis dependency of Hadoop, Hive, Spark
2. Fully compile Linkis
step1 Compile for the first time (not the first time you can skip this step)
If you are compiling and using it locally for the first time, you must first execute the following command in the root directory of the Linkis source code package:
cd linkis-x.x.x
mvn -N install
step2 Execute compilation
Execute the following commands in the root directory of the Linkis source code package:
cd linkis-x.x.x
mvn clean install
step3 Obtain the installation package
The compiled complete installation package is in the assembly-combined-package->target directory of the project:
#Detailed path is as follows
linkis-x.x.x/assembly-combined-package/target/wedatasphere-linkis-x.x.x-dist.tar.gz
3. Compile a single module
step1 Compile for the first time (skip this step for non-first time)
If you are compiling and using it locally for the first time, you must first execute the following command in the root directory of the Linkis source code package:
cd linkis-x.x.x
mvn -N install
step2 Enter the corresponding module to compile
Enter the corresponding module to compile, for example, if you want to recompile Entrance, the command is as follows:
cd linkis-x.x.x/linkis-computation-governance/linkis-entrance
mvn clean install
step3 Obtain the installation package
Get the installation package, there will be a compiled package in the ->target directory of the corresponding module:
linkis-x.x.x/linkis-computation-governance/linkis-entrance/target/linkis-entrance.x.x.x.jar
4. Compile an engine
Here's an example of the Spark engine that builds Linkis:
step1 Compile for the first time (skip this step for non-first time)
If you are using it locally for the first time, you must first execute the following command in the root directory of the Linkis source code package:
cd linkis-x.x.x
mvn -N install
step2 Enter the corresponding module to compile
Enter the directory where the Spark engine is located to compile and package, the command is as follows:
cd linkis-x.x.x/linkis-engineconn-plugins/engineconn-plugins/spark
mvn clean install
step3 Obtain the installation package
Get the installation package, there will be a compiled package in the ->target directory of the corresponding module:
linkis-x.x.x/linkis-engineconn-plugins/engineconn-plugins/spark/target/linkis-engineplugin-spark-x.x.x.jar
How to install Spark engine separately? Please refer to Linkis Engine Plugin Installation Document
5. How to modify the Hadoop, Hive, and Spark versions that Linkis depends on
Please note: Hadoop is a big data basic service, Linkis must rely on Hadoop for compilation; If you don't want to use an engine, you don't need to set the version of the engine or compile the engine plug-in.
Specifically, the version of Hadoop can be modified in a different way than Spark, Hive, and other computing engines, as described below:
5.1 How to modify the Hadoop version that Linkis depends on
Enter the source package root directory of Linkis, and manually modify the Hadoop version information of the pom.xml file, as follows:
cd linkis-x.x.x
vim pom.xml
<properties>
<hadoop.version>2.7.2</hadoop.version> <!--> Modify the Hadoop version number here <-->
<scala.version>2.11.12</scala.version>
<jdk.compile.version>1.8</jdk.compile.version>
</properties>
Please note: If your hadoop version is hadoop3, you need to modify the pom file of linkis-hadoop-common Because under hadoop2.8, hdfs-related classes are in the hadoop-hdfs module, but in hadoop 3.X the corresponding classes are moved to the module hadoop-hdfs-client, you need to modify this file: pom:Linkis/linkis-commons/linkis-hadoop-common/pom.xml Modify the dependency hadoop-hdfs to hadoop-hdfs-client:
<dependency>
<groupId>org.apache.hadoop</groupId>
<artifactId>hadoop-hdfs</artifactId> <!-- Just replace this line with <artifactId>hadoop-hdfs-client</artifactId>-->
<version>${hadoop.version}</version>
</dependency>
Modify hadoop-hdfs to:
<dependency>
<groupId>org.apache.hadoop</groupId>
<artifactId>hadoop-hdfs-client</artifactId>
<version>${hadoop.version}</version>
</dependency>
5.2 How to modify the Spark and Hive versions that Linkis depends on
Here's an example of changing the version of Spark. Go to the directory where the Spark engine is located and manually modify the Spark version information of the pom.xml file as follows:
cd linkis-x.x.x/linkis-engineconn-plugins/engineconn-plugins/spark
vim pom.xml
<properties>
<spark.version>2.4.3</spark.version> <!--> Modify the Spark version number here <-->
</properties>
Modifying the version of other engines is similar to modifying the Spark version. First, enter the directory where the relevant engine is located, and manually modify the engine version information in the pom.xml file.
Then please refer to 4. Compile an engine