SeaTunnel is a next-generation, super high-performance, distributed, massive data integration tool. It efficiently and stably synchronizes tens of billions of data daily and has been adopted by numerous companies.
SeaTunnel specializes in data integration and synchronization, addressing common challenges in this domain:
-
Diverse Data Sources: With hundreds of data sources and evolving technologies, finding a tool capable of rapidly adapting to these sources can be challenging.
-
Complex Synchronization Scenarios: Data synchronization requirements encompass offline-full synchronization, offline-incremental synchronization, CDC, real-time synchronization, and full database synchronization.
-
Resource Efficiency: Many existing data integration and synchronization tools demand significant computing resources and JDBC connections for real-time synchronization, adding strain to enterprises.
-
Quality and Monitoring: Data integration and synchronization processes often encounter data loss or duplication. A lack of monitoring makes it difficult to gain insights during task execution.
-
Diverse Connectors: SeaTunnel offers support for over 100 connectors, with ongoing expansion. You can explore the list of supported and planned connectors here.
-
Batch-Stream Integration: Connectors developed using the SeaTunnel Connector API are highly adaptable to various synchronization scenarios, simplifying data integration management.
-
Distributed Snapshot Algorithm: Ensures data consistency across synchronized data.
-
Multi-Engine Support: Defaulting to the SeaTunnel Zeta Engine, SeaTunnel also supports Flink and Spark as execution engines, catering to various enterprise requirements.
-
JDBC Multiplexing and Log Parsing: Supports multi-table or whole database synchronization, efficiently addressing over-JDBC connections and CDC multi-table synchronization issues.
-
High Throughput and Low Latency: Provides stable, high-throughput data synchronization with low latency.
-
Real-Time Monitoring: Detailed monitoring information throughout the synchronization process offers insights into data volume, size, QPS, and more.
-
Two Job Development Methods: Supports both coding and canvas design. The SeaTunnel web project offers visual job management, scheduling, running, and monitoring.
In addition to these features, SeaTunnel provides a versatile Connector API that is not tied to a specific execution engine, ensuring flexibility and compatibility with various engines.
The SeaTunnel runtime process involves configuring job information, selecting the execution engine, and parallelizing data using Source Connectors. These connectors can easily be developed and extended to meet specific needs. By default, SeaTunnel uses the SeaTunnel Engine. If desired, it can package the Connector for execution using Flink or Spark.
- Supported Source Connectors: Check Out
- Supported Sink Connectors: Check Out
- Transform Connectors: Check Out
For a list of connectors and their health status, visit the Connector Status.
Download SeaTunnel directly from the official website.
SeaTunnel defaults to using the SeaTunnel Zeta Engine as the runtime execution engine for data synchronization. For the best functionality and performance, we recommend using the Zeta engine. However, SeaTunnel also supports Flink and Spark as execution engines.
-
SeaTunnel Zeta Engine: Quick Start
-
Spark: Quick Start
-
Flink: Quick Start
-
Weibo, Value-added Business Department Data Platform: Weibo's business utilizes a customized version of SeaTunnel for task monitoring in real-time streaming computing.
-
Tencent Cloud: Collects and extracts data from various business services into Apache Kafka, storing it in Clickhouse.
-
Sina, Big Data Operation Analysis Platform: Performs real-time and offline analysis of data operation and maintenance for services such as Sina News, CDN, and writes the data into Clickhouse.
-
Sogou, Sogou Qiqian System: Uses SeaTunnel as an ETL tool to establish a real-time data warehouse system.
-
Yonghui Superstores Founders' Alliance-Yonghui Yunchuang Technology, Member E-commerce Data Analysis Platform: Provides real-time streaming and offline SQL computing of e-commerce user behavior data for Yonghui Life.
For more use cases, please refer to the SeaTunnel blog.
SeaTunnel Web is a web project that provides visual management of jobs, scheduling, running and monitoring capabilities. It is developed based on the SeaTunnel Connector API and the SeaTunnel Zeta Engine. It is a web project that can be deployed independently. It is also a sub-project of SeaTunnel. For more information, please refer to SeaTunnel Web
This project adheres to the Contributor Covenant Code of Conduct. By participating, you are expected to uphold this code. Please follow the REPORTING GUIDELINES to report unacceptable behavior.
Thanks to all developers!
Please follow this document.
-
Mail list: [email protected]. Mail to
[email protected]
, follow the reply to subscribe to the mail list. -
Slack: Join SeaTunnel Slack
-
Twitter: ASFSeaTunnel on Twitter
-
Bilibili (for Chinese users)
SeaTunnel enriches the [CNCF CLOUD NATIVE Landscape](https://landscape.cncf.io/?landscape=observability-and-analysis&license=apache-license-2-0).
Various companies and organizations use SeaTunnel for research, production, and commercial products. Visit our website to find the user page.