Announcement

Free to view yesterday and today
Customer Service: cat_manager

Trino: Distributed SQL Query Engine for Big Data

Trino is a high-performance, distributed SQL query engine for big data. It enables querying data where it lives, including HDFS, S3, Cassandra, MySQL, and many others, without needing to move data.

Java
Added on 2025年6月15日
View on GitHub
Trino: Distributed SQL Query Engine for Big Data preview
11,432
Stars
3,229
Forks
Java
Language

Project Introduction

Summary

Trino is an open-source distributed SQL query engine designed to query large datasets residing in various data sources simultaneously. It provides a single point of access using standard SQL.

Problem Solved

Traditional data warehousing requires moving all data into a central repository, which is expensive and time-consuming. Trino solves this by allowing users to query data directly in its source location, bridging silos and enabling real-time analytics across diverse systems.

Core Features

Connectors

Pluggable architecture supporting connectivity to a wide array of data sources (HDFS, S3, relational databases, NoSQL, etc.).

Distributed Execution

Scalable query execution across a cluster of machines for high performance on large datasets.

Tech Stack

Java
Maven
Netty
Jackson
Aeron

Use Cases

Trino's architecture makes it ideal for various scenarios requiring querying diverse data stores using SQL:

Data Lake Analytics

Details

Querying petabytes of data stored in data lakes on S3, HDFS, or similar object storage using standard SQL interfaces.

User Value

Enables analysts and data scientists to directly query raw data without complex ETL pipelines.

Cross-Source Reporting

Details

Joining data from different systems like a PostgreSQL database, a Kafka topic, and S3 data simultaneously within a single SQL query.

User Value

Simplifies reporting and BI by providing a unified view across disparate data sources.

Recommended Projects

You might be interested in these projects

raphamorimrio

Rio is a high-performance, hardware-accelerated terminal emulator designed for both desktop environments and web browsers, leveraging your GPU for smoother rendering and improved responsiveness.

Rust
5075182
View Details

krahetshello-algo

An open-source, animated tutorial for data structures and algorithms, featuring runnable code examples in multiple programming languages.

Java
11237413968
View Details

alibabaSentinel

Sentinel is a powerful flow control component designed for microservices, enhancing reliability, resilience, and real-time monitoring in cloud-native environments.

Java
227458093
View Details