Apache DataFusion

Apache Software Foundation

About

Apache DataFusion is an extensible, high-performance query engine written in Rust that uses Apache Arrow as its in-memory format. Designed for developers building data-centric systems such as databases, DataFrame libraries, machine learning pipelines, and streaming applications, DataFusion offers SQL and DataFrame APIs, a vectorized, multi-threaded, streaming execution engine, and support for partitioned data sources. It natively supports CSV, Parquet, JSON, and Avro, and integrates with object stores including AWS S3, Azure Blob Storage, and Google Cloud Storage. The engine includes a full query planner and an optimizer that performs expression coercion and simplification, projection and filter pushdown, sort- and distribution-aware optimizations, and automatic join reordering. DataFusion is highly customizable: you can add user-defined scalar, aggregate, and window functions, custom data sources, query languages, and more.

About

Serverless, interactive querying for analyzing data in IBM Cloud Object Storage. Query your data directly where it is stored: there is no ETL, no database, and no infrastructure to manage, and no schema definition is needed to enable SQL queries. IBM Cloud SQL Query uses Apache Spark, an open-source, fast, extensible, in-memory data processing engine optimized for low latency and ad hoc analysis of data. Analyze data in place using the query editor and REST API. Run as many queries as you need; with pay-per-query pricing, you pay only for the data scanned. Compress or partition data to reduce cost and improve performance. IBM Cloud SQL Query is highly available and executes queries using compute resources across multiple facilities. It supports a variety of data formats, such as CSV, JSON, and Parquet, and allows for standard ANSI SQL.
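
To make the pay-per-scan idea concrete, here is a rough back-of-the-envelope sketch in Python. It assumes the $5.00 figure listed under Pricing applies per terabyte scanned and that a terabyte is 10^12 bytes. Neither is confirmed by this page, and the function name and the 10% pruning ratio are purely hypothetical.

```python
# Assumption: decimal terabyte; the page does not define the unit.
TB = 10**12


def scan_cost_usd(bytes_scanned: int, price_per_tb: float = 5.00) -> float:
    """Estimate the cost of one query from the number of bytes it scans.

    price_per_tb is an assumed rate taken from the listed price, not an
    official billing formula.
    """
    return bytes_scanned / TB * price_per_tb


# A query that reads a full 2 TB data set.
full_scan = scan_cost_usd(2 * TB)

# Hypothetical: partitioning lets the same query read only 10% of the data.
pruned_scan = scan_cost_usd(2 * TB // 10)

print(f"full scan:   ${full_scan:.2f}")
print(f"pruned scan: ${pruned_scan:.2f}")
```

Under these assumptions the cost scales linearly with bytes scanned, which is why compressing or partitioning the data lowers the bill.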

About

Designed around common data wrangling habits, Polars exposes a complete Python API, including a full set of features for manipulating DataFrames through an expression language that helps you write readable and performant code. Polars is written in Rust and is uncompromising in its choices to provide a feature-complete DataFrame API to the Rust ecosystem. Use it as a DataFrame library or as a query engine backend for your data models.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Professional developers and data engineers seeking a solution for building data-centric systems

Audience

All types of businesses and organizations requiring a solution to query their data directly where it is stored

Audience

IT teams seeking a DataFrame interface solution that is implemented in Rust

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

API

Offers API

Pricing

Free
Free Version
Free Trial

Pricing

$5.00/Terabyte-Month
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Apache Software Foundation
Founded: 1999
United States
datafusion.apache.org

Company Information

IBM
Founded: 1911
United States
www.ibm.com/cloud/sql-query

Company Information

Polars
www.pola.rs/

Alternatives

Alternatives

  • Apache DataFusion (Apache Software Foundation)

Alternatives

  • BigLake (Google)
  • Apache Spark (Apache Software Foundation)
  • Apache DataFusion (Apache Software Foundation)
  • Apache Phoenix (Apache Software Foundation)
  • TimescaleDB (Tiger Data)

Categories

Categories

Categories

Relational Database Features

ACID Compliance
Data Failure Recovery
Multi-Platform
Referential Integrity
SQL DDL Support
SQL DML Support
System Catalog
Unicode Support

Integrations

Apache Arrow
Apache Avro
Apache Parquet
Apache Spark
Autymate
Azure Blob Storage
C
Data Sentinel
Flyte
Google Cloud Storage
Google Sheets
IBM Cloud Object Storage
JSON
Lyftrondata
Microsoft Excel
Node.js
Python
SDF
SQL
ZenML

Integrations

Apache Arrow
Apache Avro
Apache Parquet
Apache Spark
Autymate
Azure Blob Storage
C
Data Sentinel
Flyte
Google Cloud Storage
Google Sheets
IBM Cloud Object Storage
JSON
Lyftrondata
Microsoft Excel
Node.js
Python
SDF
SQL
ZenML

Integrations

Apache Arrow
Apache Avro
Apache Parquet
Apache Spark
Autymate
Azure Blob Storage
C
Data Sentinel
Flyte
Google Cloud Storage
Google Sheets
IBM Cloud Object Storage
JSON
Lyftrondata
Microsoft Excel
Node.js
Python
SDF
SQL
ZenML