Best ETL Software for Databricks Data Intelligence Platform

Compare the Top ETL Software that integrates with Databricks Data Intelligence Platform as of November 2025

This is a list of ETL software that integrates with Databricks Data Intelligence Platform. View the products that work with Databricks Data Intelligence Platform in the table below.

What is ETL Software for Databricks Data Intelligence Platform?

ETL software is used to extract, transform and load data between multiple databases in order to organize and structure it for further analysis. Compare and read user reviews of the best ETL software for Databricks Data Intelligence Platform currently available using the table below. This list is updated regularly.

  • 1
    Google Cloud BigQuery
    BigQuery is an ideal tool for Extract, Transform, Load (ETL) processes, enabling businesses to automate data ingestion, transformation, and loading for analytics. It allows users to transform raw data into useful formats using SQL queries and integrates with various ETL tools to streamline workflows. The platform’s scalability ensures that ETL jobs run smoothly, even with vast amounts of data. New users can take advantage of the $300 in free credits to explore BigQuery’s ETL capabilities and experience the seamless processing of data for analytics. With its high-performance query engine, BigQuery ensures that ETL processes are fast and efficient, regardless of data size.
    Starting Price: Free ($300 in free credits)
  • 2
    dbt

    dbt Labs

    dbt modernizes the “T” in ETL: Transformation. Instead of relying on legacy pipelines or black-box transformations, dbt empowers data teams to build, test, and document transformations directly inside the data warehouse or lakehouse. With dbt, teams can:
    - Transform raw data into analytics-ready models using SQL and Jinja.
    - Ensure reliability with built-in testing, version control, and CI/CD.
    - Standardize workflows across teams with reusable models and shared documentation.
    - Leverage modern platforms like Snowflake, Databricks, BigQuery, and Redshift for scalable transformation.
    By focusing on the transformation layer, dbt helps organizations shorten pipeline development cycles, reduce data debt, and deliver trusted insights faster, complementing ingestion and loading tools in a modern ELT stack.
    Starting Price: $100 per user/month
  • 3
    Hevo

    Hevo Data

    Hevo Data is a no-code, bi-directional data pipeline platform built for modern ETL, ELT, and Reverse ETL needs. It helps data teams streamline and automate org-wide data flows, saving ~10 hours of engineering time per week and enabling 10x faster reporting, analytics, and decision making. The platform supports 100+ ready-to-use integrations across databases, SaaS applications, cloud storage, SDKs, and streaming services. Over 500 data-driven companies spread across 35+ countries trust Hevo for their data integration needs. Try Hevo today and get your fully managed data pipelines up and running in just a few minutes.
    Starting Price: $249/month
  • 4
    Gathr.ai

    Gathr.ai

    Gathr is a Data+AI fabric, helping enterprises rapidly deliver production-ready data and AI products. Data+AI fabric enables teams to effortlessly acquire, process, and harness data, leverage AI services to generate intelligence, and build consumer applications— all with unparalleled speed, scale, and confidence. Gathr’s self-service, AI-assisted, and collaborative approach enables data and AI leaders to achieve massive productivity gains by empowering their existing teams to deliver more valuable work in less time. With complete ownership and control over data and AI, flexibility and agility to experiment and innovate on an ongoing basis, and proven reliable performance at real-world scale, Gathr allows them to confidently accelerate POVs to production. Additionally, Gathr supports both cloud and air-gapped deployments, making it the ideal choice for diverse enterprise needs. Gathr, recognized by leading analysts like Gartner and Forrester, is a go-to-partner for Fortune 500
    Starting Price: $0.25/credit
  • 5
    Rivery

    Rivery

    Rivery’s SaaS ETL platform provides a fully managed solution for data ingestion, transformation, orchestration, reverse ETL and more, with built-in support for your development and deployment lifecycles. Key features:
    - Data workflow templates: An extensive library of pre-built templates that lets teams create powerful data pipelines with the click of a button.
    - Fully managed: A no-code, auto-scalable, and hassle-free platform. Rivery takes care of the back end, allowing teams to spend time on priorities rather than maintenance.
    - Multiple environments: Construct and clone custom environments for specific teams or projects.
    - Reverse ETL: Automatically send data from cloud warehouses to business applications, marketing clouds, CDPs, and more.
    Starting Price: $0.75 Per Credit
  • 6
    HighByte Intelligence Hub
    HighByte Intelligence Hub is a DataOps software solution purpose-built for industrial data. The Intelligence Hub enables manufacturers to securely collect, model, and stream industrial datasets to and from IT systems without writing or maintaining code. The software is deployed at the Edge to merge real-time, transactional, and time-series data into a single payload for consuming applications. With the Intelligence Hub, users can speed system integration time, rapidly leverage contextualized data for analytics, ML, and AI agents, and govern data standards across the enterprise. HighByte Intelligence Hub provides the critical data infrastructure for Industry 4.0. HighByte Intelligence Hub is a software solution that solves data architecture and integration problems at scale for industrial operations. The Intelligence Hub combines Edge operations, advanced data contextualization, and the ability to deliver unique and specific data to multiple end applications in a code-free solution.
    Starting Price: 17,500 per year
  • 7
    Dagster

    Dagster Labs

    Dagster is a next-generation orchestration platform for the development, production, and observation of data assets. Unlike other data orchestration solutions, Dagster provides you with an end-to-end development lifecycle. Dagster gives you control over your disparate data tools and empowers you to build, test, deploy, run, and iterate on your data pipelines. It makes you and your data teams more productive, your operations more robust, and puts you in complete control of your data processes as you scale. Dagster brings a declarative approach to the engineering of data pipelines. Your team defines the data assets required, quickly assessing their status and resolving any discrepancies. An assets-based model is clearer than a tasks-based one and becomes a unifying abstraction across the whole workflow.
    Starting Price: $0
  • 8
    Prophecy

    Prophecy

    Prophecy opens pipeline development to many more users, including visual ETL developers and data analysts. All you need to do is point and click and write a few SQL expressions to create your pipelines. As you use the low-code designer to build your workflows, you are developing high-quality, readable code for Spark and Airflow that is committed to your Git repository. Prophecy gives you a gem builder so you can quickly develop and roll out your own frameworks, such as data quality, encryption, and new sources and targets that extend the built-in ones. Prophecy provides best practices and infrastructure as managed services, simplifying your operations. With Prophecy, your workflows are high performance and use the scale-out performance and scalability of the cloud.
    Starting Price: $299 per month
  • 9
    Ascend

    Ascend

    Ascend gives data teams a unified and automated platform to ingest, transform, and orchestrate their entire data engineering and analytics engineering workloads, 10X faster than ever before. Ascend helps gridlocked teams break through constraints to build, manage, and optimize the increasing number of data workloads required. Backed by DataAware intelligence, Ascend works continuously in the background to guarantee data integrity and optimize data workloads, reducing time spent on maintenance by up to 90%. Build, iterate on, and run data transformations easily with Ascend’s multi-language flex-code interface, which enables the use of SQL, Python, Java, and Scala interchangeably. Quickly view data lineage, data profiles, job and user logs, system health, and other critical workload metrics at a glance. Ascend delivers native connections to a growing library of common data sources with its Flex-Code data connectors.
    Starting Price: $0.98 per DFC
  • 10
    Arcion

    Arcion Labs

    Deploy production-ready change data capture pipelines for high-volume, real-time data replication without a single line of code. Enjoy automatic schema conversion, end-to-end replication, flexible deployment, and more with Arcion’s distributed Change Data Capture (CDC). Leverage Arcion’s zero data loss architecture for guaranteed end-to-end data consistency, built-in checkpointing, and more without any custom code. Leave scalability and performance concerns behind with a highly distributed, highly parallel architecture supporting 10x faster data replication. Reduce DevOps overhead with Arcion Cloud, the only fully managed CDC offering. Enjoy autoscaling, built-in high availability, a monitoring console, and more. Simplify and standardize data pipeline architecture, and migrate workloads from on-prem to cloud with zero downtime.
    Starting Price: $2,894.76 per month
  • 11
    Boltic

    Boltic

    Build and orchestrate ETL pipelines with ease on Boltic. Extract, transform, and load data from multiple sources to any destination without writing code. Use advanced transformations and build end-to-end data pipelines for analytics-ready data. Integrate data from a list of 100+ pre-built Integrations and join multiple data sources together with a few clicks to work on the cloud. Add Boltic’s No-code transformation or use Script Engine to design custom scripts on integrated data for data exploration and cleansing. Invite team members to come together and solve organisation-wide problems faster by working on a secure cloud data operations platform. Schedule ETL pipelines to run automatically at pre-defined time intervals to make importing, cleaning, transforming, storing, and sharing data easier. Track and analyze key metrics of business with the help of AI & ML. Gain insights into business and monitor for potential issues or opportunities.
    Starting Price: $249 per month
  • 12
    Streamkap

    Streamkap

    Streamkap is a streaming data platform that makes streaming as easy as batch. Stream data from databases (change data capture) or event sources to your favorite database, data warehouse, or data lake. Streamkap can be deployed as SaaS or in a bring-your-own-cloud (BYOC) deployment.
    Starting Price: $600 per month
  • 13
    5X

    5X

    5X is an all-in-one data platform that provides everything you need to centralize, clean, model, and analyze your data. Designed to simplify data management, 5X offers seamless integration with over 500 data sources, ensuring uninterrupted data movement across all your systems with pre-built and custom connectors. The platform encompasses ingestion, warehousing, modeling, orchestration, and business intelligence, all rendered in an easy-to-use interface. 5X supports various data movements, including SaaS apps, databases, ERPs, and files, automatically and securely transferring data to data warehouses and lakes. With enterprise-grade security, 5X encrypts data at the source, identifying personally identifiable information and encrypting data at a column level. The platform is designed to reduce the total cost of ownership by 30% compared to building your own platform, enhancing productivity with a single interface to build end-to-end data pipelines.
    Starting Price: $350 per month
  • 14
    Fivetran

    Fivetran

    Fivetran is a leading data integration platform that centralizes an organization’s data from various sources to enable modern data infrastructure and drive innovation. It offers over 700 fully managed connectors to move data automatically, reliably, and securely from SaaS applications, databases, ERPs, and files to data warehouses and lakes. The platform supports real-time data syncs and scalable pipelines that fit evolving business needs. Trusted by global enterprises like Dropbox, JetBlue, and Pfizer, Fivetran helps accelerate analytics, AI workflows, and cloud migrations. It features robust security certifications including SOC 1 & 2, GDPR, HIPAA, and ISO 27001. Fivetran provides an easy-to-use, customizable platform that reduces engineering time and enables faster insights.
  • 15
    Snowplow Analytics

    Snowplow Analytics

    Snowplow is a best-in-class data collection platform built for Data Teams. With Snowplow you can collect rich, high-quality event data from all your platforms and products. Your data is available in real-time and is delivered to your data warehouse of choice where it can easily be joined with other data sets and used to power BI tools, custom reports or machine learning models. The Snowplow pipeline runs in your cloud account (AWS and/or GCP), giving you complete ownership of your data. Snowplow frees you to ask and answer any questions relevant to your business and use case, using your preferred tools and technologies.
  • 16
    Lyftrondata

    Lyftrondata

    Whether you want to build a governed delta lake or data warehouse, or simply want to migrate from your traditional database to a modern cloud data warehouse, do it all with Lyftrondata. Create and manage all of your data workloads on one platform by automatically building your pipeline and warehouse. Analyze your data instantly with ANSI SQL and BI/ML tools, and share it without writing any custom code. Boost the productivity of your data professionals and shorten your time to value. Define, categorize, and find all data sets in one place. Share these data sets with other experts with zero coding and drive data-driven insights. This data sharing ability is perfect for companies that want to store their data once, share it with other experts, and use it multiple times, now and in the future. Define datasets, apply SQL transformations, or simply migrate your SQL data processing logic to any cloud data warehouse.
  • 17
    RestApp

    RestApp

    RestApp is a No Code Data Activation platform that gives anyone an all-in-one solution to connect, model, and sync any data with their favorite tools. RestApp enables Data & Ops teams to activate data in minutes with no code by:
    - Connecting with your favorite databases and business apps
    - Modeling your data with drag-and-drop SQL, NoSQL and Python functions, then easily creating and sharing your queries with your teammates
    - Automatically syncing your data with your tools
    With RestApp's templates, you can:
    - Compute your main financial KPIs: churn rate, MRR, ARR, ACV, ARPU, LTV
    - Compute your customers’ lead scoring
    - Generate automatic cohort analysis
    Starting Price: Free
  • 18
    Numbers Station

    Numbers Station

    Accelerating insights, eliminating barriers for data analysts. Intelligent data stack automation, get insights from your data 10x faster with AI. Pioneered at the Stanford AI lab and now available to your enterprise, intelligence for the modern data stack has arrived. Use natural language to get value from your messy, complex, and siloed data in minutes. Tell your data your desired output, and immediately generate code for execution. Customizable automation of complex data tasks that are specific to your organization and not captured by templated solutions. Empower anyone to securely automate data-intensive workflows on the modern data stack, free data engineers from an endless backlog of requests. Arrive at insights in minutes, not months. Uniquely designed for you, tuned for your organization’s needs. Integrated with upstream and downstream tools, Snowflake, Databricks, Redshift, BigQuery, and more coming, built on dbt.
  • 19
    TiMi

    TIMi

    With TIMi, companies can capitalize on their corporate data to develop new ideas and make critical business decisions faster and more easily than ever before. TIMi’s Integrated Platform includes a real-time AUTO-ML engine, 3D VR segmentation and visualization, and unlimited self-service business intelligence. TIMi is several orders of magnitude faster than any other solution at the two most important analytical tasks: the handling of datasets (data cleaning, feature engineering, creation of KPIs) and predictive modeling. TIMi is an “ethical solution”: no “lock-in” situation, just excellence. We guarantee that you can work with peace of mind and without unexpected extra costs. Thanks to an original and unique software infrastructure, TIMi is optimized to offer you the greatest flexibility for the exploration phase and the highest reliability during the production phase. TIMi is the ultimate “playground” that allows your analysts to test the craziest ideas!
  • 20
    Microsoft Power Query
    Power Query is the easiest way to connect, extract, transform and load data from a wide range of sources. Power Query is a data transformation and data preparation engine. It comes with a graphical interface for getting data from sources and a Power Query Editor for applying transformations. Because the engine is available in many products and services, the destination where the data will be stored depends on where Power Query was used. Using Power Query, you can perform the extract, transform, and load (ETL) processing of data. Power Query is Microsoft’s data connectivity and data preparation technology that lets you seamlessly access data stored in hundreds of sources and reshape it to fit your needs, all with an easy-to-use, engaging, no-code experience. Power Query supports hundreds of data sources with built-in connectors, generic interfaces (such as REST APIs, ODBC, OLE DB, and OData) and the Power Query SDK to build your own connectors.
  • 21
    Polytomic

    Polytomic

    Sync data from your app database, data warehouse, spreadsheets, or even arbitrary APIs. No coding is required. See a live view of all the customer data you need right in Salesforce, Marketo, HubSpot, and other business systems. Automatically pipe combined data from any number of databases, data warehouses, spreadsheets, and APIs. Choose which fields to sync so you only get the data you care about. Integrate with all of your favorite tools at the click of a button. Use the point-and-click interface to sync the data you need from your databases and spreadsheets to your business applications. Give your customer success and sales teams a full view of all your customer data right from their sales CRM. Set up automatic syncs from your data warehouses and databases to all business systems and spreadsheets. See all proprietary user and company attributes automatically synced to your CRM. Give your support team an instant live view of the customer data they need right from their support system.
  • 22
    Unstructured

    Unstructured

    80% of enterprise data exists in difficult-to-use formats like HTML, PDF, CSV, PNG, PPTX, and more. Unstructured effortlessly extracts and transforms complex data for use with every major vector database and LLM framework. Unstructured allows data scientists to pre-process data at scale so they spend less time collecting and cleaning, and more time modeling and analyzing. Our enterprise-grade connectors capture data wherever it lives, so we can transform it into AI-friendly JSON files for companies who are eager to fold AI into their business. You can count on Unstructured to deliver data that's curated, clean of artifacts, and most importantly, LLM-ready.
  • 23
    TROCCO

    primeNumber Inc

    TROCCO is a fully managed modern data platform that enables users to integrate, transform, orchestrate, and manage their data from a single interface. It supports a wide range of connectors, including advertising platforms like Google Ads and Facebook Ads, cloud services such as AWS Cost Explorer and Google Analytics 4, various databases like MySQL and PostgreSQL, and data warehouses including Amazon Redshift and Google BigQuery. The platform offers features like Managed ETL, which allows for bulk importing of data sources and centralized ETL configuration management, eliminating the need to manually create ETL configurations individually. Additionally, TROCCO provides a data catalog that automatically retrieves metadata from data analysis infrastructure, generating a comprehensive catalog to promote data utilization. Users can also define workflows to create a series of tasks, setting the order and combination to streamline data processing.
  • 24
    Talend Data Catalog
    Talend Data Catalog gives your organization a single, secure point of control for your data. With robust tools for search and discovery, and connectors to extract metadata from virtually any data source, Data Catalog makes it easy to protect your data, govern your analytics, manage data pipelines, and accelerate your ETL processes. Data Catalog automatically crawls, profiles, organizes, links, and enriches all your metadata. Up to 80% of the information associated with the data is documented automatically and kept up-to-date through smart relationships and machine learning, continually delivering the most current data to the user. Make data governance a team sport with a secure single point of control where you can collaborate to improve data accessibility, accuracy, and business relevance. Support data privacy and regulatory compliance with intelligent data lineage tracing and compliance tracking.
  • 25
    DataNimbus

    DataNimbus

    DataNimbus is an AI-powered platform that streamlines payments and accelerates AI adoption through innovative, cost-efficient solutions. By seamlessly integrating with Databricks components like Spark, Unity Catalog, and ML Ops, DataNimbus enhances scalability, governance, and runtime operations. Its offerings include a visual designer, a marketplace for reusable connectors and machine learning blocks, and agile APIs, all designed to simplify workflows and drive data-driven innovation.
  • 26
    Precisely Connect
    Integrate data seamlessly from legacy systems into next-gen cloud and data platforms with one solution. Connect helps you take control of your data from mainframe to cloud. Integrate data through batch and real-time ingestion for advanced analytics, comprehensive machine learning and seamless data migration. Connect leverages the expertise Precisely has built over decades as a leader in mainframe sort and IBM i data availability and security to lead the industry in accessing and integrating complex data. Access to all your enterprise data for the most critical business projects is ensured by support for a wide range of sources and targets for all your ELT and CDC needs.